Evaluation Guidelines for Empirical Studies involving LLMs

Towards community guidelines for empirical studies in software engineering involving LLMs.

This website hosts a draft of community guidelines for empirical studies in software engineering involving LLMs. We present a first taxonomy of study types and corresponding guidelines.

The current draft is based on a position paper and on discussions at the ISERN 2024 meeting and the 2nd Copenhagen Symposium on Human-Centered Software Engineering AI. To contribute to the guidelines, you can open an issue or a pull request in our GitHub repository.

Workstream Leads:

Team: