Contribution Summary
The paper demonstrates a scalable workflow for AI-assisted qualitative codebook development in student feedback research.
Draft enrichment generated from extracted publication text; pending human review.
Plain-Language Summary
This paper presents an NLP and generative text workflow for creating qualitative codebooks from student evaluations of teaching. The workflow extracts ideas from thousands of comments, embeds and clusters them, and uses a generative model to summarize candidate codes for researcher review.
Research Question
How can NLP and generative text models support qualitative codebook generation for large collections of student evaluations of teaching?
Methods
- Analyzed a corpus of 5,000 student evaluations of teaching from undergraduate science and engineering courses.
- Used an extract, embed, cluster, and summarize workflow to identify semantically related ideas and generate candidate code labels.
- Compared the generated codebook with prior human-generated categories and educational frameworks.
Key Findings
- The workflow produced a detailed codebook with substantial overlap with human-generated categories.
- The approach surfaced granular themes across a larger dataset than would typically be feasible through first-pass manual coding alone.
- The paper emphasizes that researchers still need to judge relevance, redundancy, abstraction level, and saturation.
Implications
Large-scale student feedback analysis can become more tractable when NLP workflows are paired with researcher interpretation.
Local or open-source model choices can help protect sensitive educational data.
Generated codebooks should be treated as analytic starting points requiring human review, not finished qualitative findings.
Research Artifacts
Abstract
Publication on Using Generative Text Models to Create Qualitative Codebooks for Student Evaluations of Teaching
Related Projects
Using Large Language Models and Generative AI to Scale Qualitative Data Analysis
How can researchers combine qualitative judgment with open-source generative AI to scale thematic analysis without hiding methodological choices?
CAREER: Minds and Machines: Exploring Engineering Faculty Member Mental Models of Generative AI and Instructional Decisions
How do engineering faculty understand generative AI, and how do those mental models shape instructional decisions?
EAGER: Natural Language Processing for Teaching and Research in Engineering Education (NLPTREE)
How can NLP methods help engineering education researchers and instructors analyze text-rich learning data responsibly and at scale?