Using Generative Text Models to Create Qualitative Codebooks for Student Evaluations of Teaching

Contribution Summary

The paper demonstrates a scalable workflow for AI-assisted qualitative codebook development in student feedback research.

Draft enrichment generated from extracted publication text; pending human review.

Plain-Language Summary

This paper presents an NLP and generative text workflow for creating qualitative codebooks from student evaluations of teaching. The workflow extracts ideas from thousands of comments, embeds and clusters them, and uses a generative model to summarize candidate codes for researcher review.

Research Question

How can NLP and generative text models support qualitative codebook generation for large collections of student evaluations of teaching?

Methods

Analyzed a corpus of 5,000 student evaluations of teaching from undergraduate science and engineering courses.
Used an extract, embed, cluster, and summarize workflow to identify semantically related ideas and generate candidate code labels.
Compared the generated codebook with prior human-generated categories and educational frameworks.
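
The extract, embed, cluster, and summarize steps listed above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the paper's implementation: the bag-of-words `embed` function stands in for a neural sentence-embedding model, the greedy threshold-based `cluster` function stands in for whatever clustering algorithm the authors used, and `summarize` is a placeholder that returns frequent terms where the workflow would call a generative model. All function names, the stopword list, and the 0.3 threshold are hypothetical.

```python
import math
import re
from collections import Counter

# Hypothetical stopword list, kept tiny for illustration.
STOPWORDS = {"the", "a", "an", "and", "was", "were", "is", "to", "of", "with"}


def extract_ideas(comment):
    """Extract: split a comment into idea-sized units (here, simple sentences)."""
    parts = re.split(r"[.;!?]", comment.lower())
    return [p.strip() for p in parts if p.strip()]


def embed(idea):
    """Embed: toy bag-of-words vector (assumption: stands in for a neural embedding)."""
    words = re.findall(r"[a-z']+", idea.lower())
    return Counter(w for w in words if w not in STOPWORDS)


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[t] * v[t] for t in u if t in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def cluster(ideas, threshold=0.3):
    """Cluster: greedy single pass; join the most similar cluster or start a new one."""
    clusters = []
    for idea in ideas:
        vec = embed(idea)
        best, best_sim = None, threshold
        for c in clusters:
            sim = cosine(vec, c["centroid"])
            if sim >= best_sim:
                best, best_sim = c, sim
        if best is None:
            clusters.append({"centroid": Counter(vec), "ideas": [idea]})
        else:
            best["centroid"].update(vec)  # running sum of member vectors
            best["ideas"].append(idea)
    return clusters


def summarize(c, n_terms=2):
    """Summarize: placeholder for a generative model; returns the top terms as a label."""
    return " / ".join(term for term, _ in c["centroid"].most_common(n_terms))


def build_candidate_codebook(comments):
    """Run all four steps and return candidate codes for researcher review."""
    ideas = [idea for comment in comments for idea in extract_ideas(comment)]
    return [{"label": summarize(c), "examples": c["ideas"]} for c in cluster(ideas)]


if __name__ == "__main__":
    sample = [
        "The lectures were clear. Homework took too long.",
        "Clear lectures with good examples.",
        "The homework was too long!",
    ]
    for code in build_candidate_codebook(sample):
        print(code["label"], "->", code["examples"])
```

The output is a list of candidate codes with example ideas attached, which is the form a researcher would review for relevance, redundancy, and abstraction level.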

Key Findings

The workflow produced a detailed codebook with substantial overlap with human-generated categories.
The approach surfaced granular themes across a larger dataset than would typically be feasible through first-pass manual coding alone.
The paper emphasizes that researchers still need to judge relevance, redundancy, abstraction level, and saturation.
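
One rough way to screen for overlap between generated codes and human-generated categories is a lexical match on the labels. The sketch below is an assumption for illustration only: this summary does not describe how the paper measured overlap, the labels are invented toy examples, and the function names and 0.5 threshold are hypothetical. A lexical score can only flag candidate matches; deciding whether two codes mean the same thing remains a researcher judgment.

```python
import re


def label_tokens(label):
    """Lowercased word set for a code label."""
    return set(re.findall(r"[a-z]+", label.lower()))


def jaccard(a, b):
    """Jaccard similarity between the word sets of two labels."""
    ta, tb = label_tokens(a), label_tokens(b)
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0


def match_codes(generated, human, threshold=0.5):
    """Map each human category to its best generated code, if similar enough.

    Returns the matches and the share of human categories that were matched.
    """
    matches = {}
    for h in human:
        best = max(generated, key=lambda g: jaccard(g, h), default=None)
        if best is not None and jaccard(best, h) >= threshold:
            matches[h] = best
    coverage = len(matches) / len(human) if human else 0.0
    return matches, coverage


if __name__ == "__main__":
    # Invented labels, not taken from the paper.
    generated = ["lecture clarity", "homework workload", "exam fairness"]
    human = ["clarity of lecture", "workload", "instructor availability"]
    matches, coverage = match_codes(generated, human)
    print(matches)
    print(f"coverage: {coverage:.2f}")
```

Unmatched human categories and unmatched generated codes are both worth inspecting: the former may indicate themes the workflow missed, and the latter may be the more granular themes that a larger dataset surfaces.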

Implications

Large-scale student feedback analysis can become more tractable when NLP workflows are paired with researcher interpretation.

Local or open-source model choices can help protect sensitive educational data.

Generated codebooks should be treated as analytic starting points requiring human review, not finished qualitative findings.

Research Artifacts

Protocol: Extract-embed-cluster-summarize workflow. A workflow for moving from student evaluation comments to candidate qualitative codes using embeddings, clustering, and generative summaries.

Appendix: Prompt examples. Appendix materials document prompts used to generate and refine codebook elements.

Abstract

Publication on Using Generative Text Models to Create Qualitative Codebooks for Student Evaluations of Teaching

Related Projects

Using Large Language Models and Generative AI to Scale Qualitative Data Analysis

How can researchers combine qualitative judgment with open-source generative AI to scale thematic analysis without hiding methodological choices?

Project

CAREER: Minds and Machines: Exploring Engineering Faculty Member Mental Models of Generative AI and Instructional Decisions

How do engineering faculty understand generative AI, and how do those mental models shape instructional decisions?

Project

EAGER: Natural Language Processing for Teaching and Research in Engineering Education (NLPTREE)

How can NLP methods help engineering education researchers and instructors analyze text-rich learning data responsibly and at scale?

Project
