Journal Article

Using Generative Text Models to Create Qualitative Codebooks for Student Evaluations of Teaching

Andrew Katz, Mitch Gerhardt, Michelle Soledad

International Journal of Qualitative Methods2024Featured

Contribution Summary

The paper demonstrates a scalable workflow for AI-assisted qualitative codebook development in student feedback research.

Draft enrichment generated from extracted publication text; pending human review.

Plain-Language Summary

This paper presents an NLP and generative text workflow for creating qualitative codebooks from student evaluations of teaching. The workflow extracts ideas from thousands of comments, embeds and clusters them, and uses a generative model to summarize candidate codes for researcher review.

Research Question

How can NLP and generative text models support qualitative codebook generation for large collections of student evaluations of teaching?

Methods

  • Analyzed a corpus of 5,000 student evaluations of teaching from undergraduate science and engineering courses.
  • Used an extract, embed, cluster, and summarize workflow to identify semantically related ideas and generate candidate code labels.
  • Compared the generated codebook with prior human-generated categories and educational frameworks.

Key Findings

  • The workflow produced a detailed codebook with substantial overlap with human-generated categories.
  • The approach surfaced granular themes across a larger dataset than would typically be feasible through first-pass manual coding alone.
  • The paper emphasizes that researchers still need to judge relevance, redundancy, abstraction level, and saturation.

Implications

Large-scale student feedback analysis can become more tractable when NLP workflows are paired with researcher interpretation.

Local or open-source model choices can help protect sensitive educational data.

Generated codebooks should be treated as analytic starting points requiring human review, not finished qualitative findings.

Research Artifacts

protocolExtract-embed-cluster-summarize workflowA workflow for moving from student evaluation comments to candidate qualitative codes using embeddings, clustering, and generative summaries.
appendixPrompt examplesAppendix materials document prompts used to generate and refine codebook elements.

Abstract

Publication on Using Generative Text Models to Create Qualitative Codebooks for Student Evaluations of Teaching

Related Projects