The purpose of this study is to examine, across multiple dimensions, the conditions under which large language model (LLM)-based content analysis can be applied, by task type, to the analysis of research methods in library and information science. To this end, 100 survey and interview studies published between 2020 and 2024 in four major Korean journals in library and information science were selected through stratified random sampling. The coding results produced by one human coder and four large language models (Claude-3.5-Haiku, GPT-4o-Mini, Gemini-2.0-Flash, and Grok-4-Latest) were compared across twelve dimensions of sampling methodology. Agreement was relatively high on dimensions that could be classified against explicit criteria, but consistently lower on dimensions requiring inferential or evaluative judgment. These findings suggest that the performance of LLM-based automated coding depends more on the decision structure of the task and the explicitness of the available information than on model capability itself. The scope of LLM application should therefore be assessed with closer attention to task type and the nature of the judgment required, and human-AI hybrid validation strategies need to be designed systematically.
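As an illustration of the human-versus-LLM comparison the abstract describes, the sketch below computes percent agreement and Cohen's kappa between one human coder and one model on a single coding dimension. The abstract does not name the agreement statistic actually used; kappa is assumed here as a common chance-corrected choice, and the dimension, category labels, and data are invented for illustration only.

    from collections import Counter

    def percent_agreement(a, b):
        """Share of items that receive the same code from both coders."""
        return sum(x == y for x, y in zip(a, b)) / len(a)

    def cohens_kappa(a, b):
        """Chance-corrected agreement: kappa = (p_o - p_e) / (1 - p_e)."""
        n = len(a)
        p_o = percent_agreement(a, b)
        ca, cb = Counter(a), Counter(b)
        # Expected chance agreement from each coder's marginal label frequencies.
        p_e = sum((ca[c] / n) * (cb[c] / n) for c in set(a) | set(b))
        return (p_o - p_e) / (1 - p_e) if p_e < 1 else 1.0

    # Invented labels for one explicit-criterion dimension, coded on ten studies.
    human = ["probability", "nonprob", "probability", "nonprob", "nonprob",
             "probability", "nonprob", "probability", "nonprob", "probability"]
    llm   = ["probability", "nonprob", "probability", "probability", "nonprob",
             "probability", "nonprob", "probability", "nonprob", "nonprob"]

    print(f"agreement = {percent_agreement(human, llm):.2f}, "
          f"kappa = {cohens_kappa(human, llm):.2f}")  # 0.80, 0.60

In a study like the one summarized here, such a computation would be repeated for each of the twelve dimensions and each human-model pair, which is what allows explicit-criterion dimensions to be contrasted with inferential or evaluative ones.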