A Primer for Evaluating Large Language Models in Social-Science Research

Abdurahman, Suhaib; Ziabari, Alireza Salkhordeh; Moore, Alexander K.; Bartels, Daniel M.; Dehghani, Morteza

doi:10.6082/na808-pm811

Published April 16, 2025 | Version v1

Journal article Open

1. University of Southern California
2. University of Illinois Chicago
3. University of Chicago

Autoregressive large language models (LLMs) exhibit remarkable conversational and reasoning abilities and exceptional flexibility across a wide range of tasks. Consequently, LLMs are increasingly used in scientific research to analyze data, generate synthetic data, or even write scientific articles. This trend requires that authors follow best practices for conducting and reporting LLM research and that journal reviewers be able to evaluate the quality of work that uses LLMs. We provide authors of social-scientific research with essential recommendations for obtaining replicable and robust results with LLMs. Our recommendations also highlight considerations for reviewers, focusing on methodological rigor, replicability, and validity of results when evaluating studies that use LLMs to automate data processing or simulate human data. We offer practical advice on assessing the appropriateness of LLM applications in submitted studies, emphasizing the need for transparency in methodological reporting and the challenges posed by the nondeterministic and continuously evolving nature of these models. By providing a framework for best practices and critical review, this primer aims to ensure high-quality, innovative research in the evolving landscape of social-science studies that use LLMs.

Files

Files (376.3 kB)

Name	Size
Primer-for-Evaluating-Large-Language-Models-in-Social-Science-Research.pdf md5:b41988e0183bc3be8b2600fe358484e6	376.3 kB

Additional details

DOI: 10.1177/25152459251325174
Other: oai:uchicago.tind.io:14892

Funding
Defense Advanced Research Projects Agency: Influence Campaign Awareness and Sensemaking
Air Force Office of Scientific Research: A9550-23-1-0463

Division(s): Booth School of Business
Department(s): Marketing

Resource type

Journal article

Publisher

University of Chicago

Published in

Advances in Methods and Practices in Psychological Science, 2025.

Languages

English

License

Creative Commons Attribution Non Commercial 4.0 International

Copyrights

© The Author(s) 2025. This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial use, reproduction, and distribution of the work without further permission, provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).

Technical metadata

Created: May 22, 2026
Modified: May 22, 2026
