Echoes in AI: Quantifying lack of plot diversity in LLM outputs.

Weijia Xu , Nebojsa Jojic , Sudha Rao , Chris Brockett , Bill Dolan

Proc Natl Acad Sci U S A

Microsoft Research, Redmond, WA 98052.

Published: September 2025

Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

With rapid advances in large language models (LLMs), there has been an increasing application of LLMs in creative content ideation and generation. A critical question emerges: can current LLMs provide ideas that are diverse enough to truly bolster collective creativity? We examine two state-of-the-art LLMs, GPT-4 and LLaMA-3, on story generation and discover that LLM-generated stories often consist of plot elements that are echoed across a number of generations. To quantify this phenomenon, we introduce the score, an automatic metric that measures the uniqueness of a plot element among alternative storylines generated using the same prompt under an LLM. Evaluating on 100 short stories, we find that LLM-generated stories often contain combinations of idiosyncratic plot elements echoed frequently across generations and across different LLMs, while plots from the original human-written stories are rarely recreated or even echoed in pieces. Moreover, our human evaluation shows that the ranking of scores among story segments correlates moderately with human judgment of surprise level, even though score computation is completely automatic without relying on human judgment.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC12415252	PMC
http://dx.doi.org/10.1073/pnas.2504966122	DOI Listing

Publication Analysis

Top Keywords

llm-generated stories

plot elements

elements echoed

human judgment

llms

echoes quantifying

quantifying lack

plot

lack plot

plot diversity

Similar Publications

Echoes in AI: Quantifying lack of plot diversity in LLM outputs.

Proc Natl Acad Sci U S A

September 2025

Microsoft Research, Redmond, WA 98052.

Weijia Xu , Nebojsa Jojic , Sudha Rao , Chris Brockett , Bill Dolan

View Article and Find Full Text PDF

Similar Publications

Gender Representation of Health Care Professionals in Large Language Model-Generated Stories.

JAMA Netw Open

September 2024

College of Medicine and Public Health, Flinders University, Adelaide, South Australia, Australia.

Bradley D Menz , Nicole M Kuderer , Benjamin Chin-Yee , Jessica M Logan , Andrew Rowland

Importance: With the growing use of large language models (LLMs) in education and health care settings, it is important to ensure that the information they generate is diverse and equitable, to avoid reinforcing or creating stereotypes that may influence the aspirations of upcoming generations.

Objective: To evaluate the gender representation of LLM-generated stories involving medical doctors, surgeons, and nurses and to investigate the association of varying personality and professional seniority descriptors with the gender proportions for these professions.

Design, Setting, And Participants: This is a cross-sectional simulation study of publicly accessible LLMs, accessed from December 2023 to January 2024.

View Article and Find Full Text PDF

Similar Publications