Powered by OpenAIRE graph
Found an issue? Give us feedback
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/ ZENODOarrow_drop_down
image/svg+xml art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos Open Access logo, converted into svg, designed by PLoS. This version with transparent background. http://commons.wikimedia.org/wiki/File:Open_Access_logo_PLoS_white.svg art designer at PLoS, modified by Wikipedia users Nina, Beao, JakobVoss, and AnonMoos http://www.plos.org/
ZENODO
Article . 2025
License: CC BY
Data sources: ZENODO
ZENODO
Article . 2025
License: CC BY
Data sources: Datacite
ZENODO
Article . 2025
License: CC BY
Data sources: Datacite
versions View all 2 versions
addClaim

Readability Assessment of Large Language Model Responses to Common Postnatal Questions Among Migrant Parents: A Comparative Analysis of ChatGPT and Gemini

Authors: Dönger, Utku; Doğan, Ahmet can;

Readability Assessment of Large Language Model Responses to Common Postnatal Questions Among Migrant Parents: A Comparative Analysis of ChatGPT and Gemini

Abstract

Language and communication barriers in migrant communities can substantially hinder access to health information, particularly regarding frequently asked questions in the neonatal period. Large language models (LLMs) have emerged as potential tools to address this information gap; however, the readability of their outputs is critically important from a public health perspective. This study comparatively evaluated the readability of responses generated by ChatGPT and Google Gemini to ten core postnatal questions commonly posed by migrant parents to pediatricians. The ten questions were translated into English and repeatedly submitted to both models on different days and in varying orders. The resulting texts were analyzed using multiple readability tests, including the Automated Readability Index, Flesch Reading Ease, Flesch–Kincaid Grade Level, SMOG Index, and Linsear Write Formula. The findings indicate that Gemini systematically produced more complex texts requiring a higher literacy level in most indices. In contrast, ChatGPT yielded more easily understandable content on scales closely related to health literacy, such as Flesch Reading Ease and SMOG. Overall, both models show potential to support access to health information among migrant populations; however, the complexity of their outputs remains above recommended public health literacy levels, highlighting the need for further optimization and text simplification, particularly for groups with low health literacy.

Related Organizations
Keywords

Postnatal Care, Migrant health, Large language models, Readability

  • BIP!
    Impact byBIP!
    selected citations
    These citations are derived from selected sources.
    This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    0
    popularity
    This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
    Average
    influence
    This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
    Average
    impulse
    This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
    Average
Powered by OpenAIRE graph
Found an issue? Give us feedback
selected citations
These citations are derived from selected sources.
This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Citations provided by BIP!
popularity
This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.
BIP!Popularity provided by BIP!
influence
This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).
BIP!Influence provided by BIP!
impulse
This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.
BIP!Impulse provided by BIP!
0
Average
Average
Average