The chat records of diagnoses for China's first batch of rare disease catalog by ChatGPT-4o and the four LLMs

Wei, Zhong; YiFan, Liu; Yan, Liu; Kai, Yang; HuiMin, Gao; HuiHui, Yan; WenJing, Hao; YouSheng, Yan; Chenghong, Yin

Found an issue? Give us feedback

ZENODOarrow_drop_down

ZENODO

Dataset . 2025

License: CC BY

Data sources: ZENODO

ZENODO

Dataset . 2025

License: CC BY

Data sources: Datacite

ZENODO

Dataset . 2025

License: CC BY

Data sources: Datacite

The chat records of diagnoses for China's first batch of rare disease catalog by ChatGPT-4o and the four LLMs

Research datakeyboard_double_arrow_right Dataset 27 Feb 2025 English Publisher:Zenodo

Authors: Wei, Zhong; YiFan, Liu; Yan, Liu; Kai, Yang; HuiMin, Gao; HuiHui, Yan; WenJing, Hao; +2 Authors

doi: 10.5281/zenodo.14934294 , 10.5281/zenodo.14934293

The chat records of diagnoses for China's first batch of rare disease catalog by ChatGPT-4o and the four LLMs

- Summary
- Subjects
- Metrics

Abstract

This study aimed to evaluate the diagnostic accuracy of ChatGPT-4o and 4 open-source LLMs (qwen2.5:7b, Llama3.1:8b, qwen2.5:72b, and Llama3.1:70b) for rare diseases, assesses the language effect on diagnostic performance, and explore retrieval augmented generation (RAG) and chain-of-thought (CoT) reasoning. This supplementary material includes the first rare disease catalog of ChatGPT-4o in China and all diagnostic chat records for four LLMs in this study. This file contains a collection of 11 chat content. The LLM diagnostic order of cases can be found in Appendix 2 of Multimedia.

Related Organizations

Beijing Obstetrics and Gynecology Hospital
China (People's Republic of)
Capital Medical University
China (People's Republic of)

Keywords

Rare Diseases

Impact byBIP!

	selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	0
	popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network.	Average
	influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically).	Average
	impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network.	Average

Found an issue? Give us feedback

0

Average