
Abstract Background The objective of this study was to evaluate the concordance between therapeutic recommendations proposed by a multidisciplinary team meeting and those generated by a large language model (ChatGPT) for colorectal cancer. Although multidisciplinary teams represent the ‘standard’ for decision-making in cancer treatment, they require significant resources and may be susceptible to human bias. Artificial intelligence, particularly large language models such as ChatGPT, has the potential to enhance or optimize the decision-making processes. The present study examines the potential for integrating artificial intelligence into clinical practice by comparing multidisciplinary team decisions with those generated by ChatGPT. Methods A retrospective, single-centre study was conducted involving consecutive patients with newly diagnosed colorectal cancer discussed at our multidisciplinary team meeting. The pre- and post-therapeutic multidisciplinary team meeting recommendations were assessed for concordance compared with ChatGPT-4. Results One hundred consecutive patients with newly diagnosed colorectal cancer of all stages were included. In the pretherapeutic discussions, complete concordance was observed in 72.5%, with partial concordance in 10.2% and discordance in 17.3%. For post-therapeutic discussions, the concordance increased to 82.8%; 11.8% of decisions displayed partial concordance and 5.4% demonstrated discordance. Discordance was more frequent in patients older than 77 years and with an American Society of Anesthesiologists classification ≥ III. Conclusion There is substantial concordance between the recommendations generated by ChatGPT and those provided by traditional multidisciplinary team meetings, indicating the potential utility of artificial intelligence in supporting clinical decision-making for colorectal cancer management.
Male, Patient Care Team, Aged, 80 and over, Adult, Humans; Colorectal Neoplasms/therapy; Colorectal Neoplasms/diagnosis; Retrospective Studies; Artificial Intelligence; Female; Male; Aged; Middle Aged; Patient Care Team; Clinical Decision-Making/methods; Aged, 80 and over; Interdisciplinary Communication; Adult; Decision Making; Generative Artificial Intelligence, Clinical Decision-Making, Decision Making, Middle Aged, Generative Artificial Intelligence, Artificial Intelligence, Humans, Original Article, Female, Interdisciplinary Communication, Colorectal Neoplasms, Retrospective Studies, Aged
Male, Patient Care Team, Aged, 80 and over, Adult, Humans; Colorectal Neoplasms/therapy; Colorectal Neoplasms/diagnosis; Retrospective Studies; Artificial Intelligence; Female; Male; Aged; Middle Aged; Patient Care Team; Clinical Decision-Making/methods; Aged, 80 and over; Interdisciplinary Communication; Adult; Decision Making; Generative Artificial Intelligence, Clinical Decision-Making, Decision Making, Middle Aged, Generative Artificial Intelligence, Artificial Intelligence, Humans, Original Article, Female, Interdisciplinary Communication, Colorectal Neoplasms, Retrospective Studies, Aged
| selected citations These citations are derived from selected sources. This is an alternative to the "Influence" indicator, which also reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | 4 | |
| popularity This indicator reflects the "current" impact/attention (the "hype") of an article in the research community at large, based on the underlying citation network. | Top 10% | |
| influence This indicator reflects the overall/total impact of an article in the research community at large, based on the underlying citation network (diachronically). | Average | |
| impulse This indicator reflects the initial momentum of an article directly after its publication, based on the underlying citation network. | Average |
