Background and Aims: Inflammatory bowel diseases (IBD) are chronic conditions that can lead to a physical, social, and economic burden. Generative artificial intelligence (AI), particularly ChatGPT, gained attention for its potential to support medical practice. However, concerns remain about the reliability and consistency of its responses. This study systematically reviews the existing evidence on the role of generative AI in IBD. Materials and methods: We conducted a systematic literature review following PRISMA guidelines. Studies investigating generative AI in IBD care were identified through PubMed and Embase (Jan 2020–Sep 2024). Results: From 2875 records, 8 studies (2023–2024) met inclusion criteria: 5 on patient education, 2 on decision support, and 1 on research ideation. For patient education, ChatGPT provided clear and accurate responses, with accuracy reaching 84.2 % in a study, though sometimes lacked consistency. In decision support, ChatGPT's classifications of ulcerative colitis severity aligned with clinician assessments in 80 % of cases and in 87.8 % of cases for guideline-based dysplasia management. For research ideation, ChatGPT generated highly relevant (mean score: 4.9 ± 0.26) and clear (4.8 ± 0.41) questions, but lacked specificity (2.86/5) and originality (1.07/5). Conclusions: Generative AI shows promise in IBD care, but concerns about accuracy, consistency, and outdated information highlight the need for expert oversight before clinical integration.
Applications of generative artificial intelligence in inflammatory bowel disease: A systematic review
Lusetti F.
;Maimaris S.;La Rosa G. P.;Scalvini D.;Schiepatti A.;Biagi F.;
2025-01-01
Abstract
Background and Aims: Inflammatory bowel diseases (IBD) are chronic conditions that can lead to a physical, social, and economic burden. Generative artificial intelligence (AI), particularly ChatGPT, gained attention for its potential to support medical practice. However, concerns remain about the reliability and consistency of its responses. This study systematically reviews the existing evidence on the role of generative AI in IBD. Materials and methods: We conducted a systematic literature review following PRISMA guidelines. Studies investigating generative AI in IBD care were identified through PubMed and Embase (Jan 2020–Sep 2024). Results: From 2875 records, 8 studies (2023–2024) met inclusion criteria: 5 on patient education, 2 on decision support, and 1 on research ideation. For patient education, ChatGPT provided clear and accurate responses, with accuracy reaching 84.2 % in a study, though sometimes lacked consistency. In decision support, ChatGPT's classifications of ulcerative colitis severity aligned with clinician assessments in 80 % of cases and in 87.8 % of cases for guideline-based dysplasia management. For research ideation, ChatGPT generated highly relevant (mean score: 4.9 ± 0.26) and clear (4.8 ± 0.41) questions, but lacked specificity (2.86/5) and originality (1.07/5). Conclusions: Generative AI shows promise in IBD care, but concerns about accuracy, consistency, and outdated information highlight the need for expert oversight before clinical integration.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


