TY - JOUR
T1 - From diagnostics to education: Multi-domain evaluation of LLM chatbots in neurology
AU - Battineni, Gopi
AU - Chintalapudi, Nalini
AU - Dhulipalla, Venkata R.
AU - Amenta, Francesco
N1 - Publisher Copyright:
© 2025 The Authors.
PY - 2026/2
Y1 - 2026/2
N2 - Objectives The development of large language models (LLMs) has shown promising results in enhancing research processes, data analysis, and communication in various domains of neurology. In this work, we systematically review and synthesize current evidence on the applications of LLMs in the assessment, diagnosis, and monitoring of neurological disorders. Methods Three databases, namely PubMed, Scopus, and Web of Science, were considered for document search. Article selection was according to PRISMA guidelines, and Newcastle–Ottawa Scale (NOS) was used to assess the article quality based on relevance, quality, and applicability. Results Nine studies were included in the final analysis. Based on the findings, LLMs have been utilized in diverse areas of neuroscience including hypothesis generation, clinical decision support, and cognitive modeling. LLMs can process large datasets, identify trends, and support personalized medicine. However, challenges such as interpretability, ethical considerations, and domain-specific training remain critical. Conclusions By facilitating workflows and uncovering new insights, LLMs can revolutionize different domains of neurology. Nevertheless, further research on their reliability, ethical implications, and adaptation to the unique demands of neuroscience is needed.
AB - Objectives The development of large language models (LLMs) has shown promising results in enhancing research processes, data analysis, and communication in various domains of neurology. In this work, we systematically review and synthesize current evidence on the applications of LLMs in the assessment, diagnosis, and monitoring of neurological disorders. Methods Three databases, namely PubMed, Scopus, and Web of Science, were considered for document search. Article selection was according to PRISMA guidelines, and Newcastle–Ottawa Scale (NOS) was used to assess the article quality based on relevance, quality, and applicability. Results Nine studies were included in the final analysis. Based on the findings, LLMs have been utilized in diverse areas of neuroscience including hypothesis generation, clinical decision support, and cognitive modeling. LLMs can process large datasets, identify trends, and support personalized medicine. However, challenges such as interpretability, ethical considerations, and domain-specific training remain critical. Conclusions By facilitating workflows and uncovering new insights, LLMs can revolutionize different domains of neurology. Nevertheless, further research on their reliability, ethical implications, and adaptation to the unique demands of neuroscience is needed.
KW - ChatGPT4
KW - Ethical concerns
KW - Large language model
KW - Neurology
KW - Specialist examinations
UR - https://www.scopus.com/pages/publications/105025896870
U2 - 10.1016/j.jtumed.2025.11.004
DO - 10.1016/j.jtumed.2025.11.004
M3 - Review article
AN - SCOPUS:105025896870
SN - 1658-3612
VL - 21
SP - 15
EP - 24
JO - Journal of Taibah University Medical Sciences
JF - Journal of Taibah University Medical Sciences
IS - 1
ER -