| 001 | 1049995 | ||
| 005 | 20251219202234.0 | ||
| 024 | 7 | _ | |a 10.1038/s44220-025-00527-y |2 doi |
| 024 | 7 | _ | |a 10.34734/FZJ-2025-05709 |2 datacite_doi |
| 037 | _ | _ | |a FZJ-2025-05709 |
| 082 | _ | _ | |a 610 |
| 100 | 1 | _ | |a Kambeitz, Joseph |0 P:(DE-Juel1)188257 |b 0 |e Corresponding author |
| 245 | _ | _ | |a The empirical structure of psychopathology is represented in large language models |
| 260 | _ | _ | |a London |c 2025 |b Nature Publishing Group UK |
| 336 | 7 | _ | |a article |2 DRIVER |
| 336 | 7 | _ | |a Output Types/Journal article |2 DataCite |
| 336 | 7 | _ | |a Journal Article |b journal |m journal |0 PUB:(DE-HGF)16 |s 1766153209_25560 |2 PUB:(DE-HGF) |
| 336 | 7 | _ | |a ARTICLE |2 BibTeX |
| 336 | 7 | _ | |a JOURNAL_ARTICLE |2 ORCID |
| 336 | 7 | _ | |a Journal Article |0 0 |2 EndNote |
| 500 | _ | _ | |a The original studies analyzed in this work were supported by the National Institute of Mental Health (Grant R01MH112612) to J.S. and the Deutsche Forschungsgemeinschaft (DFG) ET 31/7-1 to U.E. K.V. was supported within the project SIMSUB (Grant 01GP2215) of the German Ministry of Research, Technology and Space (BMFTR). The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript. |
| 520 | _ | _ | |a Clinical assessment and scientific research in psychiatry are largely based on questionnaires that are used to assess psychopathology. The development of large language models (LLMs) offers a new perspective for analysis of the language and terminology on which these questionnaires are based. We used state-of-the-art LLMs to derive numerical representations (‘text embeddings’) of the semantic and sentiment content of items from established questionnaires for the assessment of psychopathology. We compared the pairwise associations between empirical data from cross-sectional studies and text embeddings to test whether the empirical structure of psychopathology can be reconstructed by LLMs. Across four large-scale datasets (n = 1,555, n = 1,099, n = 11,807 and n = 39,755), we found a range of significant correlations between empirical item-pair associations and associations derived from text embeddings (r = 0.18 to r = 0.57, all P < 0.05). Random forest regression models based on semantic or sentiment embeddings predicted empirical item-pair associations with moderate to high accuracy (r = 0.33 to r = 0.81, all P < 0.05). Similarly, empirical clustering of items and grouping to established subdomain scores could be partly reconstructed by text embeddings. Our results demonstrate that LLMs are able to represent substantial components of the empirical structure of psychopathology. Consequently, the integration of LLMs into mental health research has the potential to unlock numerous promising avenues. These may encompass improving the process of developing questionnaires, optimizing generalizability and reducing the redundancy of existing questionnaires or facilitating the development of new conceptualizations of mental disorders. |
| 536 | _ | _ | |a 5251 - Multilevel Brain Organization and Variability (POF4-525) |0 G:(DE-HGF)POF4-5251 |c POF4-525 |f POF IV |x 0 |
| 588 | _ | _ | |a Dataset connected to CrossRef, Journals: juser.fz-juelich.de |
| 700 | 1 | _ | |a Schiffman, Jason |0 P:(DE-HGF)0 |b 1 |
| 700 | 1 | _ | |a Kambeitz-Ilankovic, Lana |0 P:(DE-HGF)0 |b 2 |
| 700 | 1 | _ | |a Mittal, Vijay A. |0 P:(DE-HGF)0 |b 3 |
| 700 | 1 | _ | |a Ettinger, Ulrich |0 0000-0002-0160-0281 |b 4 |
| 700 | 1 | _ | |a Vogeley, Kai |0 P:(DE-Juel1)176404 |b 5 |u fzj |
| 773 | _ | _ | |a 10.1038/s44220-025-00527-y |g Vol. 3, no. 12, p. 1482 - 1492 |0 PERI:(DE-600)3123130-5 |n 12 |p 1482 - 1492 |t Nature Mental Health |v 3 |y 2025 |x 2731-6076 |
| 856 | 4 | _ | |u https://juser.fz-juelich.de/record/1049995/files/PDF.pdf |y OpenAccess |
| 909 | C | O | |o oai:juser.fz-juelich.de:1049995 |p openaire |p open_access |p VDB |p driver |p dnbdelivery |
| 910 | 1 | _ | |a Forschungszentrum Jülich |0 I:(DE-588b)5008462-8 |k FZJ |b 5 |6 P:(DE-Juel1)176404 |
| 913 | 1 | _ | |a DE-HGF |b Key Technologies |l Natural, Artificial and Cognitive Information Processing |1 G:(DE-HGF)POF4-520 |0 G:(DE-HGF)POF4-525 |3 G:(DE-HGF)POF4 |2 G:(DE-HGF)POF4-500 |4 G:(DE-HGF)POF |v Decoding Brain Organization and Dysfunction |9 G:(DE-HGF)POF4-5251 |x 0 |
| 914 | 1 | _ | |y 2025 |
| 915 | _ | _ | |a OpenAccess |0 StatID:(DE-HGF)0510 |2 StatID |
| 915 | _ | _ | |a DBCoverage |0 StatID:(DE-HGF)0300 |2 StatID |b Medline |d 2024-12-20 |
| 915 | _ | _ | |a Creative Commons Attribution CC BY 4.0 |0 LIC:(DE-HGF)CCBY4 |2 HGFVOC |
| 915 | _ | _ | |a DEAL Nature |0 StatID:(DE-HGF)3003 |2 StatID |d 2024-12-20 |w ger |
| 920 | _ | _ | |l yes |
| 920 | 1 | _ | |0 I:(DE-Juel1)INM-3-20090406 |k INM-3 |l Kognitive Neurowissenschaften |x 0 |
| 980 | _ | _ | |a journal |
| 980 | _ | _ | |a VDB |
| 980 | _ | _ | |a UNRESTRICTED |
| 980 | _ | _ | |a I:(DE-Juel1)INM-3-20090406 |
| 980 | 1 | _ | |a FullTexts |