001     1049995
005     20251219202234.0
024 7 _ |a 10.1038/s44220-025-00527-y
|2 doi
024 7 _ |a 10.34734/FZJ-2025-05709
|2 datacite_doi
037 _ _ |a FZJ-2025-05709
082 _ _ |a 610
100 1 _ |a Kambeitz, Joseph
|0 P:(DE-Juel1)188257
|b 0
|e Corresponding author
245 _ _ |a The empirical structure of psychopathology is represented in large language models
260 _ _ |a London
|c 2025
|b Nature Publishing Group UK
336 7 _ |a article
|2 DRIVER
336 7 _ |a Output Types/Journal article
|2 DataCite
336 7 _ |a Journal Article
|b journal
|m journal
|0 PUB:(DE-HGF)16
|s 1766153209_25560
|2 PUB:(DE-HGF)
336 7 _ |a ARTICLE
|2 BibTeX
336 7 _ |a JOURNAL_ARTICLE
|2 ORCID
336 7 _ |a Journal Article
|0 0
|2 EndNote
500 _ _ |a The original studies analyzed in this work were supported by the National Institute of Mental Health (Grant R01MH112612) to J.S. and the Deutsche Forschungsgemeinschaft (DFG) ET 31/7-1 to U.E. K.V. was supported within the project SIMSUB (Grant 01GP2215) of the German Ministery of Research, Technology and Space (BMFTR). The funders had no role in study design, data collection and analysis, decision to publish or preparation of the manuscript.
520 _ _ |a Clinical assessment and scientific research in psychiatry are largely based on questionnaires that are used to assess psychopathology. The development of large language models (LLMs) offers a new perspective for analysis of the language and terminology on which these questionnaires are based. We used state-of-the-art LLMs to derive numerical representations (‘text embeddings’) of the semantic and sentiment content of items from established questionnaires for the assessment of psychopathology. We compared the pairwise associations between empirical data from cross-sectional studies and text embeddings to test whether the empirical structure of psychopathology can be reconstructed by LLMs. Across four large-scale datasets (n = 1,555, n = 1,099, n = 11,807 and n = 39,755), we found a range of significant correlations between empirical item-pair associations and associations derived from text embeddings (r = 0.18 to r = 0.57, all P < 0.05). Random forest regression models based on semantic or sentiment embeddings predicted empirical item-pair associations with moderate to high accuracy (r = 0.33 to r = 0.81, all P < 0.05). Similarly, empirical clustering of items and grouping to established subdomain scores could be partly reconstructed by text embeddings. Our results demonstrate that LLMs are able to represent substantial components of the empirical structure of psychopathology. Consequently, the integration of LLMs into mental health research has the potential to unlock numerous promising avenues. These may encompass improving the process of developing questionnaires, optimizing generalizability and reducing the redundancy of existing questionnaires or facilitating the development of new conceptualizations of mental disorders.
536 _ _ |a 5251 - Multilevel Brain Organization and Variability (POF4-525)
|0 G:(DE-HGF)POF4-5251
|c POF4-525
|f POF IV
|x 0
588 _ _ |a Dataset connected to CrossRef, Journals: juser.fz-juelich.de
700 1 _ |a Schiffman, Jason
|0 P:(DE-HGF)0
|b 1
700 1 _ |a Kambeitz-Ilankovic, Lana
|0 P:(DE-HGF)0
|b 2
700 1 _ |a Mittal, Vijay A.
|0 P:(DE-HGF)0
|b 3
700 1 _ |a Ettinger, Ulrich
|0 0000-0002-0160-0281
|b 4
700 1 _ |a Vogeley, Kai
|0 P:(DE-Juel1)176404
|b 5
|u fzj
773 _ _ |a 10.1038/s44220-025-00527-y
|g Vol. 3, no. 12, p. 1482 - 1492
|0 PERI:(DE-600)3123130-5
|n 12
|p 1482 - 1492
|t Nature Mental Health
|v 3
|y 2025
|x 2731-6076
856 4 _ |u https://juser.fz-juelich.de/record/1049995/files/PDF.pdf
|y OpenAccess
909 C O |o oai:juser.fz-juelich.de:1049995
|p openaire
|p open_access
|p VDB
|p driver
|p dnbdelivery
910 1 _ |a Forschungszentrum Jülich
|0 I:(DE-588b)5008462-8
|k FZJ
|b 5
|6 P:(DE-Juel1)176404
913 1 _ |a DE-HGF
|b Key Technologies
|l Natural, Artificial and Cognitive Information Processing
|1 G:(DE-HGF)POF4-520
|0 G:(DE-HGF)POF4-525
|3 G:(DE-HGF)POF4
|2 G:(DE-HGF)POF4-500
|4 G:(DE-HGF)POF
|v Decoding Brain Organization and Dysfunction
|9 G:(DE-HGF)POF4-5251
|x 0
914 1 _ |y 2025
915 _ _ |a OpenAccess
|0 StatID:(DE-HGF)0510
|2 StatID
915 _ _ |a DBCoverage
|0 StatID:(DE-HGF)0300
|2 StatID
|b Medline
|d 2024-12-20
915 _ _ |a Creative Commons Attribution CC BY 4.0
|0 LIC:(DE-HGF)CCBY4
|2 HGFVOC
915 _ _ |a DEAL Nature
|0 StatID:(DE-HGF)3003
|2 StatID
|d 2024-12-20
|w ger
920 _ _ |l yes
920 1 _ |0 I:(DE-Juel1)INM-3-20090406
|k INM-3
|l Kognitive Neurowissenschaften
|x 0
980 _ _ |a journal
980 _ _ |a VDB
980 _ _ |a UNRESTRICTED
980 _ _ |a I:(DE-Juel1)INM-3-20090406
980 1 _ |a FullTexts


LibraryCollectionCLSMajorCLSMinorLanguageAuthor
Marc 21