Are language features associated with psychosis risk universal? A study in Mandarin-speaking youths at clinical high risk for psychosis

WORLD PSYCHIATRY(2023)

引用 0|浏览3
暂无评分
摘要
Natural language processing (NLP) analyses have shown de­creas­ed coherence (tangentiality, derailment) and complexity (poverty of content) in schizophrenia and in clinical high risk (CHR) states for psychosis. We reported previously in this journal1 that an NLP machine learning classifier, which included mea­sures of coherence and complexity, predicted psychosis onset in two inde­pendent English-speaking CHR samples. Moreover, reduced complexity has been associated with increased pauses and negative symptoms in at-risk youths2. Multiple recent NLP studies in schizophrenia and CHR cohorts, using different methods, have largely found this same pattern of disturbance in the structure of language and speech3. Most of these studies have been conducted in English, with notable exceptions including Dutch, Portuguese and Spanish4. It remains unknown, however, whether NLP findings obtained in English or other Indo-European languages would generalize to less similar languages, such as Mandarin, which has very different grammatical and prosodic conventions. This study included 20 help-seeking CHR youth and 25 healthy controls who were recruited as part of the US National Institute of Mental Health (NIMH)-funded Shanghai-At-Risk for Psychosis (SHARP) study at the Shanghai Mental Health Center, where institutional review board approval was obtained. Caseness and symptoms were determined using the Structured Interview for Psychosis-Risk Syndromes (SIPS)5. Subjects were Han Chinese and spoke Mandarin fluently, and they provided informed consent. Sex distribution was similar between CHR subjects and controls (55% vs. 48% female), but CHR subjects were younger (19.6±6.4 vs. 24.9±1.9 years) and had less education (11.4±4.0 vs. 16.7±1.4 years). Interviews were approximately 30 min in length, and were based on qualitative methods previously described6. They were transcribed verbatim in Mandarin and translated into English using Google Translate, with verification by bilingual researchers. Audio recordings were diarized (segmented by speaker using time stamps from transcription) so that acoustic analyses could be done of subjects' speech. NLP features analyzed for both English and Mandarin included coherence, complexity, and sentiment (i.e., emotional valence – positive, negative, neutral), as reported previously1, 7. For English NLP only, sentiment also included anger, fear, sadness, joy and disgust; frequency of wh-words (e.g., “which’) was also assessed. For Mandarin NLP only, frequency of measure words, possessives, and localizers (e.g., gongzuo-shang, “during work”; or liangge-ren-zhijian, “between two people”) was also calculated8. Acoustic features analyzed in Mandarin included those characteristic of schizophrenia or CHR states among English-speaking subjects, including abnormal pauses, flat intonation, voice breaks, and pitch variation7. All features were corrected for age and education by apply­ing regression coefficients from healthy controls, and highly cor­related features were removed from analysis. Machine learning classification was done using random forest and support vector machines (SVM) for Mandarin NLP, English NLP, and acoustics, with each experiment repeated 20 times, identifying the top five features of each model. Associations between linguistic features (cross-language analysis) and with symptoms (symptom inference) were also tested (see also supplementary information). Each of the three SVM machine learning classifiers showed high accuracy in discriminating spoken language in CHR subjects from that of healthy controls: English-specific NLP (95%), Mandarin-specific NLP (94%), and acoustic analysis (88%), with similar results for random forest. Top features for the English-specific NLP machine learning were wh-word and noun use (greater in CHR), and coherence, adjective use and adverb use (all less in CHR). Top features for the Mandarin-specific NLP machine learning were localizer use (greater in CHR), and positive sentiment, two metrics of coherence, and adjective use (all decreased in CHR). Of note, features common to the NLP machine learning for both languages were highly correlated, specifically coherence (r=.70) and adjective use (r=.60). For acoustics, the top features in the machine learning classifier were two pause metrics, and three indices of acoustic quality: chroma #11 (timbre/quality), bandwidth formant #1 (dysphonia/hoarseness), and spectral spread (energy – decibels/frequency). Of note, only acoustic features were significantly associated with symptoms (negative: r=0.69, p=8E–4, positive: r=0.49, p=3E–2) (see also supplementary information). Several important findings emerge from this proof-of-principle study. First, in Mandarin, spoken language can differentiate CHR subjects from healthy controls with high accuracy, using either linguistic or acoustic features. Second, the application of English-specific NLP to transcripts translated from Mandarin has utility, as there was comparable accuracy for both the English-specific and Mandarin-specific NLP. Further, there was overlap in top features in the two NLP classifiers, specifically decreases in adjective use and coherence, with both of these features highly correlated across the two languages, suggesting that these key metrics survive translation. Nonetheless, the application of Mandarin-specific NLP allowed the identification of a key linguistic feature that would not be captured otherwise – the increased use of localizers among CHR subjects – which may reflect concreteness or increased use of idioms; this is a new finding that merits replication and further investigation. Finally, the acoustic classifier, in addition to having high accuracy, identified features similar to those found in English-speaking CHR and schizophrenia cohorts, including abnormal pause behavior, and indices of voice quality and energy. As in prior studies, acoustic features were associated with symptoms, in particular negative symptoms. This study is the first to use natural language processing and acoustic analyses to characterize spoken language among native Mandarin speakers in China identified as at clinical high risk for psychosis. Our findings support the idea that there may be universal features of spoken language disturbance across psychosis and its risk states, particularly in respect to reduced coherence, but also word usage and pause behavior that may index reduced complexity. Yet, our study also shows that there are language-spe­cific features characteristic of psychosis risk, suggesting that it is essential to also analyze spoken language using language-specific NLP methods. This is a small proof-of-principle study with the potential confounds of age and education, and none of the classifiers generated were cross-validated in a second cohort. Therefore, these findings should be investigated and replicated in a larger cohort of Mandarin-speaking CHR subjects and healthy controls who are more similar in demographics. More broadly, future studies should include a similar heuristic of using both English-based and language-specific NLP approaches, as well as acoustic analyses, to assess spoken language in CHR cohorts (e.g., English, Mandarin, Cantonese, Korean, Spanish, German, Portuguese, Danish, French, Italian) from around the world, as is planned for the Accelerating Medicines Partnership® Program – Schizophrenia, to determine both universal and language-specific features of language disturbance characteristic of clinical risk for psychosis.
更多
查看译文
关键词
psychosis risk,clinical high risk,language features
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要