The University of Southampton
University of Southampton Institutional Repository

Childhood trauma and recent stressors in predicting subclinical psychotic symptoms among Chinese university students in southwest China: a machine learning analysis within a gender-specific framework

Childhood trauma and recent stressors in predicting subclinical psychotic symptoms among Chinese university students in southwest China: a machine learning analysis within a gender-specific framework
Childhood trauma and recent stressors in predicting subclinical psychotic symptoms among Chinese university students in southwest China: a machine learning analysis within a gender-specific framework
Background: subclinical psychotic symptoms (SPS) are common among college students and can lead to future mental health issues. However, it is still not clear which specific childhood trauma, stressors and health factors lead to SPS, partly due to confounding factors and multicollinearity.

Objective: to use machine learning to find the main predictors of SPS among university students, with special attention to gender differences.

Methods: a total of 21,208 university students were surveyed regarding SPS and a wide range of stress-related factors, including academic pressure, interpersonal difficulties, and abuse. Nine machine learning models were used to predict SPS. We examined the relationship between SPS and individual stressors using chi-square tests, multicollinearity analysis, and Pearson heatmaps. Feature engineering, t-SNE, and SHAP values helped identify the most important predictors. We also assessed calibration with calibration curves and Brier scores, and evaluated clinical usefulness with decision curve analysis (DCA) to provide a thorough assessment of the models. In addition, we validated this model using independent external data.

Findings: the XGBoost model had the best prediction results, with an AUC of 0.89, and validated with external data. It also showed good calibration, and decision curve analysis indicated clear clinical benefit. Interpersonal difficulties, academic pressure, and emotional abuse emerged as the strongest predictors of SPS. Gender-stratified analyses revealed that academic pressure and emotional abuse affected males more, while health issues like chest pain and menstrual pain were stronger predictors for females.

Conclusions: machine learning models effectively identified key stressors associated with SPS in university students. These findings highlight the importance of gender-sensitive approaches for the early detection and prevention of psychotic symptoms.

Clinical implications: SPS in college students can be predicted by interpersonal difficulties, academic stress, and childhood emotional abuse. This information can help mental health professionals develop better ways to prevent and address SPS.

What is already known on this topic: the close associations between childhood trauma, stressful life events, health factors, and SPS have already been established. However, it remains unclear which specific types of trauma and stressful events have the greatest impact. Comprehensive predictive models based on these factors have not yet been developed, especially for undergraduate populations.

What this study adds: by applying nine machine learning models, this study identified the optimal predictive model and found that interpersonal difficulties, academic stress, and childhood emotional abuse are the three most influential factors for SPS. Additionally, gender differences were observed: males face higher risk mainly due to greater academic stress, while pain symptoms have a stronger impact on females. The model's performance was also validated using external data.

How this study might affect research, practice, or policy: this study provides a foundation for further optimization and refinement of predictive models for assessing SPS risk in college students, as well as the development of targeted intervention strategies. It also offers clinical guidance for prevention and early intervention.
Child & adolescent psychiatry, Cross-Sectional Studies, Data Interpretation, Statistical, Machine Learning
2755-9734
Tang, Wanjie
8ef59307-d08d-4686-a8c1-ea2f97631bc5
Deng, Zijian
e64478e4-d06b-4450-b32a-e9735c5814fc
Sun, Zeyuan
930ef096-c213-4fc0-82d0-5859dcdea734
Zhao, Qijun
6152a1aa-deb1-4196-967f-be37ac18ec01
Garcia-Argibay, Miguel
e5a6941e-4dcc-401a-9de4-09557c8856ef
Kadan, Anoop
9cc17e26-a329-49fe-b73b-2fce75084966
Hu, Tao
030a82db-eb1f-4dea-b238-a801203329b3
Xue, Shuang
87ae01d7-98a8-427a-b92d-bf405602733f
Bozhilova, Natali S.
26780039-aa04-4479-9b01-1dfd9e8394b2
Conti, Aldo Alberto
4900702e-d877-485e-80e3-25ae0de19a90
Lukito, Steve
910e0488-3797-4b64-af43-2e0ffc574e65
Wu, Siqi
6d489ff2-84f5-4095-974d-7b5886f65c8f
Wang, Gang
5ea02268-47c4-4078-b2d1-0f80ee401dc0
Jin, Chunhan
d0473496-b7e5-45bd-b752-8c0a2f7c6e33
Qiu, Changjian
7bfcaac1-8093-4631-ae0e-698d0523bb6c
Liu, Qiaolan
36ce5dda-1122-46bf-b11e-c77b0f54c821
Pan, Jay
709830b7-e1cc-43cf-9a64-465a7c8b38cb
Cortese, Samuele
53d4bf2c-4e0e-4c77-9385-218350560fdb
Rubia, Katya
5f6c0771-6e32-4924-b3f1-e48ace6de377
Tang, Wanjie
8ef59307-d08d-4686-a8c1-ea2f97631bc5
Deng, Zijian
e64478e4-d06b-4450-b32a-e9735c5814fc
Sun, Zeyuan
930ef096-c213-4fc0-82d0-5859dcdea734
Zhao, Qijun
6152a1aa-deb1-4196-967f-be37ac18ec01
Garcia-Argibay, Miguel
e5a6941e-4dcc-401a-9de4-09557c8856ef
Kadan, Anoop
9cc17e26-a329-49fe-b73b-2fce75084966
Hu, Tao
030a82db-eb1f-4dea-b238-a801203329b3
Xue, Shuang
87ae01d7-98a8-427a-b92d-bf405602733f
Bozhilova, Natali S.
26780039-aa04-4479-9b01-1dfd9e8394b2
Conti, Aldo Alberto
4900702e-d877-485e-80e3-25ae0de19a90
Lukito, Steve
910e0488-3797-4b64-af43-2e0ffc574e65
Wu, Siqi
6d489ff2-84f5-4095-974d-7b5886f65c8f
Wang, Gang
5ea02268-47c4-4078-b2d1-0f80ee401dc0
Jin, Chunhan
d0473496-b7e5-45bd-b752-8c0a2f7c6e33
Qiu, Changjian
7bfcaac1-8093-4631-ae0e-698d0523bb6c
Liu, Qiaolan
36ce5dda-1122-46bf-b11e-c77b0f54c821
Pan, Jay
709830b7-e1cc-43cf-9a64-465a7c8b38cb
Cortese, Samuele
53d4bf2c-4e0e-4c77-9385-218350560fdb
Rubia, Katya
5f6c0771-6e32-4924-b3f1-e48ace6de377

Tang, Wanjie, Deng, Zijian, Sun, Zeyuan, Zhao, Qijun, Garcia-Argibay, Miguel, Kadan, Anoop, Hu, Tao, Xue, Shuang, Bozhilova, Natali S., Conti, Aldo Alberto, Lukito, Steve, Wu, Siqi, Wang, Gang, Jin, Chunhan, Qiu, Changjian, Liu, Qiaolan, Pan, Jay, Cortese, Samuele and Rubia, Katya (2025) Childhood trauma and recent stressors in predicting subclinical psychotic symptoms among Chinese university students in southwest China: a machine learning analysis within a gender-specific framework. BMJ Mental Health, 28 (1), [e301761]. (doi:10.1136/bmjment-2025-301761).

Record type: Article

Abstract

Background: subclinical psychotic symptoms (SPS) are common among college students and can lead to future mental health issues. However, it is still not clear which specific childhood trauma, stressors and health factors lead to SPS, partly due to confounding factors and multicollinearity.

Objective: to use machine learning to find the main predictors of SPS among university students, with special attention to gender differences.

Methods: a total of 21,208 university students were surveyed regarding SPS and a wide range of stress-related factors, including academic pressure, interpersonal difficulties, and abuse. Nine machine learning models were used to predict SPS. We examined the relationship between SPS and individual stressors using chi-square tests, multicollinearity analysis, and Pearson heatmaps. Feature engineering, t-SNE, and SHAP values helped identify the most important predictors. We also assessed calibration with calibration curves and Brier scores, and evaluated clinical usefulness with decision curve analysis (DCA) to provide a thorough assessment of the models. In addition, we validated this model using independent external data.

Findings: the XGBoost model had the best prediction results, with an AUC of 0.89, and validated with external data. It also showed good calibration, and decision curve analysis indicated clear clinical benefit. Interpersonal difficulties, academic pressure, and emotional abuse emerged as the strongest predictors of SPS. Gender-stratified analyses revealed that academic pressure and emotional abuse affected males more, while health issues like chest pain and menstrual pain were stronger predictors for females.

Conclusions: machine learning models effectively identified key stressors associated with SPS in university students. These findings highlight the importance of gender-sensitive approaches for the early detection and prevention of psychotic symptoms.

Clinical implications: SPS in college students can be predicted by interpersonal difficulties, academic stress, and childhood emotional abuse. This information can help mental health professionals develop better ways to prevent and address SPS.

What is already known on this topic: the close associations between childhood trauma, stressful life events, health factors, and SPS have already been established. However, it remains unclear which specific types of trauma and stressful events have the greatest impact. Comprehensive predictive models based on these factors have not yet been developed, especially for undergraduate populations.

What this study adds: by applying nine machine learning models, this study identified the optimal predictive model and found that interpersonal difficulties, academic stress, and childhood emotional abuse are the three most influential factors for SPS. Additionally, gender differences were observed: males face higher risk mainly due to greater academic stress, while pain symptoms have a stronger impact on females. The model's performance was also validated using external data.

How this study might affect research, practice, or policy: this study provides a foundation for further optimization and refinement of predictive models for assessing SPS risk in college students, as well as the development of targeted intervention strategies. It also offers clinical guidance for prevention and early intervention.

Text
maintext_20250712Dclean-include figures - Accepted Manuscript
Available under License Creative Commons Attribution.
Download (633kB)
Text
e301761.full - Version of Record
Available under License Creative Commons Attribution.
Download (1MB)

More information

Accepted/In Press date: 19 July 2025
e-pub ahead of print date: 31 July 2025
Published date: 31 July 2025
Keywords: Child & adolescent psychiatry, Cross-Sectional Studies, Data Interpretation, Statistical, Machine Learning

Identifiers

Local EPrints ID: 504544
URI: http://eprints.soton.ac.uk/id/eprint/504544
ISSN: 2755-9734
PURE UUID: f64e304a-c8b7-4c74-a45a-98bb969ce9ac
ORCID for Miguel Garcia-Argibay: ORCID iD orcid.org/0000-0002-4811-2330
ORCID for Anoop Kadan: ORCID iD orcid.org/0000-0002-4335-5544
ORCID for Samuele Cortese: ORCID iD orcid.org/0000-0001-5877-8075

Catalogue record

Date deposited: 15 Sep 2025 16:33
Last modified: 18 Sep 2025 02:13

Export record

Altmetrics

Contributors

Author: Wanjie Tang
Author: Zijian Deng
Author: Zeyuan Sun
Author: Qijun Zhao
Author: Miguel Garcia-Argibay ORCID iD
Author: Anoop Kadan ORCID iD
Author: Tao Hu
Author: Shuang Xue
Author: Natali S. Bozhilova
Author: Aldo Alberto Conti
Author: Steve Lukito
Author: Siqi Wu
Author: Gang Wang
Author: Chunhan Jin
Author: Changjian Qiu
Author: Qiaolan Liu
Author: Jay Pan
Author: Samuele Cortese ORCID iD
Author: Katya Rubia

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×