TY - JOUR
T1 - Identifying schizophrenia stigma on Twitter
T2 - a proof of principle model using service user supervised machine learning
AU - Jilka, Sagar
AU - Odoi, Clarissa Mary
AU - van Bilsen, Janet
AU - Morris, Daniel
AU - Erturk, Sinan
AU - Cummins, Nicholas
AU - Cella, Matteo
AU - Wykes, Til
N1 - Funding Information:
We thank all user-researchers and advisors who supported this work; Chinelo Daniels-Ifekwe, Helena Griffiths, Catherine Kilkenny, Sazan Meran, Caecilia Pawitra, Joel Vasama, Emma Wilson, Khaizer Rizvi, Magano Mutepua, Gregory Verghese, Sumithra Velupillai, and Angus Roberts. We thank the Young Person’s Mental Health Advisory Group for their constant input and advice on this important issue, and the King’s College London NLP reading group. This work was supported by the National Institute for Health Research (NIHR) Biomedical Research Centre at South London and Maudsley NHS Foundation Trust and King’s College London (IS-BRC-1215-20018).
Publisher Copyright:
© 2022, The Author(s).
PY - 2022/2/7
Y1 - 2022/2/7
N2 - Stigma has negative effects on people with mental health problems by making them less likely to seek help. We develop a proof of principle service user supervised machine learning pipeline to identify stigmatising tweets reliably and understand the prevalence of public schizophrenia stigma on Twitter. A service user group advised on the machine learning model evaluation metric (fewest false negatives) and features for machine learning. We collected 13,313 public tweets on schizophrenia between January and May 2018. Two service user researchers manually identified stigma in 746 English tweets; 80% were used to train eight models, and 20% for testing. The two models with fewest false negatives were compared in two service user validation exercises, and the best model used to classify all extracted public English tweets. Tweets classed as stigmatising by service users were more negative in sentiment (t (744) = 12.02, p < 0.001 [95% CI: 0.196–0.273]). Our linear Support Vector Machine was the best performing model with fewest false negatives and higher service user validation. This model identified public stigma in 47% of English tweets (n5,676) which were more negative in sentiment (t (12,143) = 64.38, p < 0.001 [95% CI: 0.29–0.31]). Machine learning can identify stigmatising tweets at large scale, with service user involvement. Given the prevalence of stigma, there is an urgent need for education and online campaigns to reduce it. Machine learning can provide a real time metric on their success.
AB - Stigma has negative effects on people with mental health problems by making them less likely to seek help. We develop a proof of principle service user supervised machine learning pipeline to identify stigmatising tweets reliably and understand the prevalence of public schizophrenia stigma on Twitter. A service user group advised on the machine learning model evaluation metric (fewest false negatives) and features for machine learning. We collected 13,313 public tweets on schizophrenia between January and May 2018. Two service user researchers manually identified stigma in 746 English tweets; 80% were used to train eight models, and 20% for testing. The two models with fewest false negatives were compared in two service user validation exercises, and the best model used to classify all extracted public English tweets. Tweets classed as stigmatising by service users were more negative in sentiment (t (744) = 12.02, p < 0.001 [95% CI: 0.196–0.273]). Our linear Support Vector Machine was the best performing model with fewest false negatives and higher service user validation. This model identified public stigma in 47% of English tweets (n5,676) which were more negative in sentiment (t (12,143) = 64.38, p < 0.001 [95% CI: 0.29–0.31]). Machine learning can identify stigmatising tweets at large scale, with service user involvement. Given the prevalence of stigma, there is an urgent need for education and online campaigns to reduce it. Machine learning can provide a real time metric on their success.
UR - http://www.scopus.com/inward/record.url?scp=85124453167&partnerID=8YFLogxK
U2 - 10.1038/s41537-021-00197-6
DO - 10.1038/s41537-021-00197-6
M3 - Article
AN - SCOPUS:85124453167
SN - 2334-265X
VL - 8
JO - NPJ SCHIZOPHRENIA
JF - NPJ SCHIZOPHRENIA
IS - 1
M1 - 1
ER -