Identifying First Episodes of Psychosis in Psychiatric Patient Records using Machine Learning

Genevive Gorrell, Sherifat Oduola , Angus Roberts, Thomas Craig, Craig Morgan, Rob Stewart

Research output: Contribution to conferencePaperpeer-review

3 Citations (Scopus)


Natural language processing is being pressed into use to facilitate the selection
of cases for medical research in electronic health record databases, though study inclusion criteria may be complex, and the linguistic cues indicating eligibility may be subtle. Finding cases of first episode psychosis raised a number of problems for automated approaches, providing an opportunity to explore how machine learning technologies might be used to overcome them. A system was delivered that achieved an AUC of 0.85, enabling 95% of relevant cases to be identified whilst halving the work required in manually reviewing cases. The techniques that made this possible are presented.
Original languageEnglish
Publication statusPublished - 2016
Event15th Workshop on Biomedical Natural Language Processing - Berlin, Germany
Duration: 12 Aug 2016 → …


Workshop15th Workshop on Biomedical Natural Language Processing
Period12/08/16 → …
Internet address

Cite this