Personal profile
Biography
Website: https://www.uea.ac.uk/computing/speech-language-and-audio-processing
Follow this link for details of current PhD opportunities in Computing Sciences. But feel free to email me to discuss projects outside these areas and alternative sources of funding.
Career
I trained first as a physicist and then as an electronic engineer, and began my career at the UK Government Communications Centre developing signal-processing algorithms. I then joined British Telecom's research laboratories to work on speech recognition, and spent two years at the speech research unit of the Royal Signals and Radar Establishment (now QinetiQ) at Malvern, where I researched the adaptation of speech recognition algorithms to new speakers. I returned to BT to lead a team of researchers developing speech recognition algorithms for use on the UK telephone network. I joined the School of Computing Sciences at UEA as a lecturer in 1991 and was appointed professor in 2003. My research interests include speech recognition, music processing, audio identification and automatic lip-reading, and I am author or co-author of over 100 publications in these fields. I was an invited consultant at AT&T Bell Labs, New Jersey, in 1994, a visiting scientist at Nuance Communications Inc., CA, in 2000, and an invited researcher at Apple Inc., CA, in 2010. I have also acted as a consultant and reviewer for several national governments and the European Commission, and have consulted for industry. I am a former committee member of the IEEE Speech and Language Technical Committee.
Since 2017, I have been working on the MRC-funded CAVA project, https://sites.uea.ac.uk/cava-project. This is a collaboration between Mr John Phillips, an ENT surgeon at the NNUH, and Dr Jacob Newman and me in CMP. The goal is to develop a 24/7 dizziness monitor that works by tracking patients' eye movements.
For a full list of my publications, most downloadable, go to http://www2.cmp.uea.ac.uk/~sjc/
Key Research Interests
Stephen Cox is part of the Speech, Language and Audio Processing Group
His principal research interest is in speech processing, especially automatic speech recognition. Current research projects are in the use of speaker adaptation for speech recognition, speech synthesis, confidence measures for speech recognisers and automatic routing of telephone enquiries. He is the author of over 60 papers in the field of speech processing.
Publications:
Caballero Morales, S.O. and Cox, S.J., Modelling Confusion-Matrices to Improve Speech Recognition Accuracy, with an Application to Dysarthric Speech. Proc. 10th International Conference on Spoken Language Processing (Interspeech), Antwerp, August 2007
Read, I. and Cox, S.J., Automatic Pitch Accent Prediction for Text-To-Speech Synthesis. Proc. 10th International Conference on Spoken Language Processing (Interspeech), Antwerp, August 2007
Cox, S.J., On Estimation of Speakers’ Confusion Matrices from Sparse Data. Proc. 11th International Conference on Spoken Language Processing (Interspeech), Brisbane, September 2008
Caballero Morales, S.O. and Cox, S.J., Application of Weighted Finite-State Transducers to Improve Recognition Accuracy for Dysarthric Speech. Proc. 11th International Conference on Spoken Language Processing (Interspeech), Brisbane, September 2008
Cox, S.J., Harvey, R., Lan, Y., Newman, J.L. and Theobald, B.J., The challenge of multispeaker lip-reading. Proc. International Conference on Auditory-Visual Speech Processing 2008, Tangalooma, Australia.
Theobald, B.J., Harvey, R., Cox, S.J., Lewis, C. and Owen, G.P., Lip-reading enhancement for law enforcement. Proc. SPIE conference on Optics and Photonics for Counterterrorism and Crime Fighting, G. Owen and C. Lewis, Eds., vol. 6402, September 2006, pp. 205–9.
Newman, J.L. and Cox, S.J., Automatic Visual-Only Language Identification: A Preliminary Study. Proc. IEEE Conf. on Acoustics, Speech and Signal Processing, Taiwan, 2009.
Caballero Morales, S.O. and Cox, S.J., Modelling Errors in Automatic Speech Recognition for Dysarthric Speakers. EURASIP Journal on Advances in Signal Processing. Volume 2009 (2009), Article ID 308340, 14 pages. doi:10.1155/2009/308340.
Caballero Morales, S.O. and Cox, S.J., On the Estimation and the Use of Confusion-Matrices for Improving ASR Accuracy. Proc. 12th International Conference on Spoken Language Processing (Interspeech), Brighton, September 2009.
Watkins, C. and Cox, S.J., Example-Based Speech Recognition using Formulaic Phrases. Proc. 12th International Conference on Spoken Language Processing (Interspeech), Brighton, September 2009.
Newman, J.L. and Cox, S.J., Speaker Independent Visual-Only Language Identification. Proc. IEEE Conf. on Acoustics, Speech and Signal Processing, Dallas, 2010.
Huang, Q. and Cox, S.J., Hierarchical Language Modeling for Audio Events Detection in a Sports Game. Proc. IEEE Conf. on Acoustics, Speech and Signal Processing, Dallas, 2010.
Selected Publications:
Read, I. and Cox, S. J., Stochastic and Syntactic Techniques for Predicting Phrase Breaks. Computer Speech and Language, Volume 21, Issue 3, Page(s) 519-542, 2007.
Huang, Q. and Cox, S. J., Task-Independent Call-Routing. Speech Communication, Volume 48, Issues 3-4, Page(s) 374-389, 2006.
Cox, S. J., Lincoln, M., Nakisa, M., Wells, M., Tutt, M. and Abbott, S., The Development and Evaluation of a Speech to Sign Translation System to Assist Transactions. Int. Journal of Human Computer Interaction, Volume 16, Issue 2, Page(s) 141-161, 2003.
Cox, S. J. and Dasmahapatra, S., High Level Approaches to Confidence Estimation in Speech Recognition. IEEE Transactions on Speech and Audio Processing, Volume 10, Issue 7, Page(s) 460-471, 2002.
Key Responsibilities
Head of Speech Language Group
Director of Research
Collaborations and top research areas from the last five years
-
Development of a System to Provide an Automatic Diagnosis for Vestibular Conditions (CAVA 2.0)
Phillips, J., Cox, S., Fordham, R., Colles, A., High, J., Pond, M., Shepstone, L., Swart, A. M. & Wright, A.
National Institute for Health and Care Research
1/04/22 → 31/03/25
Project: Research
-
Production of a device to obtain continuous ambulatory vestibular assessment (CAVA)
Phillips, J., Cox, S., Frenneaux, M. & Smith, R.
1/11/17 → 30/09/24
Project: Research
-
Robust video-to-text systems
Harvey, R., Bowden, R., Cox, S. & Southam, P.
Government Procurement Services
1/08/13 → 31/12/15
Project: Research
-
Improving the robustness of a smartphone-based emotion-from-voice app
1/07/13 → 31/03/14
Project: Research
-
Video-to-text (phase 3)
Harvey, R., Bowden, R., Cox, S. & Theobald, B.
1/05/12 → 31/12/15
Project: Research
-
Reconstructing animated eye movements from electrooculography data to aid the diagnosis of vestibular disorders
Newman, J. L., Phillips, J. S. & Cox, S. J., Jan 2022, In: International Journal of Audiology. 61, 1, p. 78-83, 6 p. Research output: Contribution to journal › Article › peer-review
Open Access, 1 Citation (Scopus), 22 Downloads (Pure)
-
The Suitability of the CAVA Device as an Ambulatory Monitor for Detecting Dizziness
Newman, J., Cox, S. & Phillips, J., 9 May 2022. Research output: Contribution to conference › Poster › peer-review
Open Access, 19 Downloads (Pure)
-
Using the CAVA Device to Assess Patients with Menieres Disease
Newman, J., Cox, S. & Phillips, J., 9 May 2022. Research output: Contribution to conference › Poster › peer-review
Open Access, 16 Downloads (Pure)
-
1D convolutional neural networks for detecting nystagmus
Newman, J. L., Phillips, J. S. & Cox, S. J., May 2021, In: IEEE Journal of Biomedical and Health Informatics. 25, 5, p. 1814-1823, 10 p., 9201308. Research output: Contribution to journal › Article › peer-review
Open Access, 18 Citations (Scopus), 27 Downloads (Pure)
-
Clinical techniques and technology: Vestibular telemetry
Phillips, J. S., Newman, J. L. & Cox, S. J., 1 Nov 2021, In: Otolaryngology - Head and Neck Surgery. 165, 5, p. 751-753, 3 p. Research output: Contribution to journal › Article › peer-review
Open Access, 2 Citations (Scopus), 16 Downloads (Pure)
Prizes
-
Outstanding Impact in Health, Wellbeing and Welfare
Phillips, John (Recipient), Cox, Stephen (Recipient) & Newman, Jacob (Recipient), 17 May 2022
Prize: Prize (including medals and awards)
-
IEEE Speech and Language Processing Technical Committee (External organisation)
Stephen Cox (Member)
2006. Activity: Membership › Committee
-
AMI Workshop on Multimodal Interaction and Related Machine Learning Algorithms
Stephen Cox (Keynote/plenary speaker)
2004. Activity: Participating in or organising an event › Participation in workshop or seminar
-
UK Institute of Acoustics Speech Group (External organisation)
Stephen Cox (Chair)
1998 → 2003. Activity: Membership › Committee