Stephen Cox

Professor Emeritus, Professor

  • 2.17 Biology

If you made any changes in Pure these will be visible here soon.

Personal profile

Biography

Website: https://www.uea.ac.uk/computing/speech-language-and-audio-processing

Follow this link for details of current PhD opportunities in Computing Sciences. But feel free to email me to discuss projects outside these areas and alternative sources of funding.

Career

I trained firstly as a physicist and then as an electronic engineer, and began my career at the UK Government Communications Centre developing signal-processing algorithms. I then joined British Telecoms's research laboratories to work on speech recognition, and spent two years at the speech research unit of the Royal Signals and Radar Establishment (now Qinetiq) at Malvern, where I researched into adaptation of speech recognition algorithms to new speakers. I returned to BT to lead a team of researchers developing speech recognition algorithms for use on the UK telephone network. I joined the School of Computing Sciences at UEA as a lecturer in 1991 and was appointed professor in 2003. My research interests include speech recognition, music processing, audio identification and automatic lip-reading and I am author or co-author of over 100 publications in these fields.  I was an invited consultant at AT&T Bell Labs, New Jersey in 1994, a visiting scientist at Nuance Communications Inc., CA, in 2000, and an invited researcher at Apple Inc., CA, in 2010.  I have also acted as a consultant and reviewer for several national governments as well as the European Commission, and also consulted for industry.  I am an ex committee member of the IEEE Speech and Language Technical Committee.

Since 2017, I have been working on the MRC-funded CAVA project, https://sites.uea.ac.uk/cava-project. This is a collaboration between an ENT surgeon at the NNUH, Mr John Phillips, and Dr Jacob Newman and myself in CMP. The goal is to develop a 24/7 dizziness monitor by monitoring patients' eye movements.   

For a full list of my publications, most downloadable, go to http://www2.cmp.uea.ac.uk/~sjc/

 

External Activities

  • Member IEEE Speech and Language Processing Technical Committee, 2006
  • Chairman, UK Institute of Acoustics Speech Group, 1998-2003
  • Keynote speech, AMI Workshop on “Multimodal Interaction and Related Machine Learning Algorithms”, Martigny, Switzerland, 2004

Key Research Interests and Expertise

Stephen Cox is part of the Speech, Language and Audio Processing Group

His principal research interest is in speech processing, especially automatic speech recognition. Current research projects are in the use of speaker adaptation for speech recognition, speech synthesis, confidence measures for speech recognisers and automatic routing of telephone enquiries. He is the author of over 60 papers in the field of speech processing.
 

Publications:

Caballero Morales, S.O. and Cox, S.J., Modelling Confusion-Matrices to Improve Speech Recognition Accuracy, with an Application to Dysarthric Speech. Proc. 10th International Conference on Spoken Language Processing (Interspeech), Antwerp, August 2007

Read, I. and Cox, S.J., Automatic Pitch Accent Prediction for Text-To-Speech Synthesis. Proc. 10th International Conference on Spoken Language Processing (Interspeech), Antwerp, August 2007

Cox, S.J., On Estimation of Speakers’ Confusion Matrices from Sparse Data. Proc. 11th International Conference on Spoken Language Processing (Interspeech), Brisbane, September 2008

Caballero Morales, S.O. and Cox, S.J., Application of Weighted Finite-State Transducers to Improve Recognition Accuracy for Dysarthric Speech. Proc. 11th International Conference on Spoken Language Processing (Interspeech), Brisbane, September 2008

Cox, S.J., Harvey, R., Lan, Y., Newman, J.L. and Theobald, B.J., The challenge of multispeaker lip-reading. Proc. International Conference on Auditory-Visual Speech Processing 2008, Tangalooma, Australia.

Theobald, B.J., Harvey, R., Cox, S.J., Lewis, C. and Owen, G.P., Lip-reading enhancement for law enforcement. Proc. SPIE conference on Optics and Photonics for Counterterrorism and Crime Fighting, G. Owen and C. Lewis, Eds., vol. 6402, September 2006, pp. 205–9.

Newman, J.L. and Cox, S.J., Automatic Visual-Only Language Identification: A Preliminary Study. Proc. IEEE Conf. on Acoustics, Speech and Signal Processing, Taiwan, 2009.

Caballero Morales, S.O. and Cox, S.J., Modelling Errors in Automatic Speech Recognition for Dysarthric Speakers. EURASIP Journal on Advances in Signal Processing. Volume 2009 (2009), Article ID 308340, 14 pages. doi:10.1155/2009/308340.

Caballero Morales, S.O. and Cox, S.J., On the Estimation and the Use of Confusion-Matrices for Improving ASR Accuracy. Proc. 12th International Conference on Spoken Language Processing (Interspeech), Brighton, September 2009.

Watkins, C. and Cox,S.J.,Example-Based Speech Recognition using Formulaic PhrasesProc. 12th International Conference on Spoken Language Processing (Interspeech), Brighton, September 2009.

Newman, J.L. and Cox, S.J., Speaker Independent Visual-Only Language Identification. Proc. IEEE Conf. on Acoustics, Speech and Signal Processing, Dallas, 2010.

 Huang, Q. and Cox, S.J., Hierarchical Language Modeling for Audio Events Detection in a Sports Game. Proc. IEEE Conf. on Acoustics, Speech and Signal Processing, Dallas, 2010.

 

Selected Publications:

Read, I. and Cox, S. J., Stochastic and Syntactic Techniques for Predicting Phrase Breaks. Computer Speech and Language, Volume 21, Issue 3, Page(s) 519-542, 2007.

Huang, Q. and Cox, S. J., Task-Independent Call-Routing. Speech Communication, Volume 48, Issues 3-4, Page(s) 374-389, 2006.

Cox, S. J., Lincoln, M., Nakisa, M., Wells, M., Tutt, M. and Abbott, S., The Development and Evaluation of a Speech to Sign Translation System to Assist Transactions. Int. Journal of Human Computer Interaction, Volume 16, Issue 2, Page(s) 141-161, 2003.

Cox, S. J. and Dasmahapatra, S., High Level Approaches to Confidence Estimation in Speech Recognition. IEEE Transactions on Speech and Audio, Volume 10, Issue 7, Page(s) 460-471, 2002.
 

Key Responsibilities

Director of Graphics, Vision and Speech Language Laboratory
Head of Speech Language Group
Director of Research

Specialisms

Computer speech processing, esp. speech recognition and speech synthesis; computer music processing, including musical similarity estimation; computer pattern recognition.

Network

Recent external collaboration on country/territory level. Dive into details by clicking on the dots or