Speech Processing Reading List
(Local) on-line resources
The publicly visible face of the book website is at http://www.islp.org.uk/
The real place where all those files are stored is http://www.phon.ox.ac.uk/files/slp/outline.htm
The contents of the book CD are available internally at http://www.phon.ox.ac.uk/files/SLP_CD/
Core textbook: Coleman, J. (2005) Introducing Speech and Language Processing. Cambridge.
Other useful textbooks
Johnson, K. (1997) Acoustic and Auditory Phonetics. Blackwell.
Jurafsky, D. and J. H. Martin (2000) Speech and Language Processing. Prentice-Hall. But this book is directed at computer science students, and is focussed more on language than on speech.
Preparatory reading for class 1
Coleman chapter 1 and p. 14.
Class 1. Digital signals
Coleman chapter 2; Johnson pp. 12-28.
ReferenceKernighan, B. W. and D. M. Ritchie (1988) The C Programming Language. Prentice-Hall.
Embree, P. M. and B. Kimble (1991) C Language Algorithms for Digital Signal Processing. Prentice-Hall.
Preparatory reading for class 2
Johnson pp. 3-18
Class 2. Digital filters and resonators
Coleman chapter 3; Johnson pp. 28-44.
More detail, if you really want it
Javkin, H. R. (1996) Speech analysis and synthesis. Chapter 7 of N. J. Lass, ed. Principles of Experimental Phonetics. Mosby.
Wakita, H. (1996) Instrumentation for the Study of Speech Acoustics. In N. J. Lass, ed. Principles of Experimental Phonetics. Mosby. Or, alternatively, an earlier version of the same paper: Wakita, H. (1976) Instrumentation for the Study of Speech Acoustics. In N. J. Lass, ed. Contemporary Issues in Experimental Phonetics. Academic Press. 3-40.
Schroeder, M. R. (1985) Linear Predictive Coding of Speech: Review and Current Directions. IEEE Communications Magazine 23 (8). 54-61.
Class 3. Finite-state machines
Coleman chapter 5
Background reading: Chomsky, N. (1956) Syntactic Structures. Mouton. pp. 18-20.
Jurafsky and Martin pp. 33-52, 105-110.
Prolog reference textbook: Clocksin, W. F. and C. S. Mellish (2003) Programming in Prolog.
Class 4. Introduction to speech recognition techniques
Coleman chapter 6
Other reading:
Ainsworth, W. A. (1997) Some Approaches to Automatic Speech Recognition. Chapter 24 of W. J. Hardcastle and J. Laver, eds. The Handbook of Phonetic Sciences. Blackwell.
Schmandt, C. (1994) Voice Communication with Computers: Conversational Systems. Van Nostrand Reinhold. Chapter 7.
Preparatory reading for class 5: Charniak, E. (1993) Statistical Language Learning. MIT Press. Chapter 2.
Class 5. Probabilistic finite-state models (including Hidden Markov Models)
Coleman chapter 7
Other reading: Charniak Ch. 3.
Manning, C. D. and H. Schütze (1999) Foundations of Statistical Natural Language Processing. MIT Press. Chapter 9.
Rabiner, L. R. (1989) A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. Proceedings of the IEEE. Reprinted in A. Waibel and K.-F. Lee (eds) Readings in Speech Recognition. Morgan Kaufmann. 267-297.