Lecture Notes in Speech Production, Speech Coding, and Speech Recognition

Mark Hasegawa-Johnson, University of Illinois at Urbana-Champaign


These lecture notes were written for a series of three courses (one undergraduate, two graduate) which I lectured or co-taught at UCLA in the spring of 1998. Since them, many folks have asked for copies, so I decided to put them on my web page.

Caveat emptor: I do not know of any errors in these notes, but the notes have not been edited nearly as carefully as a published textbook, so errors almost certainly exist. The writing style is the style of lecture notes (i.e. dense with equations), not the style of a textbook.

Chapter PostScript PDF
Table of Contents, Table of Figures (click here) (click here)
1. Review of Basic DSP (click here) (click here)
2. Speech Production (click here) (click here)
3. Short-Time Signal Processing (click here) (click here)
4. Linear Predictive Coding (click here) (click here)
5. Spectral and Cepstral Distance Measures (click here) (click here)
6. Engineering Models of Audition (click here) (click here)
7. Speech Coding (click here) (click here)
8. Speech Recognition (click here) (click here)
References (click here) (click here)

Note on the "References" section: Unfortunately, I have not had time to properly cite original sources for many of these ideas. Eventually, I will...

In the mean time, I have tried to include a good list of published textbooks in speech, signal processing, phonetics, hearing, and statistics. I keep most of these books right next to me on my desktop, and I recommend all of them to folks interested in learning about speech.