Courses:

Automatic Speech Recognition >> Content Detail

Lecture Notes

Lecture Notes

Media player software, such as Quicktime® Player, RealOne™ Player, or Windows Media® Player, is required to run the .wav files in this section.

This section contains a complete set of lecture slides for the course, including guest lectures. Lectures 3, 4, and 6 have audio links to speech samples presented during the lectures.


WEEK #	LEC #	TOPICS


1	1 2	Course Overview (PDF) Acoustic Theory of Speech Production (PDF - 1.4 MB)


2	3 4	Speech Sounds (PDF - 3.6 MB) Speech Sounds (continued)


3	5 6	Signal Representation (PDF - 1.9 MB) Vector Quantization (PDF - 1.8 MB)


4	7 8	Pattern Classification (1) (PDF - 1.1 MB) Pattern Classification (2) (PDF)


5	9 10	Search (PDF) Hidden Markov Modeling (1) (PDF)


6	11 12	Language Modeling (PDF) Language Modeling (continued)


7	13	Guest Lecture by Karen Livescu: Graphical Models (PDF) Quiz 1


8	14 15	Guest Lecture by Rita Singh: Hidden Markov Modeling (2) (PDF - 2.1 MB) Guest Lecture by Rita Singh: Hidden Markov Modeling (3) (PDF - 1.4 MB)


9	16 17	Segment-Based ASR (PDF) Guest Lecture by Lee Hetherington: Finite-State Transducers (PDF)


10	18 19	Acoustic-Phonetic Modeling (PDF) Robust ASR (1) (PDF)


11	20 21	Guest Lecture by Timothy Hazen: Robust ASR (2) (PDF) Guest Lecture by Timothy Hazen: Adaptation (PDF)


12	22 23	Speech Understanding (PDF - 1.1 MB) Guest Lecture by Timothy Hazen: Paralinguistic Information (PDF - 1.0 MB)


13		Quiz 2 No Lecture


14		Term Project Presentations

RealOne™ is a trademark or a registered trademark of RealNetworks, Inc.
QuickTime® is a trademark of Apple Computer, Inc., registered in the U.S. and other countries.
Windows Media® is a registered trademark or trademark of Microsoft Corporation in the U.S. and/or other countries.

Course Home

Courses:

Automatic Speech Recognition >> Content Detail

Table of Contents

Lecture Notes