Recent Changes - Search:

Home

Peer Reviewed

Courses

Allen's Personal Website

PmWiki

Edit SideBar

http://www.pmwiki.org/

ECE537-2008SpeechProcessing

ECE 537 Schedule (Fall 2008)

L/WDDateLecture and Assignment
1/1M8/25Self-introductions from each student.
Lecture: Overview: Speech production, Acoustics, Psychophysics, neural and information processing.
Entropy: Its meaning, definition and the intuition behind it;
Read: Flanagan, Chapt. 1 djvu
2W8/27Lecture: Mechanisms of Speech production: Sounds of speech: Vowels and consonants; Chapter 2 djvu;
HW1: Basic Acoustics (due Wed 9/3/07) pdf
3F8/29Lecture: Basic acoustics and the ABCD-Transmission (Chain) matrix; Acoustic Transmission lines;
Text: Flanagan, Ch. 3.1-3.2 djvu;
Read: Flanagan, Ch. 2.2 djvu
-/2M9/1 Labor Day Holiday -- No class
4W9/3Lecture: Solution of 1-D transmission line equation:
Text: Sec. 6.26; Lip radiation impedance + Sec. 3.3
-F9/5Special Lecture place: 2269BI for the ASDN Seminar series: The role of the cochlea in Human speech recognition
5/3M9/8Lecture: Intensity, speech power; sound level; dB, dB-SPL, Pressure, volume velocity, impedance. Vowels, Formants;
HW2: TL and reflectance (due Mon 9/15) pdf
6W9/10Lecture:d'Alembert solutions for 1-D and 3-D transmission lines; Sound propagation in tubes in speech production; Introduction to reflectance; Conversion tables for 2-ports (djvu);
Read:: Model of Ear drum, Parent and Allen 2007 (pp 918-920) (djvu, pdf);
Review Matlab filter design, bilinear Z, FIR, IIR
7F9/12Lecture: Transmission Lines with complex loads and the Propagated Reflectance;
Text: Flanagan, Ch. VI (Sect. 6.262, pp 272-276);
Readings: Bilbao PhD Thesis p. 1-15 djvu
8/4M9/15Lecture:Impedance and reflectance at a tube junction (Karal correction djvu), and half-sphere (the mouth)
Text:: Ch. 3, Sec. 3.3, pages 136-152
9W9/17This lecture will be given on 9/17 at noon;
Lecture : 3-port networks and the nasal tract;
HW3: TLs with complex terminations (due in two weeks on Wed 10/1);
Bilabo Thesis (see page 14) djvu; Simulation of the middle ear: pdf;
Read:: Text Chap. 3, sect. 3.4.1 pages 153-156 radiation impedance;
Review: Rosowski, Carney and Peak (1988) on the cat middle ear djvu;
Background material: Guinan and Peak djvu; Lynch et al. 1982 djvu
10W9/17This lecture will be delivered on Wed 9/17, at 1PM
Lecture: Signal processing review: Fourier Series, Fourier Transform, Laplace Transform, ZT, DTFT, DFT, FFT;
Read: Peterson and Barney (1952) djvu and Peterson (1952) djvu
-F9/19 Allen out of town
11/5M9/22Lecture: Signal processing Review cont.; Network Postulates djvu; Optional Read: Thevenin; Norton
12W9/24Lecture: STFT window methods; Inverse STFT; STFT for speech processing with analysis/synthesis; filtering; Reading spectrograms
13F9/26Lecture: History of acoustics:
BC: Pythagoras; Aristotle;
17C: Mersenne, Marin; Galilei, Galileo; Hooke, Robert; Boyle, Robert; Newton, Sir Issac;
18C: Bernoulli, Daniel; Euler; d'Alembert;
19C: Gauss; Laplace; Fourier; Lagrange; Helmholtz; Heaviside; Strutt, William; Rayleigh, Lord; Bell, AG
20C: Campbell, George; Hilbert, David; Noether, Emmy; Fletcher, Harvey; Nyquist, Harry; Bode, Henrik; Dudley, Homer; Shannon, Claude; Flanagan, James;
14/6M9/29Lecture: HW2 review; Impedance and reflectance; Bernoulli's equation Wiki
Text: Flanagan pp 41-53; Read: vanDen Berg (1957) djvu
15W10/1Lecture: The glottal oscillator; Bernoulli Eq. Derivation;
Read:: Flanagan Sec. 3.74 pages 69-72
16F10/3 HW3 due;
Lecture: Linear prediction of speech;
HW4: Vocal-Tract Simulation: Due Mon 10/13 (pdf, files);
Read: Flanagan Sec. 8.112, pp 372-376; Sec. 8.13, pp 390-395; Atal and Hanauer (1971) pdf,djvu;
17/7M10/6Review of HW3;
Lecture: Cepstral analysis;
Read:: Flanagan Chap. 8, pp 361-363;
Begin writing your Final exam: Part I;
18W10/8Lecture: CELP coding;
In-class review for Exam I
19F10/10Lecture: Room acoustics; point source; 1, 2 and 6 wall Image method; Wall reflection coef. for finite impedance walls Di and Gilbert (1993) djvu
20/8M10/13no class due to Exam I
HW5 (ver 1.03 10/21/08): LPC (Due 1 week 10/20), Speech samples
-W10/15Exam I: Modeling the VT, STFT, Signal processing of speech;
Time: Wed night (Oct. 15), 7-9PM;
Place: 214 Ceramics Building;
Rules: 1 8.5x11 crib sheet, two sides, hand written. No computer print sheets!!; Leave cell phones home. Calculators on the floor and off when your not using them.
Scores: [94 90 89 89 89 88 85 81 80 79 78 78]
21W10/15Lecture: Psychoacoustics I: Intensity JND and the near-miss;
Internal noise model of the JND Riesz pure-tone intensity JND (1928) djvu Masking;
Weber's and Fechner's Law;
Introduction to loudness, Steven's Law;
Loudness Lecture notes (Allen): pdf
Read:: Flanagan Chapter 4 (pdf, original pdf, djvu)
HW4 due
22F10/17Lecture: Psychoacoustics II: Frequency JND, semitone, Internal noise and Masking; relation between the intensity and frequency JND (Cochlear frequency response and the slope of the tuning curve);
Read:: Allen Review (pages 20-30) djvu, Fletcher and Munson (1933) (pp 82-94) djvu
23/9M10/20Lecture: Cochlear Physiology I:Middle ear and inner ear (Cochlear) anatomy, basilar membrane, 1D Models, Hair cells, Nonlinear basilar membrane;
Read:: Review of Cochlear Modeling: Part II (pp 19-28): (pdf, djvu); Wegel and Lane (1924);
MIT/HST-725: The auditory system pdf
24W10/22Lecture: Cochlear Physiology II: traveling waves, neural tuning curves, critical bands, hair cells, neural masking, Upward spread of masking; Forward masking;
Auditory Pathway I: Neural Tuning pdf
Read:: Review of Cochlear Modeling (pp 1-19) (djvu);
Part I Supplement: Pitch: MIT HST725-5 Pitch models pdf
Fletcher and Pitch djvu
25F10/24Lecture: Cochlear Physiology III: Micromechanics, OHC,
IHC Lecture Notes (djvu): Modeling the Cochlea and Organ of Corti, and
Read:: Wegel and Lane (1924), Part II (djvu)
HW6: STFT/OLA/Speech coding (Due Mon 11/3)HW6
26/10M10/27Lecture: Psychoacoustics III: Relations between Psychophysics and the cochlea; Greenwood's place-map function; forward masking; upward spread of masking
Read:: Review of Cochlear Modeling (pp 1-19) (djvu), Part I; Fletcher and Munson (1933) (pp 82-94) (djvu)
27W10/29Lecture: Cochlear Physiology IV: Nonlinear Cochlear model; supplement: The Auditory Nerve (djvu)
Read:: 'Representation of speech-like sounds ... auditory-nerve fibers' Delgutte (1980)(djvu);
HW7: (not assigned this year)
28F10/31Lecture: Cochlear Critical bands: upward spread of masking; 2 tone suppression;
Read:: 'Carlin's Postulates' of network theory (djvu); 'Harvey Fletcher's role ...' (all but the AI section) (djvu)
29/11M11/3Lecture: Prof. Wickesberg, Auditory Pathway (AN+CN): Fall 2008 ppt (Fall 2006 Part I: djvu, Part II: djvu)
30W11/5Lecture: Prof. Pandya: The central auditory system (A1 with BN stimulation) (pdf)
31F11/7Lecture: Pandya-continued; Allen Review for Exam II;
32/12M11/10Exam II: NO CLASS Monday at 1PM Psychoacoustics, Physiology, Speech, LPC, Tubes, Historical items;
Rules: one 8.5x11 facts-sheet two sides (hand written), plus your Exam I facts-sheet
Date: 11/5/2008 Time: 7:00 PM Place:'' 214CB
33W11/12Lecture: Information theory I: Information, Entropy, Relative Entropy; Channel Capacity
HW8: Information processing (Due Fri 11/21); (pdf, files)
Read:: pp 1-10 Shannon (1948) (djvu, pdf I+pdf II)
34F11/14Lecture: Information theory II: Morse code example Shannon Channel
Read:: Shannon (1950) (djvu, pdf)
35/13M11/17Lecture: Information theory III:; Entropy, Relative Entropy, Markov models, State diagram;
DUE: Preliminary version of your Final Exam to me for my review
36W11/19Lecture: EM algorithm: Example: Speech and noise separation
Read:: French and Steinberg (1947) (djvu); Good-Turing djvu
Discussion of the EM alg. with examples
37F11/21Lecture: Prof. Michael Heinz Purdue: Quantifying envelope and fine-structure coding in auditory-nerve responses to chimaeric speech; Support material: RV Shannon et al (1995) djvu, Smith et al (2002) djvu
Read:: Steeneken & Houtgast (1980) Speech Transmission Index (djvu) and Houtgast (1989) Modulation detection (djvu)
--- Thanksgiving Holiday (11/22-12/1)
---HW8 solution: pdf
38/14M12/1Lecture: Human speech recognition (HSR), Articulation Index (AI), average score: Pc(AI)=1-echanceeminAI, The confusion matrix (CM), maximum entropy syllable error models, etc.; Allen notes (djvu)
Read:: Continue with French and Steinberg (1947)djvu; Miller Nicely (1955) djvu; Miller Nicely confusions as a function of the articulation index; entropy, grouping and chance djvu
Discussion of HW8
Assignment: Work on Final exam
39W12/3Lecture: Effects of language and semantic context Miller (1962), Boothroyd (1988),
Allen Notes: Events and the AI(djvu)
Read: Miller, Heisen and Lichten (1951) (djvu)
40F12/5Lecture: Language context models, Bronkhorst93.djvu
41/15M12/8Lecture: Prof. Richard Sproat and ``Alice in wonderland`` ppt
Read: W. Li Random texts exhibit Zipf's-Law djvu
43W12/10Last class: Final overview of the course material, re your final exam.
Free Pizza.
-/16M12/11 Final Due on reading day, Dec 11.
--- Not proofed beyond here

Final

The final is a 15-25 page paper, written in the style and format (but single column) of a journal paper, that discusses everything that you have learned in this course. Writing style, spelling, figures, labels of figures, are all part of the grade.

Allen's Grading of the final

The final is graded based on a list of all the topics that are covered. If there is a paragraph that discusses each topic on my list, then you get at least 1 point, and if the discussion covers the topic effectively, you can get up to 5 points. There are at least 20 topics on the list. When you get to 100 points, you get an A+ on the exam. I expect that you draw on the homework as a starting point. Don't just dump the homework into the exam without modification, that wont get you points. Don't just dump a large number of unexplained figures (that you got from someone else for example) and expect to get points. I need words around each figure. I am looking for insightful comments that link the material together.

Your comments on the relevance of each of the topics I covered in this course, homework problems, exams, etc., are welcome. No points will be taken off, nor given, for strong opinions on my teaching style, or lack thereof, organization, or lack thereof, etc. Please put all such comments in a discussion section at the end of the paper, isolated from the rest of the material.

course description

Fall 2008 calendar

Edit - History - Print - Recent Changes - Search
Page last modified on December 03, 2009, at 10:36 AM