Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
7/23/2015
7/23/2015
Acoustic waves
Speed = wavelength x frequency
Information in speech?
Linguistic (message -> sentences -> words -> phonemes)
The speech signal is characterised by an enormous range
of elementary perceptually contrasting sounds!
Paralinguistic:
--expressive (emotions, mood)
--speaker-based (age, gender, accent and style)
7/23/2015
Generating speech*
Respiration->phonation
->articulation
Vibrating vocal cords
create puffs of air giving
rise to air pressure
variations which reach
our ears.
*HyperPhysics, Sound and
Hearing, Georgia State
University
Department of Electrical Engineering , IIT Bombay
7/23/2015
f1
c
4L
; f2
3c
;
4L
f3
5c
; .......
4L
10
7/23/2015
Velum
Pharyngeal
cavity
Oral
Cavity
Teeth
Articulators
Lips
Tongue
Jaw
Vocal
cavity
Vocal cords
Moving muscles
which alter the
resonant cavities
*Securivox
tutorial
Dynamic cavity
Static cavity
11
12
7/23/2015
13
"Briefly, the device was operated in the following manner. The right arm rested on the main bellows
1875
Alexander Bell invents the method of, and apparatus for,
transmitting vocal or other sounds telegraphically ... by causing
electrical undulations, similar in form to the vibrations of the air
accompanying the said vocal or other sound.
=> Major impetus to modern speech processing.
1930s: Electrical synthesis of speech by Dudleys vocoder
14
7/23/2015
15
Speech waveform
16
7/23/2015
(b) ee vowel
(c) s consonant
17
T0 = 10 msec
1 Hertz = 1 vibration/sec
Frequency = 300 Hz
T0 =
3.3 msec
Department of Electrical Engineering , IIT Bombay
18
7/23/2015
Components of sound
A sound is usually comprised of several frequency
components.
Depending on the relationships of the frequency
components, the sound can elicit a sensation of pitch.
19
300 Hz
600 Hz
900 Hz
300 Hz
+ 600Hz
300 Hz +
600Hz +
900Hz
Department of Electrical Engineering , IIT Bombay
20
10
7/23/2015
21
Place of articulation
(constriction of vocal tract)
22
11
7/23/2015
23
24
12
7/23/2015
PRAAT examples
25
26
13
7/23/2015
27
28
14
7/23/2015
29
Outline
Speech production (physiology)
Classification of sounds: articulatory, acoustic
Speech analysis (signal processing methods for
information extraction)
Hearing, and speech perception
Audio/music technology
30
15
7/23/2015
Text / References
Douglas O'Shaughnessy, Speech Communications:
Human and Machine, Universities Press (India) Ltd.,
2001
Rabiner and Schafer, Digital Processing of Speech
Signals
IITB Moodle for all course-related hand-outs
31
Evaluation
Computing assignments (Python preferred)
Exams: mid semester, end semester
32
16