Assignment 1: Digital Audio (voice/ speech) processing
EC60601: Digital voice and picture communication
(Jul. 28, 2016) I) Audio processing study 1. Capture (or record) audio using a microphone, reading out your name. Plot the time domain signal and the corresponding frequency spectrum. 2. Using any audio file containing music, plot the time domain signal and the corresponding frequency spectrum. 3. Vary the sampling rate (choose three different sampling rates) for the captured audio and plot the distortion for PCM (uniform), companded PCM, DM, ADM, CELP (open and closed loop). Make a note of the corresponding data rates. 4. Select any 5 audio file formats and transcode the captured audio into these file formats. Observe and comment on the corresponding distortion and data rate. II) Research in the domain of voice (speech) processing/communication Read the paper (given in the following list) that has been mentioned against your group no. Write atleast a two-page technical write-up in your own words. (Please don't cut-copy-paste from the paper or any other source, assignment will not be evaluated if content is found to have been copied). Prepare a 5-10 min presentation on the given paper's work, in not more than 7-10 slides. Try to cover the motivation, key contributions, main claims (findings), methodology (approach), and evaluation parameters, both in the write-up as well as presentation. Group Title of the paper
Published in
Packet Loss Concealment Based on Deep Neural
Networks for Digital Speech Transmission
IEEE/ACM Trans. on Audio, speech, and
language processing, 2016
Softbit Speech Decoding: A New Approach
to Error Concealment
IEEE Trans. on speech and audio processing,
2001
Speech Codecs for High-Quality Voice
over ZigBee Applications: Evaluation and Implementation Challenges
IEEE Communications Magazine April 2012
Low-Complexity Compression for Sensory Systems
IEEE Trans. on circuits and systems-II: express
briefs, 2015
Source-Optimized Channel Coding for
Digital Transmission Channels
IEEE Trans. on commun, 2005
A Joint Source-Channel Speech Coder Using Hybrid
DigitalAnalog (HDA) Modulation
IEEE Trans. on speech and audio processing,
2002
Reverberant Speech Enhancement by Temporal and
Spectral Processing
IEEE Transactions on Audio, Speech, and
Language Processing, 2009
Interpolation of Lost Speech Segments Using LP-HNM
Model With Codebook Post-Processing
IEEE Transactions on Multimedia, 2008
Speech-Centric Information Processing: An
Optimization-Oriented Approach
Proceedings of the IEEE, 2013
10
Multiple Descriptions and Path Diversity for Voice
Communications Over Wireless Mesh Networks
IEEE Transactions on Multimedia, 2007
11
ASM: Adaptive Voice Stream Multicast over Low-Power