Sei sulla pagina 1di 1

Assignment 1: Digital Audio (voice/ speech) processing

EC60601: Digital voice and picture communication


(Jul. 28, 2016)
I) Audio processing study
1. Capture (or record) audio using a microphone, reading out your name. Plot the time domain
signal and the corresponding frequency spectrum.
2. Using any audio file containing music, plot the time domain signal and the corresponding
frequency spectrum.
3. Vary the sampling rate (choose three different sampling rates) for the captured audio and plot the
distortion for PCM (uniform), companded PCM, DM, ADM, CELP (open and closed loop). Make a
note of the corresponding data rates.
4. Select any 5 audio file formats and transcode the captured audio into these file formats. Observe
and comment on the corresponding distortion and data rate.
II) Research in the domain of voice (speech) processing/communication
Read the paper (given in the following list) that has been mentioned against your group no. Write
atleast a two-page technical write-up in your own words. (Please don't cut-copy-paste from the
paper or any other source, assignment will not be evaluated if content is found to have been copied).
Prepare a 5-10 min presentation on the given paper's work, in not more than 7-10 slides. Try to
cover the motivation, key contributions, main claims (findings), methodology (approach), and
evaluation parameters, both in the write-up as well as presentation.
Group Title of the paper

Published in

Packet Loss Concealment Based on Deep Neural


Networks for Digital Speech Transmission

IEEE/ACM Trans. on Audio, speech, and


language processing, 2016

Softbit Speech Decoding: A New Approach


to Error Concealment

IEEE Trans. on speech and audio processing,


2001

Speech Codecs for High-Quality Voice


over ZigBee Applications: Evaluation
and Implementation Challenges

IEEE Communications Magazine April 2012

Low-Complexity Compression for Sensory Systems

IEEE Trans. on circuits and systems-II: express


briefs, 2015

Source-Optimized Channel Coding for


Digital Transmission Channels

IEEE Trans. on commun, 2005

A Joint Source-Channel Speech Coder Using Hybrid


DigitalAnalog (HDA) Modulation

IEEE Trans. on speech and audio processing,


2002

Reverberant Speech Enhancement by Temporal and


Spectral Processing

IEEE Transactions on Audio, Speech, and


Language Processing, 2009

Interpolation of Lost Speech Segments Using LP-HNM


Model With Codebook Post-Processing

IEEE Transactions on Multimedia, 2008

Speech-Centric Information Processing: An


Optimization-Oriented Approach

Proceedings of the IEEE, 2013

10

Multiple Descriptions and Path Diversity for Voice


Communications Over Wireless Mesh Networks

IEEE Transactions on Multimedia, 2007

11

ASM: Adaptive Voice Stream Multicast over Low-Power


Wireless Networks

IEEE Transactions on Parallel and Distributed


Systems, 2011

Useful software tools to implement I:


1. ffmpeg for transcoding
2. Matlab and simulink

Potrebbero piacerti anche