
Session T1C

Work in Progress - Teaching Speech Signal Processing and Coding using LabVIEW™
Andreas Spanias¹, Karthikeyan Natesan¹, Jayaraman Jayaraman¹, Photini Spanias²
¹Dept. Electrical Engineering, SenSIP Center, ²College of Education, Arizona State University
spanias@asu.edu

Abstract - This paper presents a visual programming and visualization tool for use in covering signal processing aspects of speech coding in a DSP class. The tool is based on the National Instruments LabVIEW™ environment. The framework of this tool was built using existing C code as a library along with LabVIEW’s native functionalities. Standardized linear predictive coding algorithms have been implemented; these are used to demonstrate in our DSP classes how digital filters and signal modeling are utilized in cellular and military communications. Experiments covered include the introduction of speech synthesis models, parameterization of speech in terms of filter and excitation parameters, and robustness of speech parameters to additive and channel noise. The tool provides capabilities for both quantitative and subjective assessment of the synthesized speech signal. An assessment process for the tool is in place and preliminary results will be presented at the conference.

Index Terms – educational software, LabVIEW, real-time DSP, speech coding.

INTRODUCTION

Speech coding is concerned with compact digital representations of voice signals for the purpose of efficient transmission or storage [1-6]. There have been several standardized speech coders that use the linear prediction representation of speech [7,8,9]. In particular, source-system analysis-synthesis systems that use all-pole filters to represent the vocal tract are to this day used in most standardized algorithms. In fact, the Levinson-Durbin linear prediction algorithm is embedded in every cell phone. We chose the linear prediction source-system paradigm for implementation because it can be connected to several concepts covered in DSP classes, including digital filter theory, estimation of periodicity, autocorrelation computation, filter stability, and estimation of non-stationary signal parameters. In our study we worked with the FS-1015 LPC (open-loop Linear Predictive Coding) and FS-1016 CELP (closed-loop, or Code Excited Linear Prediction) standards, and the full-rate ETSI GSM standard [1,7]. Although the Federal Standards 1015 and 1016 have been around for some time, they were chosen because they are open source and capture the basic aspects of speech compression.

The choice of LabVIEW [10] as a visual programming environment was motivated by the real-time signal acquisition capabilities of this environment and the presence of several native graphical functions that enable users to visualize different aspects and parameters of a speech coding algorithm. The block diagram approach also enables the users to understand the basic blocks of a speech analysis-synthesis system. This speech coding tool is scalable in the sense that additional options and capabilities could be added. Exercises that expose students to the non-stationarity of the speech signal, the all-pole spectral modeling performed by LPC analysis-synthesis, and the estimation of the pitch have been developed. We note that the tool is of value not only to undergraduate and graduate students but also to DSP practitioners. With some simplifications, the tool can also be used for demonstrations in high school science classes. Assessment instruments have been developed, and preliminary results from pre- and post-quizzes and summaries of student interviews will be presented at the conference.

THE LABVIEW SPEECH CODING TOOL

The graphical user interface of the LabVIEW speech coding tool is shown in Figure 1. Presently, the FS-1015 LPC 10e and the FS-1016 CELP 4.8 kbps standards have been implemented by interfacing the shared C libraries [10] with the framework developed in LabVIEW. The speech coder is represented using individual blocks for analysis (encoder) and synthesis (decoder). The tool also has additional features for real-time DSP, signal playback and graphical plots of different sets of parameters.

The software can access either an audio (‘.wav’) file or real-time speech input. The user also has options to change certain speech parameters to analyze the performance and behavior of the algorithm under different conditions. The preprocessed input speech is displayed and processed on a frame-by-frame basis. Frame-by-frame display is also used to view the excitation, the filter parameters, the spectrum of the preprocessed and decoded speech, the quantized LPC spectral envelopes, the pitch estimates, pole-zero plots of the synthesis filter, formants, speech synthesis waveforms, SNRs, etc. The software has options to save and read the input data and the coded data. The user can also analyze the subjective quality of these algorithms by listening to the synthesized speech with the aid of the playback feature.

UTILITY IN EDUCATION

The capabilities of this tool allow the students to experiment with a wide range of speech data, thereby understanding several aspects of the algorithms that are not immediately
1-4244-1084-3/07/$25.00 ©2007 IEEE October 10 – 13, 2007, Milwaukee, WI
37th ASEE/IEEE Frontiers in Education Conference
T1C-22

FIGURE 1. USER INTERFACE OF THE LABVIEW SPEECH CODING TOOL: a) FS-1015, b) FS-1016

evident from standard documentation. One significant extension planned is the integration of a digital signal processor interface with the LabVIEW tool. Such an addition will enable students to experiment in real time with fixed-point implementation issues without having to code the entire algorithm on the processor chip. In addition to the currently implemented FS-1015 LPC and FS-1016 CELP standards (Figure 1), the GSM RPE-LTP algorithm is also being implemented.

ASSESSMENT

A collection of experiments that utilize this LabVIEW tool was designed. We had a select group of students from our DSP class evaluate this tool and fill out assessment forms. Preliminary results will be presented at the conference.

REFERENCES

[1] A. Spanias, “Speech Coding: A Tutorial Review”, Proceedings of the IEEE, Vol. 82, Issue 10, Oct. 1994.
[2] V. Atti, “A Simulation Tool For Introducing Algebraic CELP (ACELP) Coding Concepts In A DSP Course”, IEEE 2002 DSP Workshop, Callaway, Georgia, Oct. 2002.
[3] A. Spanias, E. M. Painter, “A Software Tool for Introducing Speech Coding Fundamentals in a DSP Course”, IEEE Trans. on Education, Vol. 39, No. 2, pp. 143-152, May 1996.
[4] A. Spanias, T. Painter, V. Atti, Audio Signal Processing and Coding, ISBN: 0-471-79147-4, Wiley, Feb. 2007.
[5] A. Spanias, Digital Signal Processing; An Interactive Approach, ISBN: 978-1-4243-2524-5, Jan. 2007.
[6] V. Atti, “Interactive On-line Undergraduate Laboratories Using J-DSP”, IEEE Trans. on Education, Special Issue on Web-based Instruction, Vol. 48, No. 4, pp. 735-749, Nov. 2005.
[7] A. Spanias, Chapter 3: Speech Coding Standards, pp. 25-44, Invited, Academic Press, Ed: G. Gibson, ISBN 0-12-282160-2, 2000.
[8] “FS-1015 LPC C Code Implementation”, Available at World Wide Web: http://www.arl.wustl.edu/~jaf/lpc/
[9] “FS-1016 CELP C Code Implementation”, Available at World Wide Web: ftp://svr-ftp.eng.cam.ac.uk/comp.speech/coding/celp_3.2a.tar.Z
[10] “LabVIEW Fundamentals”, Available at World Wide Web: http://www.ni.com/pdf/manuals/374029a.pdf

ACKNOWLEDGEMENT

Portions of this work have been sponsored by the ASU SenSIP center and the NSF CCLI award 0443137.
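
The linear prediction concepts named in the Introduction (autocorrelation computation, the Levinson-Durbin recursion, filter stability, all-pole synthesis, and SNR measurement) can be illustrated with a short Python sketch. This is not the FS-1015 or FS-1016 reference code, nor the LabVIEW tool itself: all function names, the second-order toy filter, and the frame length are assumptions chosen only for illustration, and quantization, windowing, and pitch estimation are omitted.

```python
import math


def autocorrelation(frame, max_lag):
    """Return r[0..max_lag], the (unnormalized) autocorrelation of one frame."""
    n = len(frame)
    return [sum(frame[i] * frame[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the LPC normal equations by the Levinson-Durbin recursion.

    Returns (a, k, err), where a = [1, a1, ..., ap] defines the
    prediction-error filter A(z) = 1 + a1*z^-1 + ... + ap*z^-p,
    k holds the reflection coefficients, and err is the residual energy.
    """
    a = [1.0] + [0.0] * order
    k = []
    err = r[0]
    for m in range(1, order + 1):
        if err <= 0.0:  # silent or degenerate frame: stop early
            break
        acc = r[m] + sum(a[i] * r[m - i] for i in range(1, m))
        km = -acc / err
        prev = a[:]
        for i in range(1, m):
            a[i] = prev[i] + km * prev[m - i]
        a[m] = km
        k.append(km)
        err *= 1.0 - km * km
    return a, k, err


def synthesize(a, excitation):
    """Run the all-pole synthesis filter 1/A(z) over an excitation signal."""
    out = []
    for n, x in enumerate(excitation):
        feedback = sum(a[i] * out[n - i] for i in range(1, len(a)) if n - i >= 0)
        out.append(x - feedback)
    return out


def snr_db(reference, decoded):
    """Signal-to-noise ratio in dB between a reference and a decoded frame."""
    noise = sum((r - d) ** 2 for r, d in zip(reference, decoded))
    signal = sum(r * r for r in reference)
    return float("inf") if noise == 0.0 else 10.0 * math.log10(signal / noise)


if __name__ == "__main__":
    # Toy "vocal tract": two poles at radius 0.9, angle pi/4 (illustrative values).
    true_a = [1.0, -2 * 0.9 * math.cos(math.pi / 4), 0.81]
    impulse = [1.0] + [0.0] * 399
    frame = synthesize(true_a, impulse)

    a, k, err = levinson_durbin(autocorrelation(frame, 2), 2)
    print("estimated A(z) coefficients:", a)
    print("stable synthesis filter:", all(abs(km) < 1.0 for km in k))
    print("SNR of resynthesized frame (dB):", snr_db(frame, synthesize(a, impulse)))
```

In this sketch the stability check relies on the standard result that the synthesis filter 1/A(z) is stable when every reflection coefficient has magnitude below one. A frame-by-frame display like the one described for the tool would repeat the same analysis on each successive frame of the input speech.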

