Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Bessel basis
Senthil Murugan S∗ , Manoj C Rakesh Peter†
Sathiesh Kumaar L, Magesh S Computational Engg. and Networking
Electronics and Communication Engineering Amrita Vishwa Vidyapeetham
Amrita Vishwa Vidyapeetham Coimbatore, India
Coimbatore, India † rakesh.peter@gmail.com
∗ amrita.senthil@gmail.com
Abstract—Compressive sampling is an evolving approach for only for sparsely excited speech but also for voiced speech.
sampling signals at rates much less than Nyquist rate. For Com- We use ‘Basis pursuit’ framework [3] for CS recovery.
pressive sampling of signal, suitable basis function is required The paper is organised in the following order. In Section II,
which gives sparsity on transformation. This paper proposes com-
pressive sampling of speech signals by using Bessel function as we give the mathematical model for compressive sampling.
basis function for transformation of signal from time domain. By Section III describes the characteristics of Bessel function
using Bessel function as basis, it is shown in this paper that speech and Fourier Bessel transform for speech signal. Section IV
signal has been sampled at a rate less than the Nyquist rate. proposes the compressive sampling of speech signal using
The Signal-to-Distortion ratio (SDR) of reconstructed speech Bessel as basis function. The experiment details and results are
signals using Bessel basis and Sinusoidal basis are compared
for compressive sampling rate of 8000Hz, 6400Hz and 3200Hz. dealt in Section V. Finally, the conclusion is given in Section
Results shows that the proposed Bessel basis performance is far VI followed by overview of future work in Section VII.
superior to Sinusoidal basis in terms of signal reconstruction.
Index Terms—Sampling, Compressive sensing (CS), Bessel II. M ATHEMATICAL MODEL FOR C OMPRESSIVE SAMPLING
function, Sampling of speech signal
A. Compressive sampling
I. I NTRODUCTION Compressive sampling relies on the fact that many nat-
ural signals are sparse under suitable basis function for
The compression of speech signals by signal processing
sampling and reconstruction. Consider a discrete time sig-
techniques after sampling the analog signal at twice the
nal x of N samples as vector in vector space ℜN . Let
maximum frequency -Nyquist-Shannon sampling theorem-is
Ψ = [ψ1 |ψ2 |ψ3 . . . ψN ] be N x N basis matrix where
generally accepted and found in many literature. Though the
ψ1 , ψ2 , . . . ψN are basis vectors of space ℜN .
amount of data after compression is reduced, the data to be
sampled are generally large in these methods. To avoid this
x = Ψs (1)
resource(sensing) wastage, the work on compressive sampling
have been progressing in recent years. Compressive sam- where s = [s1 s2 . . . sN ] is a vector of length N containing
pling[1] takes only small amount of samples directly from projection coefficients si = hx, ψi i.
the sampling process for reconstruction of speech signal. The vector x is said to be K-sparse in Ψ domain if K
For compressive sampling of signal, the signal have to non-zero coefficients of s are enough to reconstruct x with
be sparse-expanded with suitable basis functions. Generally, minimal distortion. We observe sampling redundancy in signal
sinusoids are used as basis function for expansion of speech acquistion which will be removed by compressive sampling by
signals in speech recognition,speaker identification and veri- taking only M samples as follows:
fication,etc. But, the sinusoidal expansions i.e. Fourier trans- Let y be measurement vector of length M given by y=Φx
forms doesn’t give sparse coefficients on expansion. Though where Φ is an M x N measurment matrix and K < M << N .
there are many works on Compressive sampling of audio By equation (1), y is written as
signals [14], [17], [18], the papers on speech signals [13],
[14] are rare because of the above mentioned problem. In [13], y = Θx where Θ = ΦΨ (2)
compressive sampling is done only for sparsely excited speech
using Matching Pursuit method. The selection of Φ have to be done carefully in a way
In our paper, we propose Zeroth order Bessel function, that Φ and Ψ are incoherent which is the essential condition
which are decaying in nature, as the basis function for (Restricted Isometry Property) for reconstructing x from y [4].
compressive sampling of speech signals, as it captures the In papers [1], [2], [3], it is shown that Gaussian matrix of size
characteristics of speech better than sinusoids [9] and produces M x N satisfy isometry property for all most any Ψ with high
sparse coefficients. Our proposed basis function works not probability if M ≥ cKlog(N/K) for small constant c.
B. Reconstruction Algorithm Bessel Basis
1
N
The reconstruction of signal x in space ℜ is equivalent to
reconstruction of sparse vector s in Ψ domain by equation (1).
Unfortunately, finding exact s in ℜN which satisfy Θs′ = y is
not that trivial because of the following reason: Θs′ = y for
all s′ = s + r where vector r is in the Null space N (Θ) of Θ. 0.5
However, approximate solution can be obtained using l1 norm
Amplitude
minimization given by
tion
′′ ′
x2 y + xy + (x2 − n2 )y = 0, n > 0 (4) 0
0 0
−0.1 −0.1
0 50 100 150 200 250 300 350 0 50 100 150 200 250 300 350
Recovered signal taking only 160 samples − Bessel basis Recovered signal taking only 64 samples − Bessel basis
0.1 0.1
0 0
−0.1 −0.1
0 50 100 150 200 250 300 350 0 50 100 150 200 250 300 350
Recovered signal taking only 160 samples − Sinusoidal basis Recovered signal taking only 64 samples − Sinusoidal basis
0.1 0.1
0 0
−0.1 −0.1
0 50 100 150 200 250 300 350 0 50 100 150 200 250 300 350
chch chch
Fig. 3. Reconstructing speech signal using 160 samples. Fig. 5. Reconstructing speech signal using 64 samples.
−0.1
0 50 100 150 200 250 300 350
Recovered signal taking only 128 samples − Bessel basis
0.1
−0.1
0 50 100 150 200 250 300 350
Recovered signal taking only 128 samples − Sinusoidal basis
0.1
0
chch
−0.1
0 50 100 150 200 250 300 350 Fig. 6. SDR values of reconstructed speech signal for various M/N ratio
chch
Fig. 4. Reconstructing speech signal using 128 samples. VII. F UTURE WORK
Our proposed method is being applied for tracking people
The SDR values for reconstructed speech for various M/N in Wireless Sensor Networks. Using microphone sensors and
ratios are also provided for selecting Compressive sampling the person’s speech signal as identity, person tracking systems
rate according to desired quality. can be developed. Similar approach in [18] requires the use of
audio signal source attached to a person for tracking. In our
VI. S UMMARY AND C ONCLUSION future work, we are planning to use USRP based Software
In this paper, Compressive sampling of speech using Bessel Radio Basestation [21] to receive the compressed samples
function as basis is proposed. It is shown that recovered of speech signal from wireless microphone sensor nodes and
speech signal using Bessel basis has outperformed Sinusoidal compute the Bessel sparse coefficients by Convex optimization
basis (Fourier transform). It is important to notice that our technique in the base station PC. The estimated Bessel sparse
proposed algorithm is the first of its kind to perform well coefficients will be given to speaker identification system [10],
on speech audio.The computational complexity involved in [12] without transferring to time domain, opposed to the case
solving convex optimization problem for sparse vector is also in [16]. The Compressive sampling of speech signal has a
reduced in our proposed method due to real-valued Fourier lot of application in security systems using Wireless Sensor
Bessel coefficients. The quality of reconstructed speech can Network and our proposed system will help further research
be enhanced by improving the optimization techniques. in such areas.
R EFERENCES speech signals,” Final Rep. Summer Faculty Res. Program, Armstrong
Lab., AFOSR, Aug. 1993.
[1] Emamnuel J Candés, “Compressive sampling,” Proceedings of the In- [11] K. Gopalan and T. R. Anderson, “Speech processing using Bessel func-
ternational Congress of Mathematicians,vol. 3,pp. 1433-1452, Madrid, tions,” Proc. Symp. Intelligent Systems in Communications and Power,
Spain, 2006. Mayaguez, PR, Feb. 1994, pp. 255-259.
[2] Massimo Fornasier,Holger Rauhut, “Compressive Sensing,” Handbook of [12] K. Gopalan and T. R. Anderson, “Speaker Identification using Bessel
Mathematical Methods in Imaging (O. Scherzer Ed.), Springer, 2011. function representation and a back-propagation neural network,” Proc. of
[3] Emmanuel Candés, Justin Romberg, and Terence Tao, “Robust uncer- the IEEE International Symposium on Industrial Electronics, 1995.
tainity principles: Exact signal reconstruction from highly incomplete [13] T.V. Sreenivas, W. Batiaan Kleijn, “Compressive sensing for sparsely
frequency information,” IEEE Trans.on Information Theory, Vol. 52,no. excited speech signal,” Proc. IEEE Int. Conf. Acoust., Speech, Signal
2, pp.489-509, February 2006. Processing, 2009, pp.4125-4128.
[4] Richard Baraniuk, “Compressive sensing,” IEEE Signal Processing Mag- [14] M. G. Christensen, J. ∅stergaard, and S. H. Jensen, “On compressed
azine, Vol. 24,no. 4, pp.118-121, July 2007. sensing and its applications to speech and audio signals,” Asilomar Conf.,
[5] J. Haupt, R. Nowak, “Signal reconstruction from noisy random 2009.
projections,” IEEE Trans. Inform. Theory,” vol. 52, no.9, pp.4036- [15] E. Candès and J. Romberg, “l1-magic: Recovery of sparse signals via
4048,Sept.2006. convex programming,” code package available at www.l1-magic.org
[6] J. Tropp, A.C. Gilbert, “Signal recovery from partial information [16] Anthony Griffin, Elen Karamichali, “Speaker Identification using
via orthogonal matching pursuit,” Apr. 2005 [Online]. Available: sparsely excited speech signals and compressed sensing”, 18th European
http://www.personal.umich.edu/ jtropp/papers/TG06-Signal Recovery.pdf Signal Processing Conference, August 23-27, 2010.
[7] J. Schroeder, “Signal Processing via Fourier-Bessel series expansion,” [17] Laura Balzano, Robert Nowak, “Compressed Sensing Audio Demon-
Digital Signal Processing 3, 112-124, 1993. stration,” in website http://sunbeam.ece.wisc.edu/csaudio/
[8] K. Gopalan, T8. R. Anderson, “A Comparison of speaker identification [18] Anthony Griffin, Panagiotis Tsakalides, “Compressed sensing of Audio
results using features based on Cepstrum and Fourier-Bessel expansion,” signals using multiple sensors”,16th European Signal Processing Confer-
IEEE Transactions on Speech and Audio processing, Vol. 7, no. 3, May ence, August 25-29, 2008.
1999. [19] Chong Luo, Feng Wu, “Compressive Data Gathering for Large-Scale
[9] K. Gopalan, “Speech coding using Fourier-Bessel expansion of speech Wireless Sensor Networks”,in Mobicom 2009.
signals,” IECON’01: The 27th Annual Conference of the IEEE Industrial [20] S.S. Chen, D.L. Dohono, “Atomic decomposition by basis pursuit,”
Electronics Society. SIAM J.Sci. Comput., 20:33-61, 1999.
[10] K. Gopalan, “Speaker identification using Bessel function expansion of [21] Ettus Research LLC, 1043 North Shoreline Blvd., Suite 100, Mountain
View, CA 94043, www.ettus.com