The Digital Signal Processing Lab @ UCSD
The Digital Signal Processing Lab @ UCSD
Accepted/To Appear
1.S. Shivappa, B. D. Rao, and M. M. Trivedi, "Audio Visual Fusion and Tracking With Multilevel
Iterative Decoding: Framework and Experimental Evaluation," IEEE Journal of Selected Topics in
Signal Processing
2010
2.A. M.-Shirazi, W. Zhang, B. D. Rao, “Glimpsing Independent Vector Analysis: Separating More Sources Than Sensors Using Active and Inactive States,” IEEE International Conference on Acoustics, Speech, and Signal Processing, Dallas, Texas, Mar. 2010
2009
3.S. T. Shivappa, M. M. Trivedi, and B. D. Rao, "Hierarchical audio-visual cue integration framework for
activity analysis in intelligent meeting rooms", IEEE CVPR Joint Workshop for Visual and Contextual
Learning and Visual Scene Understanding, pages: 107-114, Jun. 2009
4.R.M. Hegde, J.Kurniawan, and B.D. Rao, “On the Design and Prototype Implementation of a Multimodal Situation Aware System,” Vol. 11, Issue 4, pages: 645 - 657, IEEE Transactions on Multimedia, Jun. 2009
5. S. T. Shivappa, B. D. Rao and M. M. Trivedi, "Role of Head Pose Estimation in Speech Acquisition from
Distant Microphones," IEEE International Conference on Acoustics, Speech, and Signal Processing,
Taipei, Taiwan, Apr. 2009
6.W. Zhang and B.D. Rao, "Two Microphone Based Direction of Arrival Estimation for Multiple Speech
Sources using Spectral Properties of Speech," IEEE International Conference on Acoustics, Speech, and
Signal Processing, Taipei, Taiwan, Apr. 2009
7.W. Zhang and B.D. Rao, "Combining Independent Component Analysis with Geometric Information and its Application to Speech Processing," IEEE International Conference on Acoustics, Speech, and Signal Processing, Taipei, Taiwan, Apr. 2009
8.A. M-Shirazi and B.D. Rao, "Independent Vector Analysis Incorporating Active and Inactive States," IEEE International Conference on Acoustics, Speech, and Signal Processing, Taipei, Taiwan, Apr. 2009
2008
9.S. T. Shivappa, M. M. Trivedi and B. D. Rao, " Person Tracking With Audio-visual Cues Using the Iterative Decoding Framework," IEEE International Conference on Advanced Video and Signal Surveillance, Santa Fe, New Mexico, Sep. 2008
10.S. T. Shivappa, B. D. Rao and M. M. Trivedi, "Multimodal Information Fusion Using the Iterative Decoding Algorithm and its Application to Audio-Visual Speech Recognition," IEEE International Conference on Acoustics, Speech, and Signal Processing, Las Vegas, Pages: 2241 – 2244, Apr. 2008
11.E. R. Duni and B. D. Rao, “Online Training Methods for Gaussian Mixture Vector Quantizers,” IEEE International Conference on Acoustics, Speech, and Signal Processing, Las Vegas, Pages: 4785 – 4788, Apr. 2008
12.S. T. Shivappa, B. D. Rao and M. M. Trivedi “An Iterative Decoding Algorithm for Fusion of Multimodal Information” EURASIP Journal on Advances in Signal Processing, Number: 478396, Feb. 2008
2007
13.E. R. Duni and B. D. Rao, “Performance of Speaker-Dependent Wideband Speech Coding,” Interspeech, Antwerp, Aug. 2007
14.R. Hegde, Y. Jin, and B. D. Rao, "Spectral Estimation of Voiced Speech Using a Family of MVDR Estimates," IEEE International Conference on Acoustics, Speech, and Signal Processing, Hawaii, Vol. 4, Pages: 1069 - 1072, Apr. 2007
15.E. R. Duni and B. D. Rao, "A High-Rate Optimal Transform Coder with Gaussian Mixture Companders," IEEE Transactions on Audio, Speech and Language Processing, Vol. 15, Issue 3, Pages: 770-783, Mar. 2007
16.E. R. Duni and B. D. Rao, "High-Rate Optimized Recursive Vector Quantization Structures Using Hidden Markov Models," IEEE Transactions on Audio, Speech and Language Processing, Vol. 15, Issue 3, Pages: 756-769, Mar. 2007
17.S. Dharanipragada, U. H. Yapanel, and B. D. Rao, "Robust Feature Extraction for Continuous Speech Recognition using the MVDR Spectrum Estimation Method," IEEE Transactions on Speech, Audio and Language Processing, Vol. 15, Issue 1, Pages: 224 - 234, Jan. 2007
2006
18.W. Zhang and B. D. Rao, "Robust Adaptive Beamformer with Feasibility Constraint on the Steering Vector," European Signal Processing Conference, Sep. 2006
19.R. M. Hegde, B.S. Manoj, B. D. Rao, and R. R. Rao, “Emotion Detection from Speech Signals and its Applications in Supporting Enhanced QoS in Emergency Response,” Third International Conference on Information Systems for Crisis Response and Management, Newark, USA, May. 2006
20.E. R. Duni and B. D. Rao, "High-Rate Design of Transform Coders with Gaussian Mixture Companders," IEEE International Conference on Acoustics, Speech, and Signal Processing, Tolouse, France, Vol. 1, Pages: 693 - 696, May. 2006
21.W. Zhang and B. D. Rao, "Robust Broadband Beam former With Diagonally Loaded Constraint Matrix and Its Application to Speech Recognition," IEEE International Conference on Acoustics, Speech, and Signal Processing, Tolouse, France, Vol. 1, Pages: 785 - 788, May. 2006
22.E. R. Duni and B. D. Rao, "High-Rate Training of Gaussian Mixture Vector Quantizers," Data Compression Conference, Page.1, Mar. 2006
23.A. D. Subramaniam, B. D. Rao, and W. R. Gardner, "Low-Complexity Source Coding Using Gaussian Mixture Models, Lattice Vector Quantization and Recursive Coding with Application to Speech Spectrum Quantization," IEEE Transactions on Speech and Audio Processing, Vol. 14, Issue. 2, Pages: 524 - 532, Mar. 2006
24.A. D. Subramaniam, B. D. Rao, and W. R. Gardner, "Iterative Joint Source-Channel Decoding of Speech Spectrum Parameters over an Additive White Gaussian Noise Channel," IEEE Transactions on Speech and Audio Processing, Vol. 14, Issue. 1, Pages: 152 - 162, Jan. 2006
2004
25.E. R. Duni, A. D. Subramaniam, and B. D. Rao, "Improved Quantization Structures Using Generalized HMM Modeling With Application to Wideband Speech Coding," IEEE International Conference on Acoustics, Speech, and Signal Processing, Pages: 161 - 164, May. 2004
26.A. D. Subramaniam, W. R. Gardner, B. D. Rao, “Joint Source-Channel Decoding of Speech Spectrum Parameters over an AWGN Channel Using Gaussian Mixture Models,” IEEE International Conference on Communications, Paris, France, Pages: 2847 – 2851, Vol. 5, Jun. 2004
2003
27.A. D. Subramaniam, W. R. Gardner, and B. D. Rao, “Joint Source-Channel Decoding of Speech Spectrum Parameters over Erasure Channels using Gaussian Mixture models,” IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 1, Pages: I-120 - I-123, Apr. 2003
28.A. D. Subramaniam and B. D. Rao, “PDF Optimized Parametric Vector Quantization of Speech Line Spectral Frequencies,” IEEE Transactions on Speech and Audio, Issue 2, Pages: 130-142, Mar. 2003
2002
29.A. D. Subramaniam, W. R. Gardner and B. D. Rao, "Speech Spectrum Quantization Using Gaussian Mixture Models and Multi Dimensional Companding", IEEE Speech Coding Workshop, Ibaraki, Japan, Pages: 5 - 7, Oct. 2002
30.W. R. Gardner, A. D. Subramaniam and B. D. Rao, "Comprehensive Evaluation of Theoretical Approximations for Spectral Quantization Performance", European Signal Processing Conference, Toulouse, France, Sep. 2002
31.A. D. Subramaniam, W. R. Gardner and B. D. Rao, “Low Complexity Recursive Coding of Spectrum Parameters,” IEEE International Conference on Acoustics, Speech and Signal Processing, Vol. 1, Pages: 637 -640, May. 2002
2001
32.A. D. Subramaniam and B. D. Rao, “Source Coding with Minimal and Rate-Independent Search and Memory Complexity, Data Compression Conference, Pages: 518-524, Mar. 2001
33.A. D. Subramaniam and B. D. Rao, “Speech LSF Quantization with Rate Independent Complexity, Bit Scalability and Learning,” IEEE International Conference on Acoustics, Speech and Signal Processing, Salt Lake City, Utah, Pages: 705-708, May. 2001
34.S. Dharanipragada and B. D. Rao, “MVDR Based Feature Extraction for Robust Speech Recognition,” IEEE International Conference on Acoustics, Speech and Signal Processing, Salt Lake City, Utah, Pages: 309-312, May. 2001
2000
35.A. D. Subramaniam, and B. D. Rao, “PDF Optimized Parametric Vector Quantization of Speech Line Spectral Frequencies,” IEEE Workshop on Speech Coding, Delavan, WI, Pages: 87-89, Sep. 2000
36.A. D. Subramaniam and B. D. Rao, “PDF Optimized Parametric Vector Quantization of Speech Line Spectral Frequencies," IEEE Asilomar Conference on Signals, Systems and Computers, Monterey, California, Vol. 2, Pages: 1475 - 1479, Nov. 2000
37.M. N. Murthi and B. D. Rao, “All-Pole Modeling of Voiced Speech Base on the Minimum Variance Distortionless Response Spectrum,” IEEE Transactions on Speech and Audio Processing, Pages: 221-239, May. 2000
1999
38.M. N. Murthi and B. D. Rao, “MVDR Based All-Pole Models for Spectral Coding of Speech,” IEEE International Conference on Acoustics, Speech and Signal Processing, Phoenix, AZ, Vol. 2, Pages: 669 - 672, Mar. 1999
39.M. N. Murthi and B. D. Rao, “MVDR Spectrum and Speech Modeling: A Tutorial,” Seventh Edition of the DSPtidende published by the Danish Society for Applied Digital Signal Processing, May. 1999
40.M. N. Murthi and B. D Rao, “MVDR Based All-Pole Modeling: Properties, Enhancements, and Comparison,” IEEE Workshop on Speech Coding, Pages: 31 -33, Jun. 1999
1998
41.M.N. Murthi, K. K-Delgado and B. D. Rao, “A New Algorithm and Entropy-like Measures for Sparse Coding,” Institute for Neural Computation, Vol. 8, Pages: 85-92, May. 1998
1997
42.M.N. Murthi and B. D. Rao, “Minimum Variance Distortionless Response (MDVR) Modeling of Voiced Speech,” IEEE International Conference on Acoustics, Speech and Signal Processing, Munich, Germany, Vol. 3, Pages: 1687 - 1690, Apr. 1997
43.M.N. Murthi and B. D. Rao, “All-Pole Model Parameter Estimation for Voiced Speech,” IEEE Workshop on Speech Coding for Telecommunications Proceedings, Pages: 17-18, 1997
44.M.N. Murthi and B. D. Rao, “All-Pole Modeling of Speech Based on the Minimum Variance Distortionless Response Spectrum,” IEEE Asilomar Conference on Signals, Systems and Computers, Monterey, CA, Vol. 2, Pages: 1061-1065, Nov. 1997
45.W. R. Gardner and B. D. Rao, “Noncausal All-Pole Modeling of Voiced Speech,” IEEE Transactions on Speech and Audio Processing, Vol. 5, No. 1, Pages: 1-10, Jan. 1997
1995
46.W. R. Gardner and B. D. Rao, “Theoretical Analysis of the High-Rate Vector Quantization of LPC Parameters,” IEEE Transactions on Speech and Audio Processing, Vol. 3, Issue: 5, Pages: 367-381, Sep. 1995
47.W. R. Gardner and B. D. Rao, “Optimal Distortion Measures for the High Rate Vector Quantization of LPC Parameters,” IEEE International Conference on Acoustics, Speech and Signal Processing, Detroit, Michigan, Vol. 1, Pages: 752 - 755, May. 1995
48.W. Y. Huang and B. D. Rao, “Channel and Noise Compensation for Text Dependent Speaker Verification over Telephone,” IEEE International Conference on Acoustics, Speech and Signal Processing, Detroit, Michigan, Vol. 1, Pages: 337 - 340, May. 1995
1994
49.W. R. Gardner and B. D. Rao, “Mixed-Phase AR Models for Voiced Speech and Perceptual Cost Functions,” Proc of the International Conference on Acoustics, Speech and Signal Processing, Adelaide, Australia, Vol. 1, Pages: 205 - 208, Apr. 1994
50.W. R. Gardner and B. D. Rao, “Analysis of High Rate LPC Vector Quantizers Designed by Minimizing Suboptimal Error Measures,” IEEE Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, Vol. 2, Pages: 1232 - 1236, Oct-Nov. 1994
1993
51.W. R. Gardner and B. D. Rao, “Non-Causal Linear Prediction of Voiced Speech,” IEEE Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, Pages: 1100-1104, Oct. 1992
1988
52.S. Dharanipragada, R. A. Gopinath and B. D. Rao, “Techniques for Capturing Temporal Variations in Speech Signals with Fixed-Rate Processing,” IEEE International Conference on Speech and Language Processing, Sydney, Australia, Nov. 1988