The Digital Signal Processing Lab @ UCSD

 
 

Accepted/To Appear


1. S. Shivappa, B. D. Rao, and M. M. Trivedi, "Audio Visual Fusion and Tracking With Multilevel Iterative Decoding: Framework and Experimental Evaluation," IEEE Journal of Selected Topics in Signal Processing


2010


2. A. M.-Shirazi, W. Zhang, B. D. Rao, “Glimpsing Independent Vector Analysis: Separating More Sources Than Sensors Using Active and Inactive States,” IEEE International Conference on Acoustics, Speech, and Signal Processing, Dallas, Texas, Mar. 2010


2009


3. S. T. Shivappa, M. M. Trivedi, and B. D. Rao, "Hierarchical Audio-Visual Cue Integration Framework for Activity Analysis in Intelligent Meeting Rooms," IEEE CVPR Joint Workshop for Visual and Contextual Learning and Visual Scene Understanding, Pages: 107-114, Jun. 2009


4. R. M. Hegde, J. Kurniawan, and B. D. Rao, “On the Design and Prototype Implementation of a Multimodal Situation Aware System,” IEEE Transactions on Multimedia, Vol. 11, Issue 4, Pages: 645 - 657, Jun. 2009


5. S. T. Shivappa, B. D. Rao and M. M. Trivedi, "Role of Head Pose Estimation in Speech Acquisition from Distant Microphones," IEEE International Conference on Acoustics, Speech, and Signal Processing, Taipei, Taiwan, Apr. 2009


6. W. Zhang and B. D. Rao, "Two Microphone Based Direction of Arrival Estimation for Multiple Speech Sources using Spectral Properties of Speech," IEEE International Conference on Acoustics, Speech, and Signal Processing, Taipei, Taiwan, Apr. 2009


7. W. Zhang and B. D. Rao, "Combining Independent Component Analysis with Geometric Information and its Application to Speech Processing," IEEE International Conference on Acoustics, Speech, and Signal Processing, Taipei, Taiwan, Apr. 2009


8. A. M-Shirazi and B. D. Rao, "Independent Vector Analysis Incorporating Active and Inactive States," IEEE International Conference on Acoustics, Speech, and Signal Processing, Taipei, Taiwan, Apr. 2009




2008


9. S. T. Shivappa, M. M. Trivedi and B. D. Rao, "Person Tracking With Audio-visual Cues Using the Iterative Decoding Framework," IEEE International Conference on Advanced Video and Signal Surveillance, Santa Fe, New Mexico, Sep. 2008


10. S. T. Shivappa, B. D. Rao and M. M. Trivedi, "Multimodal Information Fusion Using the Iterative Decoding Algorithm and its Application to Audio-Visual Speech Recognition," IEEE International Conference on Acoustics, Speech, and Signal Processing, Las Vegas, Pages: 2241 – 2244, Apr. 2008


11. E. R. Duni and B. D. Rao, “Online Training Methods for Gaussian Mixture Vector Quantizers,” IEEE International Conference on Acoustics, Speech, and Signal Processing, Las Vegas, Pages: 4785 – 4788, Apr. 2008


12. S. T. Shivappa, B. D. Rao and M. M. Trivedi, “An Iterative Decoding Algorithm for Fusion of Multimodal Information,” EURASIP Journal on Advances in Signal Processing, Number: 478396, Feb. 2008



2007



13. E. R. Duni and B. D. Rao, “Performance of Speaker-Dependent Wideband Speech Coding,” Interspeech, Antwerp, Aug. 2007


14. R. Hegde, Y. Jin, and B. D. Rao, "Spectral Estimation of Voiced Speech Using a Family of MVDR Estimates," IEEE International Conference on Acoustics, Speech, and Signal Processing, Hawaii, Vol. 4, Pages: 1069 - 1072, Apr. 2007


15. E. R. Duni and B. D. Rao, "A High-Rate Optimal Transform Coder with Gaussian Mixture Companders," IEEE Transactions on Audio, Speech and Language Processing, Vol. 15, Issue 3, Pages: 770-783, Mar. 2007


16. E. R. Duni and B. D. Rao, "High-Rate Optimized Recursive Vector Quantization Structures Using Hidden Markov Models," IEEE Transactions on Audio, Speech and Language Processing, Vol. 15, Issue 3, Pages: 756-769, Mar. 2007


17. S. Dharanipragada, U. H. Yapanel, and B. D. Rao, "Robust Feature Extraction for Continuous Speech Recognition using the MVDR Spectrum Estimation Method," IEEE Transactions on Audio, Speech and Language Processing, Vol. 15, Issue 1, Pages: 224 - 234, Jan. 2007



2006



18. W. Zhang and B. D. Rao, "Robust Adaptive Beamformer with Feasibility Constraint on the Steering Vector," European Signal Processing Conference, Sep. 2006


19. R. M. Hegde, B. S. Manoj, B. D. Rao, and R. R. Rao, “Emotion Detection from Speech Signals and its Applications in Supporting Enhanced QoS in Emergency Response,” Third International Conference on Information Systems for Crisis Response and Management, Newark, USA, May 2006


20. E. R. Duni and B. D. Rao, "High-Rate Design of Transform Coders with Gaussian Mixture Companders," IEEE International Conference on Acoustics, Speech, and Signal Processing, Toulouse, France, Vol. 1, Pages: 693 - 696, May 2006


21. W. Zhang and B. D. Rao, "Robust Broadband Beamformer With Diagonally Loaded Constraint Matrix and Its Application to Speech Recognition," IEEE International Conference on Acoustics, Speech, and Signal Processing, Toulouse, France, Vol. 1, Pages: 785 - 788, May 2006


22. E. R. Duni and B. D. Rao, "High-Rate Training of Gaussian Mixture Vector Quantizers," Data Compression Conference, Page: 1, Mar. 2006


23. A. D. Subramaniam, B. D. Rao, and W. R. Gardner, "Low-Complexity Source Coding Using Gaussian Mixture Models, Lattice Vector Quantization and Recursive Coding with Application to Speech Spectrum Quantization," IEEE Transactions on Speech and Audio Processing, Vol. 14, Issue 2, Pages: 524 - 532, Mar. 2006


24. A. D. Subramaniam, B. D. Rao, and W. R. Gardner, "Iterative Joint Source-Channel Decoding of Speech Spectrum Parameters over an Additive White Gaussian Noise Channel," IEEE Transactions on Speech and Audio Processing, Vol. 14, Issue 1, Pages: 152 - 162, Jan. 2006



2004



25. E. R. Duni, A. D. Subramaniam, and B. D. Rao, "Improved Quantization Structures Using Generalized HMM Modeling With Application to Wideband Speech Coding," IEEE International Conference on Acoustics, Speech, and Signal Processing, Pages: 161 - 164, May 2004


26. A. D. Subramaniam, W. R. Gardner, B. D. Rao, “Joint Source-Channel Decoding of Speech Spectrum Parameters over an AWGN Channel Using Gaussian Mixture Models,” IEEE International Conference on Communications, Paris, France, Vol. 5, Pages: 2847 – 2851, Jun. 2004



2003


27. A. D. Subramaniam, W. R. Gardner, and B. D. Rao, “Joint Source-Channel Decoding of Speech Spectrum Parameters over Erasure Channels using Gaussian Mixture Models,” IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 1, Pages: I-120 - I-123, Apr. 2003


28. A. D. Subramaniam and B. D. Rao, “PDF Optimized Parametric Vector Quantization of Speech Line Spectral Frequencies,” IEEE Transactions on Speech and Audio Processing, Issue 2, Pages: 130-142, Mar. 2003



2002



29. A. D. Subramaniam, W. R. Gardner and B. D. Rao, "Speech Spectrum Quantization Using Gaussian Mixture Models and Multi Dimensional Companding," IEEE Speech Coding Workshop, Ibaraki, Japan, Pages: 5 - 7, Oct. 2002


30. W. R. Gardner, A. D. Subramaniam and B. D. Rao, "Comprehensive Evaluation of Theoretical Approximations for Spectral Quantization Performance," European Signal Processing Conference, Toulouse, France, Sep. 2002


31. A. D. Subramaniam, W. R. Gardner and B. D. Rao, “Low Complexity Recursive Coding of Spectrum Parameters,” IEEE International Conference on Acoustics, Speech and Signal Processing, Vol. 1, Pages: 637 - 640, May 2002



2001



32. A. D. Subramaniam and B. D. Rao, “Source Coding with Minimal and Rate-Independent Search and Memory Complexity,” Data Compression Conference, Pages: 518-524, Mar. 2001


33. A. D. Subramaniam and B. D. Rao, “Speech LSF Quantization with Rate Independent Complexity, Bit Scalability and Learning,” IEEE International Conference on Acoustics, Speech and Signal Processing, Salt Lake City, Utah, Pages: 705-708, May 2001


34. S. Dharanipragada and B. D. Rao, “MVDR Based Feature Extraction for Robust Speech Recognition,” IEEE International Conference on Acoustics, Speech and Signal Processing, Salt Lake City, Utah, Pages: 309-312, May 2001



2000



35. A. D. Subramaniam and B. D. Rao, “PDF Optimized Parametric Vector Quantization of Speech Line Spectral Frequencies,” IEEE Workshop on Speech Coding, Delavan, WI, Pages: 87-89, Sep. 2000


36. A. D. Subramaniam and B. D. Rao, “PDF Optimized Parametric Vector Quantization of Speech Line Spectral Frequencies,” IEEE Asilomar Conference on Signals, Systems and Computers, Monterey, California, Vol. 2, Pages: 1475 - 1479, Nov. 2000


37. M. N. Murthi and B. D. Rao, “All-Pole Modeling of Voiced Speech Based on the Minimum Variance Distortionless Response Spectrum,” IEEE Transactions on Speech and Audio Processing, Pages: 221-239, May 2000



1999



38. M. N. Murthi and B. D. Rao, “MVDR Based All-Pole Models for Spectral Coding of Speech,” IEEE International Conference on Acoustics, Speech and Signal Processing, Phoenix, AZ, Vol. 2, Pages: 669 - 672, Mar. 1999


39. M. N. Murthi and B. D. Rao, “MVDR Spectrum and Speech Modeling: A Tutorial,” Seventh Edition of the DSPtidende published by the Danish Society for Applied Digital Signal Processing, May 1999


40. M. N. Murthi and B. D. Rao, “MVDR Based All-Pole Modeling: Properties, Enhancements, and Comparison,” IEEE Workshop on Speech Coding, Pages: 31 - 33, Jun. 1999



1998



41. M. N. Murthi, K. K-Delgado and B. D. Rao, “A New Algorithm and Entropy-like Measures for Sparse Coding,” Institute for Neural Computation, Vol. 8, Pages: 85-92, May 1998



1997



42. M. N. Murthi and B. D. Rao, “Minimum Variance Distortionless Response (MVDR) Modeling of Voiced Speech,” IEEE International Conference on Acoustics, Speech and Signal Processing, Munich, Germany, Vol. 3, Pages: 1687 - 1690, Apr. 1997


43. M. N. Murthi and B. D. Rao, “All-Pole Model Parameter Estimation for Voiced Speech,” IEEE Workshop on Speech Coding for Telecommunications Proceedings, Pages: 17-18, 1997


44. M. N. Murthi and B. D. Rao, “All-Pole Modeling of Speech Based on the Minimum Variance Distortionless Response Spectrum,” IEEE Asilomar Conference on Signals, Systems and Computers, Monterey, CA, Vol. 2, Pages: 1061-1065, Nov. 1997


45. W. R. Gardner and B. D. Rao, “Noncausal All-Pole Modeling of Voiced Speech,” IEEE Transactions on Speech and Audio Processing, Vol. 5, No. 1, Pages: 1-10, Jan. 1997



1995



46. W. R. Gardner and B. D. Rao, “Theoretical Analysis of the High-Rate Vector Quantization of LPC Parameters,” IEEE Transactions on Speech and Audio Processing, Vol. 3, Issue 5, Pages: 367-381, Sep. 1995


47. W. R. Gardner and B. D. Rao, “Optimal Distortion Measures for the High Rate Vector Quantization of LPC Parameters,” IEEE International Conference on Acoustics, Speech and Signal Processing, Detroit, Michigan, Vol. 1, Pages: 752 - 755, May 1995


48. W. Y. Huang and B. D. Rao, “Channel and Noise Compensation for Text Dependent Speaker Verification over Telephone,” IEEE International Conference on Acoustics, Speech and Signal Processing, Detroit, Michigan, Vol. 1, Pages: 337 - 340, May 1995



1994 



49. W. R. Gardner and B. D. Rao, “Mixed-Phase AR Models for Voiced Speech and Perceptual Cost Functions,” Proc. of the International Conference on Acoustics, Speech and Signal Processing, Adelaide, Australia, Vol. 1, Pages: 205 - 208, Apr. 1994


50. W. R. Gardner and B. D. Rao, “Analysis of High Rate LPC Vector Quantizers Designed by Minimizing Suboptimal Error Measures,” IEEE Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, Vol. 2, Pages: 1232 - 1236, Oct-Nov. 1994



1993 



51. W. R. Gardner and B. D. Rao, “Non-Causal Linear Prediction of Voiced Speech,” IEEE Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, Pages: 1100-1104, Oct. 1992



1988



52. S. Dharanipragada, R. A. Gopinath and B. D. Rao, “Techniques for Capturing Temporal Variations in Speech Signals with Fixed-Rate Processing,” IEEE International Conference on Speech and Language Processing, Sydney, Australia, Nov. 1988