Speech Processing


Accepted/To Appear

1.S. Shivappa, B. D. Rao, and M. M. Trivedi, "Audio Visual Fusion and Tracking With Multilevel Iterative Decoding: Framework and Experimental Evaluation," IEEE Journal of Selected Topics in Signal Processing

2010

2.A. M.-Shirazi, W. Zhang, B. D. Rao, “Glimpsing Independent Vector Analysis: Separating More Sources Than Sensors Using Active and Inactive States,” IEEE International Conference on Acoustics, Speech, and Signal Processing, Dallas, Texas, Mar. 2010

2009

3.S. T. Shivappa, M. M. Trivedi, and B. D. Rao, "Hierarchical audio-visual cue integration framework for activity analysis in intelligent meeting rooms", IEEE CVPR Joint Workshop for Visual and Contextual Learning and Visual Scene Understanding, pages: 107-114, Jun. 2009

4.R. M. Hegde, J. Kurniawan, and B. D. Rao, “On the Design and Prototype Implementation of a Multimodal Situation Aware System,” IEEE Transactions on Multimedia, Vol. 11, Issue 4, Pages: 645 - 657, Jun. 2009

5. S. T. Shivappa, B. D. Rao and M. M. Trivedi, "Role of Head Pose Estimation in Speech Acquisition from Distant Microphones," IEEE International Conference on Acoustics, Speech, and Signal Processing, Taipei, Taiwan, Apr. 2009

6.W. Zhang and B.D. Rao, "Two Microphone Based Direction of Arrival Estimation for Multiple Speech Sources using Spectral Properties of Speech," IEEE International Conference on Acoustics, Speech, and Signal Processing, Taipei, Taiwan, Apr. 2009

7.W. Zhang and B.D. Rao, "Combining Independent Component Analysis with Geometric Information and its Application to Speech Processing," IEEE International Conference on Acoustics, Speech, and Signal Processing, Taipei, Taiwan, Apr. 2009

8.A. M-Shirazi and B.D. Rao, "Independent Vector Analysis Incorporating Active and Inactive States," IEEE International Conference on Acoustics, Speech, and Signal Processing, Taipei, Taiwan, Apr. 2009

2008

9.S. T. Shivappa, M. M. Trivedi and B. D. Rao, "Person Tracking With Audio-visual Cues Using the Iterative Decoding Framework," IEEE International Conference on Advanced Video and Signal Surveillance, Santa Fe, New Mexico, Sep. 2008

10.S. T. Shivappa, B. D. Rao and M. M. Trivedi, "Multimodal Information Fusion Using the Iterative Decoding Algorithm and its Application to Audio-Visual Speech Recognition," IEEE International Conference on Acoustics, Speech, and Signal Processing, Las Vegas, Pages: 2241 – 2244, Apr. 2008

11.E. R. Duni and B. D. Rao, “Online Training Methods for Gaussian Mixture Vector Quantizers,” IEEE International Conference on Acoustics, Speech, and Signal Processing, Las Vegas, Pages: 4785 – 4788, Apr. 2008

12.S. T. Shivappa, B. D. Rao and M. M. Trivedi, “An Iterative Decoding Algorithm for Fusion of Multimodal Information,” EURASIP Journal on Advances in Signal Processing, Number: 478396, Feb. 2008

2007

13.E. R. Duni and B. D. Rao, “Performance of Speaker-Dependent Wideband Speech Coding,” Interspeech, Antwerp, Aug. 2007

14.R. Hegde, Y. Jin, and B. D. Rao, "Spectral Estimation of Voiced Speech Using a Family of MVDR Estimates," IEEE International Conference on Acoustics, Speech, and Signal Processing, Hawaii, Vol. 4, Pages: 1069 - 1072, Apr. 2007

15.E. R. Duni and B. D. Rao, "A High-Rate Optimal Transform Coder with Gaussian Mixture Companders," IEEE Transactions on Audio, Speech and Language Processing, Vol. 15, Issue 3, Pages: 770-783, Mar. 2007

16.E. R. Duni and B. D. Rao, "High-Rate Optimized Recursive Vector Quantization Structures Using Hidden Markov Models," IEEE Transactions on Audio, Speech and Language Processing, Vol. 15, Issue 3, Pages: 756-769, Mar. 2007

17.S. Dharanipragada, U. H. Yapanel, and B. D. Rao, "Robust Feature Extraction for Continuous Speech Recognition using the MVDR Spectrum Estimation Method," IEEE Transactions on Audio, Speech and Language Processing, Vol. 15, Issue 1, Pages: 224 - 234, Jan. 2007

2006

18.W. Zhang and B. D. Rao, "Robust Adaptive Beamformer with Feasibility Constraint on the Steering Vector," European Signal Processing Conference, Sep. 2006

19.R. M. Hegde, B.S. Manoj, B. D. Rao, and R. R. Rao, “Emotion Detection from Speech Signals and its Applications in Supporting Enhanced QoS in Emergency Response,” Third International Conference on Information Systems for Crisis Response and Management, Newark, USA, May. 2006

20.E. R. Duni and B. D. Rao, "High-Rate Design of Transform Coders with Gaussian Mixture Companders," IEEE International Conference on Acoustics, Speech, and Signal Processing, Toulouse, France, Vol. 1, Pages: 693 - 696, May. 2006

21.W. Zhang and B. D. Rao, "Robust Broadband Beamformer With Diagonally Loaded Constraint Matrix and Its Application to Speech Recognition," IEEE International Conference on Acoustics, Speech, and Signal Processing, Toulouse, France, Vol. 1, Pages: 785 - 788, May. 2006

22.E. R. Duni and B. D. Rao, "High-Rate Training of Gaussian Mixture Vector Quantizers," Data Compression Conference, Page.1, Mar. 2006

23.A. D. Subramaniam, B. D. Rao, and W. R. Gardner, "Low-Complexity Source Coding Using Gaussian Mixture Models, Lattice Vector Quantization and Recursive Coding with Application to Speech Spectrum Quantization," IEEE Transactions on Speech and Audio Processing, Vol. 14, Issue. 2, Pages: 524 - 532, Mar. 2006

24.A. D. Subramaniam, B. D. Rao, and W. R. Gardner, "Iterative Joint Source-Channel Decoding of Speech Spectrum Parameters over an Additive White Gaussian Noise Channel," IEEE Transactions on Speech and Audio Processing, Vol. 14, Issue. 1, Pages: 152 - 162, Jan. 2006

2004

25.E. R. Duni, A. D. Subramaniam, and B. D. Rao, "Improved Quantization Structures Using Generalized HMM Modeling With Application to Wideband Speech Coding," IEEE International Conference on Acoustics, Speech, and Signal Processing, Pages: 161 - 164, May. 2004

26.A. D. Subramaniam, W. R. Gardner, B. D. Rao, “Joint Source-Channel Decoding of Speech Spectrum Parameters over an AWGN Channel Using Gaussian Mixture Models,” IEEE International Conference on Communications, Paris, France, Pages: 2847 – 2851, Vol. 5, Jun. 2004

2003

27.A. D. Subramaniam, W. R. Gardner, and B. D. Rao, “Joint Source-Channel Decoding of Speech Spectrum Parameters over Erasure Channels using Gaussian Mixture Models,” IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 1, Pages: I-120 - I-123, Apr. 2003

28.A. D. Subramaniam and B. D. Rao, “PDF Optimized Parametric Vector Quantization of Speech Line Spectral Frequencies,” IEEE Transactions on Speech and Audio Processing, Issue 2, Pages: 130-142, Mar. 2003

2002

29.A. D. Subramaniam, W. R. Gardner and B. D. Rao, "Speech Spectrum Quantization Using Gaussian Mixture Models and Multi Dimensional Companding", IEEE Speech Coding Workshop, Ibaraki, Japan, Pages: 5 - 7, Oct. 2002

30.W. R. Gardner, A. D. Subramaniam and B. D. Rao, "Comprehensive Evaluation of Theoretical Approximations for Spectral Quantization Performance", European Signal Processing Conference, Toulouse, France, Sep. 2002

31.A. D. Subramaniam, W. R. Gardner and B. D. Rao, “Low Complexity Recursive Coding of Spectrum Parameters,” IEEE International Conference on Acoustics, Speech and Signal Processing, Vol. 1, Pages: 637 - 640, May. 2002

2001

32.A. D. Subramaniam and B. D. Rao, “Source Coding with Minimal and Rate-Independent Search and Memory Complexity,” Data Compression Conference, Pages: 518-524, Mar. 2001

33.A. D. Subramaniam and B. D. Rao, “Speech LSF Quantization with Rate Independent Complexity, Bit Scalability and Learning,” IEEE International Conference on Acoustics, Speech and Signal Processing, Salt Lake City, Utah, Pages: 705-708, May. 2001

34.S. Dharanipragada and B. D. Rao, “MVDR Based Feature Extraction for Robust Speech Recognition,” IEEE International Conference on Acoustics, Speech and Signal Processing, Salt Lake City, Utah, Pages: 309-312, May. 2001

2000

35.A. D. Subramaniam and B. D. Rao, “PDF Optimized Parametric Vector Quantization of Speech Line Spectral Frequencies,” IEEE Workshop on Speech Coding, Delavan, WI, Pages: 87-89, Sep. 2000

36.A. D. Subramaniam and B. D. Rao, “PDF Optimized Parametric Vector Quantization of Speech Line Spectral Frequencies,” IEEE Asilomar Conference on Signals, Systems and Computers, Monterey, California, Vol. 2, Pages: 1475 - 1479, Nov. 2000

37.M. N. Murthi and B. D. Rao, “All-Pole Modeling of Voiced Speech Based on the Minimum Variance Distortionless Response Spectrum,” IEEE Transactions on Speech and Audio Processing, Pages: 221-239, May. 2000

1999

38.M. N. Murthi and B. D. Rao, “MVDR Based All-Pole Models for Spectral Coding of Speech,” IEEE International Conference on Acoustics, Speech and Signal Processing, Phoenix, AZ, Vol. 2, Pages: 669 - 672, Mar. 1999

39.M. N. Murthi and B. D. Rao, “MVDR Spectrum and Speech Modeling: A Tutorial,” Seventh Edition of the DSPtidende published by the Danish Society for Applied Digital Signal Processing, May. 1999

40.M. N. Murthi and B. D. Rao, “MVDR Based All-Pole Modeling: Properties, Enhancements, and Comparison,” IEEE Workshop on Speech Coding, Pages: 31 - 33, Jun. 1999

1998

41.M.N. Murthi, K. K-Delgado and B. D. Rao, “A New Algorithm and Entropy-like Measures for Sparse Coding,” Institute for Neural Computation, Vol. 8, Pages: 85-92, May. 1998

1997

42.M.N. Murthi and B. D. Rao, “Minimum Variance Distortionless Response (MVDR) Modeling of Voiced Speech,” IEEE International Conference on Acoustics, Speech and Signal Processing, Munich, Germany, Vol. 3, Pages: 1687 - 1690, Apr. 1997

43.M.N. Murthi and B. D. Rao, “All-Pole Model Parameter Estimation for Voiced Speech,” IEEE Workshop on Speech Coding for Telecommunications Proceedings, Pages: 17-18, 1997

44.M.N. Murthi and B. D. Rao, “All-Pole Modeling of Speech Based on the Minimum Variance Distortionless Response Spectrum,” IEEE Asilomar Conference on Signals, Systems and Computers, Monterey, CA, Vol. 2, Pages: 1061-1065, Nov. 1997

45.W. R. Gardner and B. D. Rao, “Noncausal All-Pole Modeling of Voiced Speech,” IEEE Transactions on Speech and Audio Processing, Vol. 5, No. 1, Pages: 1-10, Jan. 1997

1995

46.W. R. Gardner and B. D. Rao, “Theoretical Analysis of the High-Rate Vector Quantization of LPC Parameters,” IEEE Transactions on Speech and Audio Processing, Vol. 3, Issue: 5, Pages: 367-381, Sep. 1995

47.W. R. Gardner and B. D. Rao, “Optimal Distortion Measures for the High Rate Vector Quantization of LPC Parameters,” IEEE International Conference on Acoustics, Speech and Signal Processing, Detroit, Michigan, Vol. 1, Pages: 752 - 755, May. 1995

48.W. Y. Huang and B. D. Rao, “Channel and Noise Compensation for Text Dependent Speaker Verification over Telephone,” IEEE International Conference on Acoustics, Speech and Signal Processing, Detroit, Michigan, Vol. 1, Pages: 337 - 340, May. 1995

1994

49.W. R. Gardner and B. D. Rao, “Mixed-Phase AR Models for Voiced Speech and Perceptual Cost Functions,” Proc of the International Conference on Acoustics, Speech and Signal Processing, Adelaide, Australia, Vol. 1, Pages: 205 - 208, Apr. 1994

50.W. R. Gardner and B. D. Rao, “Analysis of High Rate LPC Vector Quantizers Designed by Minimizing Suboptimal Error Measures,” IEEE Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, Vol. 2, Pages: 1232 - 1236, Oct-Nov. 1994

1993

51.W. R. Gardner and B. D. Rao, “Non-Causal Linear Prediction of Voiced Speech,” IEEE Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, Pages: 1100-1104, Oct. 1992

1988

52.S. Dharanipragada, R. A. Gopinath and B. D. Rao, “Techniques for Capturing Temporal Variations in Speech Signals with Fixed-Rate Processing,” IEEE International Conference on Speech and Language Processing, Sydney, Australia, Nov. 1988