...::Mehran University Research Journal of Engineering & Technology::...

Article Information

Using Reversed MFCC and IT-EM for Automatic Speaker Verification

Keywords: Information Theory, Expectation Maximization, MFCC, Gaussian Mixture Model, Speaker Verification.

Mehran University Research Journal of Engineering & Technology

Volume 31 , Issue 1

Sheeraz Memon , Sania Bhatti , Tariq Jamil Saifullah Khanzada ,

References

1.	Quatieri, T.F., "Discrete-Time Speech Signal Processing Principles and Practice", Prentice Hall, 2002.
2.	Liu, Y., Russell, M., and Carey, M.," The Role of Dynamic Features in Text-Dependent and Independent Speaker Verification", IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Volume 1, May, 2006.
3.	Reynolds, D.A., Quatieri, T., and Dunn, R., "Speaker Verification Using Adapted Gaussian Mixture Models", Digital Signal Processing, Volume 10, No. 1, pp. 19-41, 2000.
4.	Bozkurt, E., Erzin, E., Erdem, C.E., and Erdem, A.T., "Automatic Emotion Recognition for Facial Expression Animation from Speech", IEEE Conference on Signal Processing and Communications, pp. 989-992, 2009.
5.	Lehn-Schiøler, T., Hegde, A., Erdogmuz, D., and Principe, J., "Vector-Quantization Using Information Theoretic Concepts", Natural Computing, Voume 4, No. 1, pp. 39-51, January, 2005.
6.	Zwicker, E., and Terhardt, E., "Analytical Expressions for Critical Band Rate and Critical Bandwidth as a Function of Frequency", Journal of Acoustical Society of America, Volume 68, No. 5, pp. 1523-1525, 1980.
7.	Scherer, K.R., Johnstone, T., Klasmeyer, G., and Bänziger, T., "Can Automatic Speaker Verification be Improved by Training the Algorithms on Emotional Speech", University of Geneva, Switzerland, 2000.
8.	Davis, S.B., and Mermelstein, P., "Comparison of Parametric Representation for Monosyllabic Word Recognition in Continuously Spoken Sentence", IEEE Transactions on Acoustic Speech and Signal Processing, Volume 28, No. 4, pp. 357-366, 1980.
9.	Reynolds, D.A., "Speaker Identification and Verification Using Gaussian Mixture Speaker Models", Speech Communication, Volume 17, pp. 91-108, 1995.
10.	Chakroborty, S., Roy, A., Majumdar, S., and Saha, G., "Capturing Complementary Information via Reversed Filter Bank and Parallel Implementation with MFCC for Improved Text-Independent Speaker Identification", International Conference on Computing Theory and Applications, pp. 463-467, March, 2007.
11.	Yegnanarayana, B., Prasanna, S.R.M., Zachariah, J.M., and Gupta, C.S., "Combining Evidence from Source, Suprasegmental and Spectral Features for a Fixed-Text Speaker Verification System", IEEE Transactions on Speech and Audio Processing, Volume 13, No. 4, pp. 575-582, July, 2005.
12.	Murty K.S.R., and Yegnanarayana, B., "Combining Evidence from Residual Phase and MFCC Features for Speaker Recognition", IEEE Signal Processing Letters, Volume 13, No. 1, pp. 52-55, January, 2006.
13.	Prasanna, S.R.M., Cheedella, S.G., and Yegnanarayana, B., "Extraction of Speaker-Specific Excitation Information from Linear Prediction Residual of Speech", Speech Communication, Volume 48, No. 10, pp. 1243-1261, October, 2006.
14.	Gold, B., and Morgan, N., "Speech and Audio Signal Processing", Part-IV, Chapter-14, pp. 189-203, John Willy & Sons, 2002.
15.	Damper, R., and Higgins, J., "Improving Speaker Identification in Noise by Sub Band Processing and Decision Fusion", Pattern Recognition Letters, Volume 24, pp. 2167-2173, 2003.
16.	Sorenson, H.W., and Aspach, D.L., "Recursive Bayesian Estimation Using Gaussian Sums", Automatica, Volume 7, pp. 465-479, 1971.
17.	Duda, R.O., and Hart, P.E., "Pattern Classification and Scene Analysis", Wiley, New York, 1973.
18.	Reynolds, D.A., and Rose, R.C., "Robust Text- Independent Speaker Identification Using Gaussian Mixture Speaker Models", IEEE Transactions on Speech and Audio Processing, Volume 3, No.1, pp. 72-83, 1995.
19.	Pelecanos, J., Myers, S., Sridharan, S., and Chandran, V., "Vector Quantization Based Gaussian Modeling for Speaker Verification", International Conference on Pattern Recognition, Volume 3, pp. 294-297, Spain, 2000
20.	Alpaydm, E., "Soft Vector Quantization and the EM Algorithm", Neural Networks, Volume 11, No. 3, pp. 467-477, April, 1998.
21.	Ueda, N., and Nakano, R., "Deterministic Annealing EM Algorithm", Neural Networks, Volume 11, pp. 271-282, 1998.
22.	Ueda, N., and Nakano, R., "Deterministic Annealing EM Algorithm", Neural Networks, Volume 11, pp. 271-282, 1998.
23.	Ververidis, D., and Kotropoulos, C., "Gaussian Mixture Modeling by Exploiting the Mahalanobis Distance", IEEE Transactions on Signal Processing, Volume 56, No. 7, pp. 2797-2811, July, 2008.
24.	Hedelin, P., and Skoglund, J., "Vector Quantization Based on Gaussian Mixture Models", IEEE Transactions on Speech and Audio Processing, Volume 8, No. 4, pp. 385-401, 2000.
25.	Memon, S., and Lech, M., "Speaker Verification Based on Information Theoretic Vector Quantization", Communications in Computer and Information Science, Wireless Networks, Information Processing and Systems, Springer Berlin Heidelberg, 2009.
26.	Memon, S., and Lech, M., "Using Information Theoretic Vector Quantization for GMM Based Speaker Verification", 16th European Signal Processing Conference, Lausanne, Switzerland, August 25-29, 2008.
27.	Memon, S., Lech, M., and Maddage, N., "Speaker Verification Based on Different Vector Quantization Techniques with Gaussian Mixture Models", Third International Conference on Network and System Security, 2009.
28.	Memon, S., Lech, M., and Maddage, N., "Information Theoretic Expectation Maximization Based Gaussian Mixture Modeling for Speaker Verification", 20th International Conference on Pattern Recognition, pp. 4536-4540, [ISBN: 978-0-7695-4109-9], 2010.
29.	Reynolds, D.A, Rose, R.C., and Smith, M.J.T., "The PCBased TMS 320C30 Implementation of the Gaussian Mixture Model Text-Independent Speaker Recognition System", Proceedings of ICSPAT, pp. 967-973, November, 1992.