Using Reversed MFCC and IT-EM for Automatic Speaker Verification
Keywords: Information Theory, Expectation Maximization, MFCC, Gaussian Mixture Model, Speaker
Verification.
Mehran University Research Journal of Engineering & Technology
Volume 31 , Issue 1
Sheeraz Memon , Sania Bhatti , Tariq Jamil Saifullah Khanzada ,
References
1. |
Quatieri, T.F., "Discrete-Time Speech Signal Processing
Principles and Practice", Prentice Hall, 2002. |
2. |
Liu, Y., Russell, M., and Carey, M.," The Role of Dynamic
Features in Text-Dependent and Independent Speaker
Verification", IEEE International Conference on
Acoustics, Speech and Signal Processing (ICASSP),
Volume 1, May, 2006. |
3. |
Reynolds, D.A., Quatieri, T., and Dunn, R., "Speaker
Verification Using Adapted Gaussian Mixture Models",
Digital Signal Processing, Volume 10, No. 1, pp. 19-41,
2000. |
4. |
Bozkurt, E., Erzin, E., Erdem, C.E., and Erdem, A.T.,
"Automatic Emotion Recognition for Facial Expression
Animation from Speech", IEEE Conference on Signal Processing and Communications, pp. 989-992, 2009. |
5. |
Lehn-Schiøler, T., Hegde, A., Erdogmuz, D., and
Principe, J., "Vector-Quantization Using Information
Theoretic Concepts", Natural Computing, Voume 4,
No. 1, pp. 39-51, January, 2005. |
6. |
Zwicker, E., and Terhardt, E., "Analytical Expressions
for Critical Band Rate and Critical Bandwidth as a
Function of Frequency", Journal of Acoustical Society
of America, Volume 68, No. 5, pp. 1523-1525, 1980. |
7. |
Scherer, K.R., Johnstone, T., Klasmeyer, G., and Bänziger,
T., "Can Automatic Speaker Verification be Improved
by Training the Algorithms on Emotional Speech",
University of Geneva, Switzerland, 2000. |
8. |
Davis, S.B., and Mermelstein, P., "Comparison of
Parametric Representation for Monosyllabic Word
Recognition in Continuously Spoken Sentence", IEEE
Transactions on Acoustic Speech and Signal Processing,
Volume 28, No. 4, pp. 357-366, 1980. |
9. |
Reynolds, D.A., "Speaker Identification and Verification
Using Gaussian Mixture Speaker Models", Speech
Communication, Volume 17, pp. 91-108, 1995. |
10. |
Chakroborty, S., Roy, A., Majumdar, S., and Saha, G.,
"Capturing Complementary Information via Reversed
Filter Bank and Parallel Implementation with MFCC
for Improved Text-Independent Speaker Identification",
International Conference on Computing Theory and
Applications, pp. 463-467, March, 2007. |
11. |
Yegnanarayana, B., Prasanna, S.R.M., Zachariah, J.M.,
and Gupta, C.S., "Combining Evidence from Source,
Suprasegmental and Spectral Features for a Fixed-Text
Speaker Verification System", IEEE Transactions on
Speech and Audio Processing, Volume 13, No. 4,
pp. 575-582, July, 2005. |
12. |
Murty K.S.R., and Yegnanarayana, B., "Combining
Evidence from Residual Phase and MFCC Features for
Speaker Recognition", IEEE Signal Processing Letters,
Volume 13, No. 1, pp. 52-55, January, 2006. |
13. |
Prasanna, S.R.M., Cheedella, S.G., and Yegnanarayana,
B., "Extraction of Speaker-Specific Excitation
Information from Linear Prediction Residual of Speech",
Speech Communication, Volume 48, No. 10,
pp. 1243-1261, October, 2006. |
14. |
Gold, B., and Morgan, N., "Speech and Audio Signal
Processing", Part-IV, Chapter-14, pp. 189-203, John
Willy & Sons, 2002. |
15. |
Damper, R., and Higgins, J., "Improving Speaker
Identification in Noise by Sub Band Processing and
Decision Fusion", Pattern Recognition Letters,
Volume 24, pp. 2167-2173, 2003. |
16. |
Sorenson, H.W., and Aspach, D.L., "Recursive Bayesian
Estimation Using Gaussian Sums", Automatica,
Volume 7, pp. 465-479, 1971. |
17. |
Duda, R.O., and Hart, P.E., "Pattern Classification and
Scene Analysis", Wiley, New York, 1973. |
18. |
Reynolds, D.A., and Rose, R.C., "Robust Text-
Independent Speaker Identification Using Gaussian
Mixture Speaker Models", IEEE Transactions on Speech
and Audio Processing, Volume 3, No.1, pp. 72-83, 1995. |
19. |
Pelecanos, J., Myers, S., Sridharan, S., and Chandran, V.,
"Vector Quantization Based Gaussian Modeling for
Speaker Verification", International Conference on
Pattern Recognition, Volume 3, pp. 294-297, Spain,
2000 |
20. |
Alpaydm, E., "Soft Vector Quantization and the EM
Algorithm", Neural Networks, Volume 11, No. 3,
pp. 467-477, April, 1998. |
21. |
Ueda, N., and Nakano, R., "Deterministic Annealing
EM Algorithm", Neural Networks, Volume 11,
pp. 271-282, 1998. |
22. |
Ueda, N., and Nakano, R., "Deterministic Annealing
EM Algorithm", Neural Networks, Volume 11,
pp. 271-282, 1998. |
23. |
Ververidis, D., and Kotropoulos, C., "Gaussian Mixture
Modeling by Exploiting the Mahalanobis Distance",
IEEE Transactions on Signal Processing, Volume 56,
No. 7, pp. 2797-2811, July, 2008. |
24. |
Hedelin, P., and Skoglund, J., "Vector Quantization Based
on Gaussian Mixture Models", IEEE Transactions on
Speech and Audio Processing, Volume 8, No. 4,
pp. 385-401, 2000. |
25. |
Memon, S., and Lech, M., "Speaker Verification Based
on Information Theoretic Vector Quantization",
Communications in Computer and Information Science,
Wireless Networks, Information Processing and Systems,
Springer Berlin Heidelberg, 2009. |
26. |
Memon, S., and Lech, M., "Using Information Theoretic
Vector Quantization for GMM Based Speaker
Verification", 16th European Signal Processing
Conference, Lausanne, Switzerland, August 25-29, 2008. |
27. |
Memon, S., Lech, M., and Maddage, N., "Speaker
Verification Based on Different Vector Quantization
Techniques with Gaussian Mixture Models", Third
International Conference on Network and System
Security, 2009. |
28. |
Memon, S., Lech, M., and Maddage, N., "Information
Theoretic Expectation Maximization Based Gaussian
Mixture Modeling for Speaker Verification", 20th
International Conference on Pattern Recognition,
pp. 4536-4540, [ISBN: 978-0-7695-4109-9], 2010. |
29. |
Reynolds, D.A, Rose, R.C., and Smith, M.J.T., "The PCBased
TMS 320C30 Implementation of the Gaussian
Mixture Model Text-Independent Speaker Recognition
System", Proceedings of ICSPAT, pp. 967-973,
November, 1992. |
|
|
|