Voice Biometrics Based on Pitch Replication

L.C. Moreno; P.B. Lopes

doi:10.31686/ijier.vol6.iss10.1201

Authors

L.C. Moreno Universidade Presbiteriana Mackenzie, Brazil Author
P.B. Lopes Universidade Presbiteriana Mackenzie, Brazil Author

DOI:

https://doi.org/10.31686/ijier.vol6.iss10.1201

Keywords:

authentication, biometrics, pitch, algorithm, pitch replication

Abstract

Authentication and security in automated systems have become very much necessary in our days and many techniques have been proposed towards this end. One of these alternatives is biometrics in which human body characteristics are used to authenticate the system user. The objective of this article is to present a method of text independent speaker identification through the replication of pitch characteristics. Pitch is an important speech feature and is used in a variety of applications, including voice biometrics. The proposed method of speaker identification is based on short segments of speech, namely, three seconds for training and three seconds for the speaker determination. From these segments pitch characteristics are extracted and are used in the proposed method of replication for identification of the speaker.

Author Biographies

L.C. Moreno, Universidade Presbiteriana Mackenzie, Brazil

Programa de Pós-Graduação em Engenharia Elétrica e Computação
P.B. Lopes, Universidade Presbiteriana Mackenzie, Brazil

Programa de Pós-Graduação em Engenharia Elétrica e Computação

References

CemalHanilçi and FigenErtaş, “Investigation of the effect of data duration and speaker gender on text-independent speaker recognition”, Computer and Electrical Engineering-2012. http://cs.uef.fi/sipu/pub/kanervisto17_gender.pdf

Mak MW, Hsiao R, Mak B. A comparison of various adaptation methods for speaker verification with limited enrollment data. In: Proc. ICASSP; 2006. p. 929-32 http://www.eie.polyu.edu.hk/~mwmak/papers/icassp06Poster.pdf

Vogt R, Sridharan S. “Experiments in session variability modelling for speaker verification”. In: Proc. ICASSP; 2006. p. 897-900 https://dl.acm.org/citation.cfm?id=2464271

Fauve BGB, Evans NWD, Pearson N, Bonastre JF, Mason JSD. “Influence of task duration in text-independent speaker verification”. In: Proc. interspeech; 2007. p. 794-7. https://dl.acm.org/citation.cfm?id=2464271 DOI: https://doi.org/10.21437/Interspeech.2007-151

Vogt R, Baker B, Sridharan S. “Factor analysis subspace estimation for speaker verification with short utterances”. In: Proc. interspeech; 2008. p.853-6. https://www.researchgate.net/profile/Figen_Ertas/publication/235995473_Investigation_of_the_effect_of_data_duration_and_speaker_gender_on_text-independent_speaker_recognition/links/5b090f0caca2725783e63547/Investigation-of-the-effect-of-data-duration-and-speaker-gender-on-text-independent-speaker-recognition.pdf

Vogt R, Lustri C, Sridharan S. “Factor analysis modelling for speaker verification with short utterances”. In: Proc. speaker Odyssey; 2008 https://eprints.qut.edu.au/12629/ DOI: https://doi.org/10.21437/Interspeech.2008-274

Vogt R, Pelecanos JW, Scheffer N, Kajarekar SS, Sridharan S. “Within-session variability modelling for factor analysis speaker verification”. In: Proc. interspeech; 2009. p. 1563-6. https://www.isca-speech.org/archive/interspeech_2009/i09_1563.html DOI: https://doi.org/10.21437/Interspeech.2009-386

Pelecanos J, Chaudhari U, Ramaswamy G. “Compensation of utterance length for speaker verification”. In: Proc. speaker Odyssey; 2004. https://repositorio.uam.es/bitstream/handle/10486/7508/42232_gonzalez_dominguez_javier.pdf?sequence=1

McLaren M, Vogt R, Baker B, Sridharan S. “Experiments in svm-based speaker verification using short utterances”. In: Proc. speaker Odyssey; 2010. p. 83-90. https://www.researchgate.net/publication/324936163_Improving_the_performance_of_GPLDA_speaker_verification_using_unsupervised_inter-dataset_variability_compensation_approaches

ArnabPoddar, MdSahidullah, GoutamSaha. “Speaker verification with short utterances: a review of challenges, trends and opportunities”. In: The Institution of Engineering and Techology 2015. https://www.researchgate.net/profile/Arnab_Poddar/publication/320201024_Speaker_Verification_with_Short_Utterances_A_Review_of_Challenges_Trends_and_Opportunities/links/5a0e84f4aca27244d2859732/Speaker-Verification-with-Short-Utterances-A-Review-of-Challenges-Trends-and-Opportunities.pdf

Kinnum,T,Li,H: “An overview of tex-independent speaker recognition from features to supervectores”. Speech Commun, 2010 52(1), pp12-40 http://www.cs.joensuu.fi/pages/tkinnu/webpage/pdf/speaker_recognition_overview.pdf DOI: https://doi.org/10.1016/j.specom.2009.08.009

Campbell, J.P.Jr: “Speaker recognition a tutorial”, Proc. IEEE, 1997, 85(9) pp 1437-1462 https://www.lsv.uni-saarland.de/fileadmin/publications/non_articles/Speaker_Recognition_A_Tutorial.pdf DOI: https://doi.org/10.1109/5.628714

John G. Proakis (Autor),‎ Dimitris G. Manolakis: “Digital Signal Processing”, 4th edition, 2007 https://engineering.purdue.edu/~ee538/DSP_Text_3rdEdition.pdf

Tomi Kinnunen, Haizhou: An Overview of Tex-Independent Speaker Recognition: from Features to Supervectors”, 2011 – https://hal.archives-ouvertes.fr/hal-00587602

D.A. Reynolds, T.F.Quatieri, and R.B. Dunn.: “Speaker verification using adapted gaussian misture models”. Digital signal processing , vol.10 no.1-3 pp 19-41.2000 https://scholar.google.com.br/scholar?q=Speaker+verification+using+adapted+gaussian+mixture+models&hl=pt-BR&as_sdt=0&as_vis=1&oi=scholart DOI: https://doi.org/10.1006/dspr.1999.0361

D.A. Reynolds and R.C. Rose.: “Robust text-independent speaker identification using gaussian mixture speaker models”. IEEE transactions on speech and audio processing, vol.3 no1 pp-7283, 1995. https://scholar.google.com.br/scholar?q=Robust+text-independent+speaker+identification+using+gaussian+mixture+speaker+models&hl=pt-BR&as_sdt=0&as_vis=1&oi=scholart DOI: https://doi.org/10.1109/89.365379

Yi-Hsiang Chao, Wei-Ho Tsai and Hsin-Min Wang: ”Improving GMM-UBM speaker verification using discriminate feedback adaptation” Computer Speech&Language, 2009 https://www.researchgate.net/publication/224364035_Discriminative_Feedback_Adaptation_for_GMM-UBM_Speaker_Verification

S.S. Tirumala, R.Wang.: “Speaker Identification Features Extration Methods: A Systematic Review”, An International Journal on Expert Systems with Applications vol.102.July 15,2017 http://www.massey.ac.nz/~rwang/publications/17-ESwA-Reza.pdf

Join Factor Analysis and i-vector Tutorial, disponivel no http://www1.icsi.Berkeley.edu/Speech 27.03.2018. http://www1.icsi.berkeley.edu/Speech/presentations/AFRL_ICSI_visit2_JFA_tutorial_icsitalk.pdf

M. T. S. Al Kaltakchi and W. L. Woo and S. S. Dlay and J. A. Chamber: Study in Fusion Strategies and Exploiting the Combination of MFCC and PMCC features for Robust Biometric Speaker Identification, 2016. https://pdfs.semanticscholar.org/86f7/488e6ad64ad3a7aab65f936c9686aee91a1a.pdf

LeandroA Silva,S. M Peres Sarajane,BoscarioliClodis: Introdução a Mineração de Dados, Brasil, 2016. https://www.loja.elsevier.com.br/introducao-a-mineracao-de-dados-9788535284461.html

Lawrence Rabiner and Biing-Hwang Juang: Fundamentals of Speech Recognition, EUA 1993. https://www.amazon.com/Fundamentals-Speech-Recognition-Lawrence-Rabiner/dp/0130151572

L. Feng: Speaker Recognition Informatics and Mathematical Modelling – Technical Univeristy of Demmark, DTU, ResearchGate, Sep.2004. https://www.researchgate.net/publication/259333765_Speaker_Recognition

X. Sun,:A pitch determination algorithm based on subharmonic-to-harmonic ratio, pp.679-679 -6th Internacional Conference of Spoken Language Processing -China – 2000 https://ieeexplore.ieee.org/document/5743722 DOI: https://doi.org/10.21437/ICSLP.2000-902

Nayana,P,Mathewa,D and Thomasa,A:Comparation of Text Independent Speaker Identification Systems using GMM and i-Vector Methods, 2017 https://www.sciencedirect.com/science/article/pii/S1877050917318823 DOI: https://doi.org/10.1016/j.procs.2017.09.075

Ruud M. Bolle,Jonathan H. Connel, Sharath Pankanti,Nalini K. Ratha and Andrew W. Senior: Guide to Biometrics, 2004 https://www.springer.com/gp/book/9780387400891 DOI: https://doi.org/10.1007/978-1-4757-4036-3

Garrett Thomas, CS PhD student at Stanford: How does KNN classification compare to classification by neural networks?,2017 https://www.quora.com/How-does-KNN-classification-compare-to-classification-by-neural-networks

Voice Biometrics Based on Pitch Replication

Authors

DOI:

Keywords:

Abstract

Author Biographies

References

Downloads

Published

Issue

Section

License

How to Cite

submit

Side Menu

Announcements

Call for manuscripts for April 2026