13.07.2015 Views

Proceedings Fonetik 2009 - Institutionen för lingvistik

Proceedings Fonetik 2009 - Institutionen för lingvistik

Proceedings Fonetik 2009 - Institutionen för lingvistik

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>Proceedings</strong>, FONETIK <strong>2009</strong>, Dept. of Linguistics, Stockholm UniversityA first step towards a text-independent speaker verificationPraat plug-in using Mistral/Alize toolsJonas LindhDepartment of Philosophy, Linguistics and Theory of Science, University of GothenburgAbstractText-independent speaker verification can be auseful tool as a substitute for passwords or increasedsecurity check. The tool can also beused in forensic phonetic casework. A text-independentspeaker verification Praat plug-inwas created using tools from the open sourceMistral/Alize toolkit. A gate keeper setup wascreated for 13 department employees and testedfor verification. 2 different universal backgroundmodels where trained and the same settested and evaluated. The results show promisingresults and give implications for the usefulnessof such a tool in research on voice quality.IntroductionAutomatic methods are increasingly being usedin forensic phonetic casework, but most oftenin combination with aural/acoustic methods. Itis therefore important to get a better understandingof how the two systems compare. Forseveral studies on voice quality judgement, butalso as a tool for visualisation and demonstration,a text-independent speaker comparisonwas implemented as a plugin to the phoneticanalysis program Praat (Boersma & Weenink,<strong>2009</strong>). The purpose of this study was to makean as easy to use implementation as possible sothat people with phonetic knowledge could usethe system to demonstrate the technique or performresearch. A state-of-art technique, the socalled GMM-UBM (Reynolds, 2000), was appliedwith tools from the open source toolkitMistral (former Alize) (Bonastre et al., 2005;2008). This paper describes the surface of theimplementation and the tools used without anydeeper analysis to get an overview. A small testwas then made on high quality recordings tosee what difference the possession of trainingdata for the universal background model makes.The results show that for demonstration purposesa very simple world model including thespeakers you have trained as targets is sufficient.However, for research purposes a largerworld model should be trained to be able toshow more correct scores.Mistral (Alize), an open source toolkitfor building a text-independent speakercomparison systemThe NIST speaker recognition evaluation campaignstarted already 1996 with the purpose ofdriving the technology of text-independentspeaker recognition forward as well as test theperformance of the state-of-the-art approachand to discover the most promising algorithmsand new technological advances (fromhttp://www.nist.gov/speech/tests/sre/ Jan 12,<strong>2009</strong>). The aim is to have an evaluation at leastevery second year and some tools are providedto facilitate the presentation of the results andhandling the data (Martin and Przybocki,1999). A few labs have been evaluating theirdevelopments since the very start with increasingperformances over the years. These labsgenerally have always performed best in theevaluation. However, an evaluation is a rathertedious task for a single lab and the question ofsome kind of coordination came up. This coordinationcould be just to share information,system scores or other to be able to improve theresults. On the other hand, the more naturalchoice to be able to share and interpret results isopen source. On the basis of this Mistral andmore specifically the ALIZE SpkDet packageswere developed and released as open sourcesoftware under a so-called LGPL licence(Bonastre et al., 2005; 2008).MethodA standard setup was made for placing datawithin the plugin. On the top of the tree structureseveral scripts controlling executable binaries,configuration files, data etc. were createdwith basic button interfaces that show up in agiven Praat configuration. The scripts weremade according to the different necessary stepsthat have to be covered to create a test environment.194

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!