25.08.2013 Views

PDF (Online Text) - EURAC

PDF (Online Text) - EURAC

PDF (Online Text) - EURAC

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Table 1: SpeechCluster command-line tools<br />

Tool Function<br />

segFake ‘Fake autosegmentation’ of a speech audio file<br />

segInter Interpolates labels into a segmented but unlabelled segment tier<br />

segMerge Merges separate label files<br />

segReplace Converts labels in a label file<br />

segSwitch Converts label file format<br />

splitAll Splits audio/label file pairs<br />

3.2 Using SpeechCluster I: The Tools<br />

a) SegSwitch<br />

SegSwitch is a label file format converter. It converts label files between any of the<br />

formats supported by SpeechCluster (currently, Praat <strong>Text</strong>Grid, esps and the various<br />

HTK formats [i.e., the simple .lab format and the multi-file .mlf format]). This kind of<br />

format conversion is a very common task. For example, HTK requires files to be in its<br />

own esps-like format, but our team prefers to handlabel files in Praat, which outputs<br />

its own <strong>Text</strong>Grid format. Festival uses an esps-like format that is slightly different<br />

from HTK’s.<br />

SegSwitch has a simple command-line interface (see Table 2), in which single files<br />

or whole directories can be converted easily and perfectly.<br />

Usage:<br />

Examples:<br />

Table 2: segSwitch usage<br />

segSwitch -i -o <br />

segSwitch -d -o <br />

segSwitch -i example.lab -o example.<strong>Text</strong>Grid<br />

segSwitch -d labDir -o textGrid<br />

A simple facility like this has a remarkable effect on the efficiency of a team. The<br />

team no longer has to worry about in what file format they have to work. They can<br />

concentrate on the research task converting files in and out of particular formats as<br />

needed. In a sense, the two parts of the work- the research and the bookkeeping-<br />

have been separated, and the bookkeeping is done by the tools. This division of labour<br />

is repeated between the tools and the SpeechCluster module itself. As much of the<br />

low-level data manipulation as possible is carried out by SpeechCluster, so that the<br />

tools themselves can be written in simple, task-oriented terms.<br />

Table 3 shows the main code for segSwitch (excluding the command-line parsing and<br />

the loop over files in a directory): all of the work of file format conversion is done by<br />

175

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!