25.08.2013 Views

PDF (Online Text) - EURAC

PDF (Online Text) - EURAC

PDF (Online Text) - EURAC

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

that can be used by external developers. With limited resources of time, money and<br />

labour, every administrative chore added to the workflow reduces resources available<br />

for more delivery-oriented tasks.<br />

The first decision to the problem of ‘chores’ was that the solution should be a<br />

Speech Processing Resource in its own right. The solution should consist of a set of<br />

reusable, extensible, shareable tools to be made available to (a) ourselves on future<br />

projects, and (b) other teams working on speech processing projects around the<br />

world.<br />

2.1 Requirements<br />

The main design goals of our solution are as follows:<br />

• researchers should be able to work independently of data format<br />

restrictions;<br />

• necessary, complicated, but uninteresting tasks should be automated;<br />

• interesting, but complicated tasks should be made simple;<br />

• researchers should be able to address linguistic problems with linguistic<br />

solutions;<br />

• the toolkit should be increasingly simple to maintain and develop; and,<br />

• the toolkit should encourage its own use and development.<br />

• Researchers should be able to work independently of data format<br />

restrictions.<br />

Data can be collected, transcribed and stored in a range of formats. Each of the<br />

range of available tools for language technology research accepts or generates data<br />

in its own format, or in a limited range of standard formats. Researchers should not<br />

have to worry about which format works with which application: they should be able<br />

to pick the application necessary (or preferred) for the research problem, and the<br />

data should be readily accessible in the correct format.<br />

• Necessary, complicated, but uninteresting tasks should be automated.<br />

This applies to life in general, of course.<br />

• Interesting but complicated tasks should be made uncomplicated.<br />

Due to the nature of the field, where it is often necessary to process large sets of<br />

data, many of the more interesting problems (e.g. building AMs for speech recognition)<br />

involve procedures that are repetitive (e.g. those that have to be applied to every<br />

item in a corpus) or complicated (e.g. initialising a system). Researchers learning<br />

about a new area are hampered when these tasks dominate learning time.<br />

173

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!