30.01.2014 Views

Annual Report 2010 - Fachgruppe Informatik an der RWTH Aachen ...

Annual Report 2010 - Fachgruppe Informatik an der RWTH Aachen ...

Annual Report 2010 - Fachgruppe Informatik an der RWTH Aachen ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Overview<br />

The ‘Lehrstuhl für <strong>Informatik</strong> 6’ is concerned with research on adv<strong>an</strong>ced methods for<br />

statistical pattern recognition. The main application of these methods is in the field of<br />

automatic processing of hum<strong>an</strong> l<strong>an</strong>guage, i.e. the recognition of speech, the tr<strong>an</strong>slation of<br />

spoken <strong>an</strong>d written l<strong>an</strong>guage, the un<strong>der</strong>st<strong>an</strong>ding of natural l<strong>an</strong>guage <strong>an</strong>d spoken dialogue<br />

systems, <strong>an</strong>d image <strong>an</strong>d optical character recognition.<br />

The general framework for the research activities is based on statistical decision theory <strong>an</strong>d<br />

problem specific modelling. The prototypical area where this approach has been pushed<br />

forward is speech recognition. Here, the approach is expressed by the equation:<br />

Speech Recognition = Acoustic-Linguistic Modelling + Statistical Decision Theory<br />

The characteristic adv<strong>an</strong>tages of the probabilistic framework <strong>an</strong>d statistical decision theory<br />

are:<br />

• The approach is able to model weak dependencies <strong>an</strong>d vague knowledge at all levels of<br />

the system.<br />

• The free parameters of the models c<strong>an</strong> be automatically learned from training data (or<br />

examples), <strong>an</strong>d there exist powerful algorithms for this purpose.<br />

• Using the Bayes decision rule (as <strong>der</strong>ived from statistical decision theory), the final<br />

decision is made by taking all available context into account. For example, in large<br />

vocabulary speech recognition, a sound is always recognized as a part of a word, which<br />

itself is part of a sentence. This allows the optimal feedback from the syntactic-sem<strong>an</strong>tic<br />

constraints of the l<strong>an</strong>guage down to the level of sound recognition.<br />

From speech recognition, we have extended <strong>an</strong>d are still extending this approach to other<br />

areas, in particular the tr<strong>an</strong>slation of spoken <strong>an</strong>d written l<strong>an</strong>guage <strong>an</strong>d other tasks in natural<br />

l<strong>an</strong>guage processing. For l<strong>an</strong>guage tr<strong>an</strong>slation, the approach is expressed by the equation:<br />

L<strong>an</strong>guage Tr<strong>an</strong>slation = Linguistic Modelling + Statistical Decision Theory<br />

In addition, it offers a couple of adv<strong>an</strong>tages like increased robustness <strong>an</strong>d easy adaptation to a<br />

new task.<br />

In summary, the research activities of the ‘Lehrstuhl für <strong>Informatik</strong> 6’ cover the following<br />

applications:<br />

• speech recognition<br />

o large vocabulary recognition<br />

o multi-lingual speech recognition<br />

o speaker independent <strong>an</strong>d adaptive speech recognition<br />

o robust speech recognition<br />

• machine tr<strong>an</strong>slation of spoken <strong>an</strong>d written l<strong>an</strong>guage<br />

• natural l<strong>an</strong>guage processing<br />

o document classification<br />

o l<strong>an</strong>guage un<strong>der</strong>st<strong>an</strong>ding<br />

o spoken dialogue systems<br />

• part-of-speech tagging <strong>an</strong>d text <strong>an</strong>notation<br />

238

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!