30.01.2014 Views

Annual Report 2010 - Fachgruppe Informatik an der RWTH Aachen ...

Annual Report 2010 - Fachgruppe Informatik an der RWTH Aachen ...

Annual Report 2010 - Fachgruppe Informatik an der RWTH Aachen ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

characteristic measures. However, as the data stream proceeds, previous results may become<br />

invalid with respect to recently arrived data items. Thus, maintaining correct result in a data<br />

stream environment, e.g. to a top-k query, makes efficient continuous query processing <strong>an</strong>d<br />

incremental algorithms necessary.<br />

Anytime algorithms are capable of dealing with varying time constraints <strong>an</strong>d high data<br />

volumes as described above. The adv<strong>an</strong>tages of <strong>an</strong>ytime algorithms c<strong>an</strong> be summarized as<br />

flexibility (exploit all available time), interruptibility (provide a decision at <strong>an</strong>y time of<br />

interruption) <strong>an</strong>d incremental improvement (continue improvement from current position<br />

without restart).<br />

Figure 6: Bayes Tree as hierarchical org<strong>an</strong>ization of (Gaussi<strong>an</strong>) mixture models for density estimation<br />

MOA – Massive Online Analysis<br />

Philipp Kr<strong>an</strong>en, Hardy Kremer<br />

MOA (Massive On-line Analysis) is a framework for data stream mining. It includes tools for<br />

evaluation <strong>an</strong>d a collection of machine learning algorithms. It is related to the WEKA project,<br />

is also written in Java, but scales to more dem<strong>an</strong>ding problems. The goal of MOA is a<br />

benchmark framework for running experiments in the data stream mining context by<br />

providing<br />

• storable settings for data streams (real <strong>an</strong>d synthetic) for repeatable experiments,<br />

• a set of existing algorithms <strong>an</strong>d measures form the literature for comparison <strong>an</strong>d<br />

• <strong>an</strong> easily extendable framework for new streams, algorithms <strong>an</strong>d evaluation methods.<br />

308

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!