12.06.2018 Views

Big Data Analytics

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

140 D. Sitaram and K.V. Subramaniam<br />

requests that are queued. Further, given a large factory setting, with a multitude of<br />

such machines, the velocity of data ingestion requires non-traditional means.<br />

While this appears to be an example of an IoT (Internet of Things) system, a<br />

more careful examination of the system reveals that the intelligence is derived from<br />

not only sensors that read data but also a mixture of cameras that take images for<br />

tracking quality, process configuration files, and log files produced by the controlling<br />

machines. In other words, given the volume and variety of the data that is to<br />

be ingested, it is better to treat the system as a big data system so that the analysis<br />

can derive value for the manufacturing unit.<br />

The above considerations necessitate the introduction of systems that can process<br />

and draw inferences from the events in real time. As argued earlier, for many<br />

applications, an event model that captures the relationship between the different<br />

events is needed. In the next section, we give a brief overview of various features of<br />

such a complex event processing system.<br />

3 Basic Features of Complex Event Systems<br />

A real-time complex event processing system that is capable of processing millions<br />

of events from various types of sources that can be viewed as illustrated in Fig. 4<br />

[12]. The event observers or the sources on the left generate events. In our<br />

multi-tank cleaner example, the sources refer to timestamped values of temperature,<br />

pressure, and batch codes of the wafers being processed. The brokers or event<br />

processing agents encode the business logic to act upon the events. For example, to<br />

decide if the temperature reached its critical value over an interval, the brokers have<br />

to process a sequence of values to determine if the operating temperature is valid. If<br />

so, it signals an event indicating the operating temperature which is valid. If not, an<br />

operating temperature invalid event is raised; this generated event can act as a<br />

source event for subsequent stages in the event processing network. Observe that<br />

the graph of the event processing network is logical and does not imply the physical<br />

distribution of the brokers. The processing agents can be distributed across a set of<br />

machines and need not be on a single machine. Also, this network must not have<br />

any cycles. There may be consumers that are interested in the events being generated<br />

by the system. These are represented by the sinks in Fig. 3.<br />

A complex event processing system such as the one described above will have to<br />

consider the following aspects in designing the system. These are based on the eight<br />

requirements enumerated by [34].<br />

1. <strong>Data</strong> Model/Semantics—this provides a mechanism to handle data from a<br />

variety of sources and also to handle relationships across various events. Consider<br />

our example of the manufacturing intelligence application. Herein, we<br />

need not only a model for being able to extract events from various types of<br />

sensors and log files, but also a model that can help express the relationship<br />

among the various events.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!