Big Data Analytics
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
140 D. Sitaram and K.V. Subramaniam<br />
requests that are queued. Further, given a large factory setting, with a multitude of<br />
such machines, the velocity of data ingestion requires non-traditional means.<br />
While this appears to be an example of an IoT (Internet of Things) system, a<br />
more careful examination of the system reveals that the intelligence is derived from<br />
not only sensors that read data but also a mixture of cameras that take images for<br />
tracking quality, process configuration files, and log files produced by the controlling<br />
machines. In other words, given the volume and variety of the data that is to<br />
be ingested, it is better to treat the system as a big data system so that the analysis<br />
can derive value for the manufacturing unit.<br />
The above considerations necessitate the introduction of systems that can process<br />
and draw inferences from the events in real time. As argued earlier, for many<br />
applications, an event model that captures the relationship between the different<br />
events is needed. In the next section, we give a brief overview of various features of<br />
such a complex event processing system.<br />
3 Basic Features of Complex Event Systems<br />
A real-time complex event processing system that is capable of processing millions<br />
of events from various types of sources that can be viewed as illustrated in Fig. 4<br />
[12]. The event observers or the sources on the left generate events. In our<br />
multi-tank cleaner example, the sources refer to timestamped values of temperature,<br />
pressure, and batch codes of the wafers being processed. The brokers or event<br />
processing agents encode the business logic to act upon the events. For example, to<br />
decide if the temperature reached its critical value over an interval, the brokers have<br />
to process a sequence of values to determine if the operating temperature is valid. If<br />
so, it signals an event indicating the operating temperature which is valid. If not, an<br />
operating temperature invalid event is raised; this generated event can act as a<br />
source event for subsequent stages in the event processing network. Observe that<br />
the graph of the event processing network is logical and does not imply the physical<br />
distribution of the brokers. The processing agents can be distributed across a set of<br />
machines and need not be on a single machine. Also, this network must not have<br />
any cycles. There may be consumers that are interested in the events being generated<br />
by the system. These are represented by the sinks in Fig. 3.<br />
A complex event processing system such as the one described above will have to<br />
consider the following aspects in designing the system. These are based on the eight<br />
requirements enumerated by [34].<br />
1. <strong>Data</strong> Model/Semantics—this provides a mechanism to handle data from a<br />
variety of sources and also to handle relationships across various events. Consider<br />
our example of the manufacturing intelligence application. Herein, we<br />
need not only a model for being able to extract events from various types of<br />
sensors and log files, but also a model that can help express the relationship<br />
among the various events.