23.03.2017 Views

wilamowski-b-m-irwin-j-d-industrial-communication-systems-2011

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Processing Data in Complex Communication Systems 68-7<br />

Log-likelihood Sensor values<br />

70<br />

60<br />

50<br />

40<br />

30<br />

20<br />

68.6 Intelligent Surveillance Systems<br />

Today, surveillance has become a relevant means for protecting public and <strong>industrial</strong> areas against<br />

malicious subjects like burglars or vandals. For both keeping privacy of irreproachable citizens as well<br />

as enabling automated detection of potential threats, computer-based <strong>systems</strong> are needed that support<br />

human operators in recognizing unusual situations. A suitable approach for that purpose is to utilize<br />

a hierarchical architecture of semantic processing layers distributed in a network of nodes. The goal of<br />

these layers is to learn the “normality” in the environment of the network, in order to detect unusual<br />

situations and to inform the human operator in such cases. The SENSE project [WSe06,BKV+08] implements<br />

such an architecture. Therefore, we will briefly cover it here as an example of how hierarchical<br />

semantic processing can be used for surveillance <strong>systems</strong>.<br />

SENSE consists of a network of communicating sensor nodes, each equipped with a camera and a<br />

microphone array. These sensor modalities observe their environment and deliver streams of mono-modal<br />

events to a reasoning unit, which derives fused high-level observations from this information. These<br />

observations are exchanged with neighbor nodes in order to establish a global view about the commonly<br />

observed environment. Detected potential threats are finally reported to the person(s) in charge.<br />

Though most of the methods used in the particular layers are widely used in many applications, the<br />

benefit lies in the combination of them in order to let the messages of the system really appear meaningful<br />

to the user.<br />

68.6.1 architecture<br />

5<br />

0<br />

–5<br />

FIGURE 68.2 Sensor value and log-likelihood for a single sensor from the system. Unusual sensor values register<br />

as drops in the log-likelihood, causing alarms.<br />

In this case, an eight-layer data processing architecture is adopted, in which the lower layers are responsible<br />

for a stable and comprehensive world representation to be evaluated in the higher layers (Figure 68.3).<br />

First, the visual low-level feature extraction (layer 0) processes frame by frame from the camera in<br />

2D camera coordinates and extracts predefined visual objects. At the same time, the audio low-level<br />

extraction scans the acoustic signals from a linear eight-microphone array for trained sound patterns of<br />

predefined categories. Due to limited processing capabilities, this layer can deliver significantly unstable<br />

data in both modalities. In case of unfortunate conditions for the camera, detected symbols can change<br />

their label from one category to another and back for the same physical object within consecutive frames.<br />

The size of detected symbols can change from small elements to large ones covering tens of square<br />

© <strong>2011</strong> by Taylor and Francis Group, LLC

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!