05.03.2013 Views

Xcell Journal Issue 78: Charge to Market with Xilinx 7 Series ...

Xcell Journal Issue 78: Charge to Market with Xilinx 7 Series ...

Xcell Journal Issue 78: Charge to Market with Xilinx 7 Series ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

XPERT OPINION<br />

Figure 3 – Pedestrian detection and tracking capabilities included a “trip wire” alarm that<br />

reported when an individual moved <strong>with</strong>in a bordered portion of the video frame.<br />

of the video-analytics IP cores, <strong>with</strong><br />

the generated metadata passing <strong>to</strong> a<br />

secondary algorithm suite for additional<br />

processing. Challenges this time<br />

around included:<br />

• Resolving the fundamental tradeoff<br />

between unwanted noise and<br />

proper object segmentation<br />

Input<br />

Video<br />

VA IP<br />

Config.<br />

File<br />

Appl.<br />

Config.<br />

File<br />

VA IP<br />

Configuration<br />

Preprocessing<br />

• Varying object morphology (form<br />

and structure)<br />

• Varying object motion, both person-<strong>to</strong>-person<br />

and over time <strong>with</strong><br />

a particular person<br />

• Vanishing metadata, when a person<br />

s<strong>to</strong>ps moving, for example,<br />

is blocked by an intermediary<br />

Parser Tracker Identifier<br />

Get one<br />

frame of<br />

metadata,<br />

parse and<br />

preclassify<br />

Specialized Video Analytics Hardware<br />

Image<br />

Analysis<br />

Match metadata<br />

objects <strong>to</strong><br />

tracked objects<br />

Object<br />

Segmentation<br />

Predict motion<br />

of each<br />

tracked object<br />

Tracked<br />

objects list<br />

Metadata<br />

Generation<br />

Once per frame<br />

Identify tracked<br />

objects by<br />

category<br />

object or blends in<strong>to</strong> the background<br />

pattern<br />

• Other objects in the scene, both<br />

stationary and in motion<br />

• Varying distance between each<br />

person and the camera, and<br />

• Individuals vs. groups, and dominant<br />

vs. contrasting motion vec<strong>to</strong>rs<br />

<strong>with</strong>in a group<br />

With respect <strong>to</strong> the “trip wire”<br />

implementation, four distinct video<br />

streams were particularly effective in<br />

debugging and optimizing the videoanalytics<br />

algorithms:<br />

• “Near” pedestrians walking and<br />

reversing directions<br />

• “Near” pedestrians walking in<br />

two different directions<br />

• A “far” pedestrian <strong>with</strong> a moving<br />

truck that appeared, through a<br />

trick of perspective, <strong>to</strong> be of a<br />

comparable size, and<br />

• “Far” pedestrians <strong>with</strong> an<br />

approaching truck that appeared<br />

larger than they were<br />

On-Screen<br />

Display<br />

Figure 4 – The final phase of the project migrated from <strong>Xilinx</strong>’s simulation models <strong>to</strong> actual FPGA IP blocks. BDTI also ported the second-level algorithms<br />

from an x86 CPU <strong>to</strong> an ARM-based SoC, thereby paving the path for the single-chip Zynq Extensible Processing Platform successor.<br />

18 <strong>Xcell</strong> <strong>Journal</strong> First Quarter 2012<br />

Test<br />

for<br />

alarm<br />

Output<br />

Video<br />

Process<br />

for<br />

OSD

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!