20.11.2014 Views

Digital Image Processing - Multimedia Computing and Computer ...

Digital Image Processing - Multimedia Computing and Computer ...

Digital Image Processing - Multimedia Computing and Computer ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

<strong>Digital</strong> <strong>Image</strong> <strong>Processing</strong><br />

SS 2008<br />

Prof. Dr. Rainer Lienhart<br />

www.multimedia-computing.{de,org}


Prelude<br />

• Language<br />

– Lecture: in English, but questions can be phrased in<br />

English or German<br />

– Exam: Questions are in English, but answers can be<br />

given in English or German<br />

– Homework: Questions are in English, but answer can<br />

be given in English or German<br />

– Practice lecture (Übung): in German<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Introduction (1)<br />

• Who am I?<br />

• What are the research topics?<br />

– Automatic content analysis of images, videos, <strong>and</strong> audio<br />

(<strong>Multimedia</strong> Content Analysis)<br />

– Automatic content analysis of multimodal <strong>and</strong> multi-lingual documents<br />

– Data Mining on large scale distributed multimedia databases<br />

(<strong>Multimedia</strong> Google)<br />

– Aware rooms (with AV sensor <strong>and</strong> actuators)<br />

– Autonomous car robots (cognitive robotics)<br />

– Automatic learning to identify specific people in audio <strong>and</strong> video.<br />

– WYSIWYH – What You See Is What You Hear<br />

– Machine learning / statistical computing<br />

– <strong>Computer</strong> vision<br />

– <strong>Image</strong> processing<br />

Automatic ≡ Learning Independently<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Who are you?<br />

Please introduce yourself one-byone


Illusions<br />

Five deadly illusions of students at the University of Augsburg<br />

• Scores / marks do not matter (especially in Vordiplom or Bachelor)<br />

• One week of learning is enough<br />

• Only the final academic degree matters, not the time needed to get it<br />

• You only have to be slightly better than the mass<br />

• I underst<strong>and</strong> everything, but stupidly memorizing <strong>and</strong> writing exams<br />

are boring<br />

If you think that any of these statements are true, you will have a<br />

hard time now <strong>and</strong> later.<br />

A university is a place for people who have a burning desire to learn,<br />

acquire knowledge, <strong>and</strong> apply it in novel ways!<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Passion Matters !<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


What is your passion?<br />

Why are you<br />

passionate about<br />

Uni Augsburg?<br />

What is your<br />

passion?<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Courses<br />

• SS 08<br />

– Lecture: <strong>Digital</strong> <strong>Image</strong> <strong>Processing</strong> (2+2)<br />

– Lecture: Bayesian Networks (2+2)<br />

– Lecture: Probabilistic Robotics (2+2)<br />

– Praktikum: Audio Signal <strong>Processing</strong> (2+4)<br />

– Praktikum: Video Signal <strong>Processing</strong> (2+4)<br />

– Oberseminar: Selected Aspects of Media Mining (2)<br />

• WS 07/08<br />

– Lecture: Media Mining I (4+2)<br />

– Praktikum: Audio Signal <strong>Processing</strong> (2+4)<br />

– Praktikum: Video Signal <strong>Processing</strong> (2+4)<br />

– Seminar: Object Recognition (2)<br />

– Oberseminar: Selected Aspects of Media Mining (2)<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Schedule<br />

• SS08: 14 Apr 2008 – 20 Jul 2008<br />

• Lecture (Room 207, Eichl. Str. [alte Uni])<br />

– Tue: 08:15-09:45<br />

– First lecture: Apr. 15 th , 2008<br />

• Exercise (Room 202, Eichl. Str. [alte Uni])<br />

– Wed : 10:00-11:30<br />

– First exercise: April 30 th , 2008<br />

• Important (National holidays):<br />

– No lecture on Tuesday May, 13 th<br />

• Examine (Room 207, Eichl. Str. [alte Uni])<br />

– Tue: 08:15-09:45<br />

– Jul. 15 th , 2008<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Outline<br />

1. Introduction<br />

2. Imaging Sensors<br />

3. Cameras & Camera<br />

Calibration<br />

4. <strong>Image</strong> Representation<br />

5. Color Spaces<br />

6. <strong>Image</strong> Statistics & <strong>Image</strong><br />

Models<br />

7. Pixel Operations<br />

8. Neighborhood<br />

Operations<br />

9. Filtering: Averages &<br />

Edges<br />

10.Salient Points<br />

11.Salient Point Descriptors<br />

12.Segmentation<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Book Recommendations<br />

• Bernd Jähne. <strong>Digital</strong> <strong>Image</strong> <strong>Processing</strong>. Springer Verlag. highly<br />

recommended<br />

Relevant chapters 1-5, 8, 10 (4 th edition)<br />

• David A. Forsyth <strong>and</strong> Jean Ponce. <strong>Computer</strong> Vision: A Modern<br />

Approach. Prentice Hall, Upper Saddle River, New Jersey 07458.<br />

(http://www.cs.berkeley.edu/~daf/book.html)<br />

• Martin Schader <strong>and</strong> Stefan Kuhlins. Programmieren in C++. Springer-<br />

Verlag. ISBN: 3540637761<br />

This is a perfect resource for all your questions relating C/C++;<br />

recommended if you are not skilled in C/C++<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Software Tools<br />

Every student must request access to MSDNAA-Software-Center<br />

Read instructions at http://www.informatik.uni-augsburg.de/service/msdn/<br />

Get <strong>and</strong> install Microsoft VisualStudio .NET 2005 or 2008<br />

OpenCV (Open Source <strong>Computer</strong> Vision Library)<br />

Universal toolbox for research <strong>and</strong> development in the field of <strong>Computer</strong> Vision (win & linux)<br />

Get <strong>and</strong> install OpenCV from http://sourceforge.net/projects/opencvlibrary<br />

Required include paths (must be set in your DevEnv):<br />

– C:\Programme\OpenCV\cv\include<br />

– C:\Programme\OpenCV\cxcore\include<br />

– C:\Programme\OpenCV\otherlibs\highgui<br />

Required library paths: C:\Programme\OpenCV\lib<br />

Required libraries:<br />

cxcore.lib, cv.lib, highgui.lib<br />

YOUR<br />

ACTION REQUIRED<br />

(AR)<br />

!!!<br />

Problems/deficits with/in C/C++:<br />

Martin Schader <strong>and</strong> Stefan Kuhlins. Programmieren in C++. Springer-Verlag. ISBN: 3540637761<br />

Required for Exercise<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Rules<br />

Exam:<br />

• Written examine: July 15th from 8:15-9:45am.<br />

• Lecture, exercise, <strong>and</strong> all specified reading notes are relevant for the exams<br />

independent of how thoroughly the reading notes have been discussed during the<br />

lecture.<br />

Exercise:<br />

• The homework (“Übungsblätter”) will be given out during the Tuesday lecture <strong>and</strong><br />

must be returned by the following Monday 10am sharply (no exceptions) by (a) email<br />

to Eva Hörster or (b) by delivering your solution into our mailbox at Eichleitnerstr. 30<br />

in the entrance hall. During the exercise lecture on Wednesday the students will have<br />

the chance to present their answers.<br />

• First homework will be given out on April, 22 nd 2008.<br />

• Exercise will start on Wednesday, April. 30 th 2008.<br />

• IMPORTANT: In order to be admitted to the final exam, students are required:<br />

– to score at least 50% of the points archievable on the weekly assignments<br />

– to attend the weekly exercise sessions (Wednesday, 10:00-11:30pm). Students are allowed<br />

to miss the exercise session at most three times. No exceptions<br />

Unterschleif<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Demos<br />

Media Content Analysis


Demo – Face Recognition<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Demo – Mouth Detection<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Demo – Bowl Detection<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Mining for Commercials<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Demos<br />

Effortless Sensor <strong>and</strong> Actuator<br />

Infrastructure


Business Scenario -<br />

Meeting room<br />

2<br />

PC server<br />

1<br />

<strong>Digital</strong> Home Scenario –<br />

Connected devices<br />

5<br />

1<br />

PDA’s or<br />

phones<br />

mobile PC<br />

6<br />

sensors<br />

3<br />

4<br />

wireless networks<br />

Requirements to make vision happen:<br />

• Sync between wireless I/O <strong>and</strong> wireless<br />

computing devices<br />

• 3D position <strong>and</strong> 3D orientation of<br />

sensor/actuators<br />

Enable multi-microphone conferencing using<br />

available computing devices in room (laptop,<br />

PDA, cell phones, other sensors)<br />

Enable true 3D audio in the home without<br />

tedious calibration<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Goal<br />

Given<br />

– A location/place<br />

– N c cameras,<br />

– N m microphones,<br />

– N d displays,<br />

– N l loudspeakers,<br />

– Enough computational<br />

resources<br />

– Enough power outlets at the<br />

right place<br />

– Wired/Wireless connectivity<br />

get a smart room running within<br />

an hour (55 minutes for<br />

connecting cables, 5 minutes<br />

for calibration).<br />

What’s needed:<br />

– Simple, scalable, selforganizing,<br />

self-healing, autocalibrating<br />

system setup<br />

Believe:<br />

– This would be a perfect<br />

system for us researchers<br />

– Progress <strong>and</strong> creativity is<br />

hindered by current inflexible<br />

<strong>and</strong> non-scalable systems<br />

(e.g., Aware Home Research<br />

Initiative (AHRI))<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Position Calibration of Audio Sensors/Actuators<br />

Z<br />

Y<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de<br />

X


Speaker<br />

2<br />

Speaker<br />

4<br />

Room Length = 4.22 m<br />

Synchronized setup<br />

bias 0.08 cm sigma 3.8 cm<br />

Room Height = 2.03 m<br />

Speaker<br />

3<br />

1 2<br />

Mic<br />

3<br />

Mic<br />

4<br />

Mic<br />

2<br />

Mic<br />

1<br />

Speaker<br />

1<br />

Z<br />

Room Width = 2.55 m<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Providing a Common Space for Multiple<br />

Cameras <strong>and</strong> Flat-Panel Displays<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Intrinsic Calibration<br />

• Control point extraction:<br />

– Pattern is displayed on a laptop screen<br />

– <strong>Image</strong>s from different orientations are<br />

captured by waving the screen<br />

– Projected pattern points are determined by<br />

SIFT-feature matching<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Extrinsic Calibration of Multiple Cameras<br />

Bundle adjustment is superior method<br />

min<br />

ˆ j ˆ<br />

<br />

– Minimization of the reprojection error : P , Xi<br />

ij<br />

d( ˆ<br />

j<br />

P Xˆ<br />

i<br />

, x<br />

j<br />

i<br />

)<br />

2<br />

Initialization of P j <strong>and</strong> X i :<br />

– Cameras are partitioned into<br />

manageable subgroups that<br />

share a common view<br />

(triplets)<br />

– Hierarchical registration of<br />

the different coordinate<br />

systems based on common<br />

cameras/points<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Optimal Sensor Placement<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Wie gewinnt man ein<br />

Rennen mit<br />

computergesteuerten<br />

Autos?


Darpa Gr<strong>and</strong> Challenge<br />

• First held in 2004, 2nd held on Oct, 8 th 2005<br />

• Designed to accelerate research <strong>and</strong> development in<br />

autonomous ground vehicle technology that will help<br />

save lives on the battlefield<br />

• Team whose fully autonomous ground vehicle finishes a<br />

course, from Los Angeles to Las Vegas, in the fastest<br />

time <strong>and</strong> under 10 hours, wins $1M (2004) / $2M (2005).<br />

• Course:<br />

– defined by ~ 1000 waypoints, given 2 hrs before race.<br />

– on-road, off-road, trails, water, obstacles<br />

– 2005: 131.6 miles in Mohave Desert; ~ 25 mph average speed<br />

• Only publicly available signals (e.g., GPS) may be used.<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Terrain (1)<br />

<strong>Image</strong> taken from<br />

http://www.darpa.mil/gr<strong>and</strong>challenge04/media_images.htm<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


<strong>Image</strong> taken from<br />

http://www.darpa.mil/gr<strong>and</strong>challenge04/media_images.htm<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


<strong>Computer</strong> Vision Problem<br />

- Lasers have limited range<br />

- Car is bouncing<br />

- Maximum speed is 25 mph but we want to go 35 mph<br />

- 25/35 mph decision is made with computer vision<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Define „Street“<br />

- Laser maps 3D terrain ahead of the car<br />

- Finds path <strong>and</strong> flat road surface<br />

- Projects 3D data into 2D video image<br />

- Learn the looks of the street using machine learning<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Extend “Street”<br />

- What we know for sure: laser brick is road<br />

- Use color texture under laser brick in order to extrapolate<br />

road to the horizon<br />

- Improve raw results by appropriate heuristics<br />

Unfiltered Result<br />

Filtered Result<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de


Gewinner 2005<br />

http://www.stanfordracing.org/<br />

6:53:58 19.1mph<br />

Start demo<br />

© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />

Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!