Digital Image Processing - Multimedia Computing and Computer ...
Digital Image Processing - Multimedia Computing and Computer ...
Digital Image Processing - Multimedia Computing and Computer ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
<strong>Digital</strong> <strong>Image</strong> <strong>Processing</strong><br />
SS 2008<br />
Prof. Dr. Rainer Lienhart<br />
www.multimedia-computing.{de,org}
Prelude<br />
• Language<br />
– Lecture: in English, but questions can be phrased in<br />
English or German<br />
– Exam: Questions are in English, but answers can be<br />
given in English or German<br />
– Homework: Questions are in English, but answer can<br />
be given in English or German<br />
– Practice lecture (Übung): in German<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Introduction (1)<br />
• Who am I?<br />
• What are the research topics?<br />
– Automatic content analysis of images, videos, <strong>and</strong> audio<br />
(<strong>Multimedia</strong> Content Analysis)<br />
– Automatic content analysis of multimodal <strong>and</strong> multi-lingual documents<br />
– Data Mining on large scale distributed multimedia databases<br />
(<strong>Multimedia</strong> Google)<br />
– Aware rooms (with AV sensor <strong>and</strong> actuators)<br />
– Autonomous car robots (cognitive robotics)<br />
– Automatic learning to identify specific people in audio <strong>and</strong> video.<br />
– WYSIWYH – What You See Is What You Hear<br />
– Machine learning / statistical computing<br />
– <strong>Computer</strong> vision<br />
– <strong>Image</strong> processing<br />
Automatic ≡ Learning Independently<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Who are you?<br />
Please introduce yourself one-byone
Illusions<br />
Five deadly illusions of students at the University of Augsburg<br />
• Scores / marks do not matter (especially in Vordiplom or Bachelor)<br />
• One week of learning is enough<br />
• Only the final academic degree matters, not the time needed to get it<br />
• You only have to be slightly better than the mass<br />
• I underst<strong>and</strong> everything, but stupidly memorizing <strong>and</strong> writing exams<br />
are boring<br />
If you think that any of these statements are true, you will have a<br />
hard time now <strong>and</strong> later.<br />
A university is a place for people who have a burning desire to learn,<br />
acquire knowledge, <strong>and</strong> apply it in novel ways!<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Passion Matters !<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
What is your passion?<br />
Why are you<br />
passionate about<br />
Uni Augsburg?<br />
What is your<br />
passion?<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Courses<br />
• SS 08<br />
– Lecture: <strong>Digital</strong> <strong>Image</strong> <strong>Processing</strong> (2+2)<br />
– Lecture: Bayesian Networks (2+2)<br />
– Lecture: Probabilistic Robotics (2+2)<br />
– Praktikum: Audio Signal <strong>Processing</strong> (2+4)<br />
– Praktikum: Video Signal <strong>Processing</strong> (2+4)<br />
– Oberseminar: Selected Aspects of Media Mining (2)<br />
• WS 07/08<br />
– Lecture: Media Mining I (4+2)<br />
– Praktikum: Audio Signal <strong>Processing</strong> (2+4)<br />
– Praktikum: Video Signal <strong>Processing</strong> (2+4)<br />
– Seminar: Object Recognition (2)<br />
– Oberseminar: Selected Aspects of Media Mining (2)<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Schedule<br />
• SS08: 14 Apr 2008 – 20 Jul 2008<br />
• Lecture (Room 207, Eichl. Str. [alte Uni])<br />
– Tue: 08:15-09:45<br />
– First lecture: Apr. 15 th , 2008<br />
• Exercise (Room 202, Eichl. Str. [alte Uni])<br />
– Wed : 10:00-11:30<br />
– First exercise: April 30 th , 2008<br />
• Important (National holidays):<br />
– No lecture on Tuesday May, 13 th<br />
• Examine (Room 207, Eichl. Str. [alte Uni])<br />
– Tue: 08:15-09:45<br />
– Jul. 15 th , 2008<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Outline<br />
1. Introduction<br />
2. Imaging Sensors<br />
3. Cameras & Camera<br />
Calibration<br />
4. <strong>Image</strong> Representation<br />
5. Color Spaces<br />
6. <strong>Image</strong> Statistics & <strong>Image</strong><br />
Models<br />
7. Pixel Operations<br />
8. Neighborhood<br />
Operations<br />
9. Filtering: Averages &<br />
Edges<br />
10.Salient Points<br />
11.Salient Point Descriptors<br />
12.Segmentation<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Book Recommendations<br />
• Bernd Jähne. <strong>Digital</strong> <strong>Image</strong> <strong>Processing</strong>. Springer Verlag. highly<br />
recommended<br />
Relevant chapters 1-5, 8, 10 (4 th edition)<br />
• David A. Forsyth <strong>and</strong> Jean Ponce. <strong>Computer</strong> Vision: A Modern<br />
Approach. Prentice Hall, Upper Saddle River, New Jersey 07458.<br />
(http://www.cs.berkeley.edu/~daf/book.html)<br />
• Martin Schader <strong>and</strong> Stefan Kuhlins. Programmieren in C++. Springer-<br />
Verlag. ISBN: 3540637761<br />
This is a perfect resource for all your questions relating C/C++;<br />
recommended if you are not skilled in C/C++<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Software Tools<br />
Every student must request access to MSDNAA-Software-Center<br />
Read instructions at http://www.informatik.uni-augsburg.de/service/msdn/<br />
Get <strong>and</strong> install Microsoft VisualStudio .NET 2005 or 2008<br />
OpenCV (Open Source <strong>Computer</strong> Vision Library)<br />
Universal toolbox for research <strong>and</strong> development in the field of <strong>Computer</strong> Vision (win & linux)<br />
Get <strong>and</strong> install OpenCV from http://sourceforge.net/projects/opencvlibrary<br />
Required include paths (must be set in your DevEnv):<br />
– C:\Programme\OpenCV\cv\include<br />
– C:\Programme\OpenCV\cxcore\include<br />
– C:\Programme\OpenCV\otherlibs\highgui<br />
Required library paths: C:\Programme\OpenCV\lib<br />
Required libraries:<br />
cxcore.lib, cv.lib, highgui.lib<br />
YOUR<br />
ACTION REQUIRED<br />
(AR)<br />
!!!<br />
Problems/deficits with/in C/C++:<br />
Martin Schader <strong>and</strong> Stefan Kuhlins. Programmieren in C++. Springer-Verlag. ISBN: 3540637761<br />
Required for Exercise<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Rules<br />
Exam:<br />
• Written examine: July 15th from 8:15-9:45am.<br />
• Lecture, exercise, <strong>and</strong> all specified reading notes are relevant for the exams<br />
independent of how thoroughly the reading notes have been discussed during the<br />
lecture.<br />
Exercise:<br />
• The homework (“Übungsblätter”) will be given out during the Tuesday lecture <strong>and</strong><br />
must be returned by the following Monday 10am sharply (no exceptions) by (a) email<br />
to Eva Hörster or (b) by delivering your solution into our mailbox at Eichleitnerstr. 30<br />
in the entrance hall. During the exercise lecture on Wednesday the students will have<br />
the chance to present their answers.<br />
• First homework will be given out on April, 22 nd 2008.<br />
• Exercise will start on Wednesday, April. 30 th 2008.<br />
• IMPORTANT: In order to be admitted to the final exam, students are required:<br />
– to score at least 50% of the points archievable on the weekly assignments<br />
– to attend the weekly exercise sessions (Wednesday, 10:00-11:30pm). Students are allowed<br />
to miss the exercise session at most three times. No exceptions<br />
Unterschleif<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Demos<br />
Media Content Analysis
Demo – Face Recognition<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Demo – Mouth Detection<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Demo – Bowl Detection<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Mining for Commercials<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Demos<br />
Effortless Sensor <strong>and</strong> Actuator<br />
Infrastructure
Business Scenario -<br />
Meeting room<br />
2<br />
PC server<br />
1<br />
<strong>Digital</strong> Home Scenario –<br />
Connected devices<br />
5<br />
1<br />
PDA’s or<br />
phones<br />
mobile PC<br />
6<br />
sensors<br />
3<br />
4<br />
wireless networks<br />
Requirements to make vision happen:<br />
• Sync between wireless I/O <strong>and</strong> wireless<br />
computing devices<br />
• 3D position <strong>and</strong> 3D orientation of<br />
sensor/actuators<br />
Enable multi-microphone conferencing using<br />
available computing devices in room (laptop,<br />
PDA, cell phones, other sensors)<br />
Enable true 3D audio in the home without<br />
tedious calibration<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Goal<br />
Given<br />
– A location/place<br />
– N c cameras,<br />
– N m microphones,<br />
– N d displays,<br />
– N l loudspeakers,<br />
– Enough computational<br />
resources<br />
– Enough power outlets at the<br />
right place<br />
– Wired/Wireless connectivity<br />
get a smart room running within<br />
an hour (55 minutes for<br />
connecting cables, 5 minutes<br />
for calibration).<br />
What’s needed:<br />
– Simple, scalable, selforganizing,<br />
self-healing, autocalibrating<br />
system setup<br />
Believe:<br />
– This would be a perfect<br />
system for us researchers<br />
– Progress <strong>and</strong> creativity is<br />
hindered by current inflexible<br />
<strong>and</strong> non-scalable systems<br />
(e.g., Aware Home Research<br />
Initiative (AHRI))<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Position Calibration of Audio Sensors/Actuators<br />
Z<br />
Y<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de<br />
X
Speaker<br />
2<br />
Speaker<br />
4<br />
Room Length = 4.22 m<br />
Synchronized setup<br />
bias 0.08 cm sigma 3.8 cm<br />
Room Height = 2.03 m<br />
Speaker<br />
3<br />
1 2<br />
Mic<br />
3<br />
Mic<br />
4<br />
Mic<br />
2<br />
Mic<br />
1<br />
Speaker<br />
1<br />
Z<br />
Room Width = 2.55 m<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Providing a Common Space for Multiple<br />
Cameras <strong>and</strong> Flat-Panel Displays<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Intrinsic Calibration<br />
• Control point extraction:<br />
– Pattern is displayed on a laptop screen<br />
– <strong>Image</strong>s from different orientations are<br />
captured by waving the screen<br />
– Projected pattern points are determined by<br />
SIFT-feature matching<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Extrinsic Calibration of Multiple Cameras<br />
Bundle adjustment is superior method<br />
min<br />
ˆ j ˆ<br />
<br />
– Minimization of the reprojection error : P , Xi<br />
ij<br />
d( ˆ<br />
j<br />
P Xˆ<br />
i<br />
, x<br />
j<br />
i<br />
)<br />
2<br />
Initialization of P j <strong>and</strong> X i :<br />
– Cameras are partitioned into<br />
manageable subgroups that<br />
share a common view<br />
(triplets)<br />
– Hierarchical registration of<br />
the different coordinate<br />
systems based on common<br />
cameras/points<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Optimal Sensor Placement<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Wie gewinnt man ein<br />
Rennen mit<br />
computergesteuerten<br />
Autos?
Darpa Gr<strong>and</strong> Challenge<br />
• First held in 2004, 2nd held on Oct, 8 th 2005<br />
• Designed to accelerate research <strong>and</strong> development in<br />
autonomous ground vehicle technology that will help<br />
save lives on the battlefield<br />
• Team whose fully autonomous ground vehicle finishes a<br />
course, from Los Angeles to Las Vegas, in the fastest<br />
time <strong>and</strong> under 10 hours, wins $1M (2004) / $2M (2005).<br />
• Course:<br />
– defined by ~ 1000 waypoints, given 2 hrs before race.<br />
– on-road, off-road, trails, water, obstacles<br />
– 2005: 131.6 miles in Mohave Desert; ~ 25 mph average speed<br />
• Only publicly available signals (e.g., GPS) may be used.<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Terrain (1)<br />
<strong>Image</strong> taken from<br />
http://www.darpa.mil/gr<strong>and</strong>challenge04/media_images.htm<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
<strong>Image</strong> taken from<br />
http://www.darpa.mil/gr<strong>and</strong>challenge04/media_images.htm<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
<strong>Computer</strong> Vision Problem<br />
- Lasers have limited range<br />
- Car is bouncing<br />
- Maximum speed is 25 mph but we want to go 35 mph<br />
- 25/35 mph decision is made with computer vision<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Define „Street“<br />
- Laser maps 3D terrain ahead of the car<br />
- Finds path <strong>and</strong> flat road surface<br />
- Projects 3D data into 2D video image<br />
- Learn the looks of the street using machine learning<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Extend “Street”<br />
- What we know for sure: laser brick is road<br />
- Use color texture under laser brick in order to extrapolate<br />
road to the horizon<br />
- Improve raw results by appropriate heuristics<br />
Unfiltered Result<br />
Filtered Result<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de
Gewinner 2005<br />
http://www.stanfordracing.org/<br />
6:53:58 19.1mph<br />
Start demo<br />
© 2004-2008, Prof. Dr. R. Lienhart, Head of <strong>Multimedia</strong> <strong>Computing</strong>, Institut für Informatik, Universität Augsburg<br />
Eichleitnerstr. 30, D-86135 Augsburg, Germany; email: Rainer.Lienhart@informatik.uni-augsburg.de