17.08.2013 Views

Mr Kiran Varanasi / Dr Fabio Cuzzolin Applicant Career Summary

Mr Kiran Varanasi / Dr Fabio Cuzzolin Applicant Career Summary

Mr Kiran Varanasi / Dr Fabio Cuzzolin Applicant Career Summary

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>Mr</strong> <strong>Kiran</strong> <strong>Varanasi</strong> / <strong>Dr</strong> <strong>Fabio</strong> <strong>Cuzzolin</strong> Newton International Fellowships - 2010<br />

Created: Friday, January 15, 2010 15:09 [Approved]<br />

During my Ph.D, I have worked on the problem of motion estimation without<br />

segmenting an object from background clutter. I demonstrated results on scenes<br />

involving multiple actors interacting with each other. To achieve this, I used a<br />

mesh evolution framework that handles topological changes. I proposed methods<br />

for computing local 3D features on objects, and algorithms for matching shapes<br />

that rely on such local features. Such methods can handle partial shape matching,<br />

and work even when the object is not entirely visible to the sensor.<br />

In the current project, we aim at combining the strengths of the two approaches<br />

developed by applicant and co-applicant and fit them into a statistical learning<br />

framework. The first challenge that we face is the huge dimensionality of the input<br />

data. Even though adaptive spectral embeddings translate this input into a new<br />

space of much lower dimensionality, they have to be first learnt by training. When<br />

the dimensionality of the input data is prohibitively large (as is in our case), the<br />

learning takes too much time and needs too much memory space. We would like<br />

to attack this problem by using the idea of "compressed sensing" that has recently<br />

become popular in the signal processing community. We shall code the input data<br />

using random projections that are linear, sparsity oriented and metric-preserving.<br />

This vastly reduces the dimensionality of the feature space and makes adaptive<br />

embeddings tractable.<br />

Adaptive spectral embeddings can handle global matching, but not partial<br />

matching. Our second challenge is to fit these embeddings into a partial matching<br />

framework. Similar to my Ph.D work, we would like to derive feature descriptors<br />

on these embeddings that are local in scope, and which can be used for matching<br />

shapes amidst background clutter. Ideally, we should be able to learn body-parts<br />

of the shapes, with their distinctive shape signatures.<br />

The third and final challenge that we will have to address is that of unsupervised<br />

learning over a large database of example shapes and poses. This learning will<br />

further improve the descriptiveness of our features. For example, statistical<br />

learning can restrict the lower arm of a human body to have a joint angle not more<br />

than 180 degrees with the upper arm. We seek to learn the shape and motion<br />

statistics of arbitrary shapes, not just human beings. For example, we can learn<br />

the statistics of men riding bicycles, or of children playing with their pets.<br />

The potential applications of the proposed work are numerous. A straightforward<br />

application, for instance, is kinematic motion capture in outdoor environments,<br />

crucial in the entertainment industry, such as in film production and gaming. A<br />

second natural application is automatic visual surveillance, especially in crowded<br />

and cluttered scenes where the suspect is on the run. An additional example<br />

concerns the assistance of elderly people, through monitoring for signs of ageing<br />

and weakness in their limb movements. A fourth application could be in sports<br />

analysis, where the movements of a sports-person are analysed for their<br />

correctness. Due to the general-purpose nature of our approach, it shall be able<br />

for deployment amidst multiple people performing diverse tasks, and in outdoor<br />

environments. This opens doors for several new applications, to a degree that is<br />

not possible with today's technology.<br />

Apart from practical applications, there shall also be theoretical breakthroughs that<br />

would be of interest, for various other fields that fit in the broad scope of pervasive<br />

computing. Bringing together the two fields of manifold learning and compressed<br />

sensing fits the broader research goal of <strong>Dr</strong>. <strong>Cuzzolin</strong>, as outlined by his<br />

collaboration with other European centers of research. The task of analysing<br />

motion (and human activities in particular) fits well with the long-term goals of the<br />

Vision Group of Oxford Brookes University, which has produced several worldclass<br />

publications in this field. I would like to benefit from the strong collaborations<br />

that tie the group in Oxford Brookes to other centres of excellence in the world,<br />

particularly to the Visual Geometry group in the University of Oxford and to the<br />

Page 6 of 8

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!