30.11.2012 Views

Automotive User Interfaces and Interactive Vehicular Applications

Automotive User Interfaces and Interactive Vehicular Applications

Automotive User Interfaces and Interactive Vehicular Applications

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Information Extraction from the World Wide Web Using a<br />

Speech Interface<br />

Hansjörg Hofmann, Ute Ehrlich<br />

Daimler AG<br />

Ulm, Germany<br />

{hansjoerg.hofmann,ute.ehrlich}@daimler.com<br />

ABSTRACT<br />

Due to the mobile internet revolution, people tend to browse<br />

the World Wide Web while driving their car which puts<br />

the driver’s safety at risk. Therefore, a speech interface to<br />

the Web integrated in the car’s head unit needs to be developed.<br />

In this paper, we present a speech dialog system<br />

which enables the user to extract topic related information<br />

from web sites with unknown page structures. One challenge<br />

is to extract <strong>and</strong> underst<strong>and</strong> the requested information from<br />

the web site which is achieved by parsing the HTML code<br />

against a predefined semantic net where special topics (e.g.<br />

“weather”) are modelled. The extracted information is the<br />

basis for the generic speech dialog which is designed in an<br />

intuitive <strong>and</strong> driver-convenient way in order to not distract<br />

the user.<br />

Categories <strong>and</strong> Subject Descriptors<br />

H.5.2 [Information <strong>Interfaces</strong> <strong>and</strong> Presentation]: <strong>User</strong><br />

<strong>Interfaces</strong><br />

1. MOTIVATION<br />

The arrival of smartphones has shaped new expectations<br />

towards mobile devices: Using mobile internet the user is<br />

able to get instant access to content-relevant information,<br />

infotainment systems, <strong>and</strong> services, anytime <strong>and</strong> anywhere.<br />

The success of smartphones also significantly impacts automotive<br />

systems. However, since for safety reasons smartphones<br />

<strong>and</strong> similar existing technologies cannot be used while<br />

driving, cars are the only place where accessing web sites on<br />

a regular basis is not possible, yet. Reports from the U.S.<br />

Department of Transportation[4] revealed that 20 percent<br />

of injury crashes involved distracted driving. While driving<br />

a vehicle browsing the Web by using the car’s head unit<br />

would distract the user <strong>and</strong> puts the drivers’ safety at risk.<br />

Therefore, when bringing Internet to the car a speech-based<br />

interface which provides a driver-convenient, audible representation<br />

of the content needs to be developed. Currently,<br />

browsing the World Wide Web is only achieved by using<br />

haptic input modalities <strong>and</strong> a visual browser representation.<br />

Copyright held by author(s)<br />

<strong>Automotive</strong> UI ’11, November 29-December 2, 2011, Salzburg, Austria<br />

Adjunct Proceedings.<br />

- 9 -<br />

Andreas Eberhardt<br />

BitTwister IT GmbH<br />

Senden, Germany<br />

<strong>and</strong>reas.eberhardt@bittwister.com<br />

However, in a driving environment this user interface is not<br />

feasible. Therefore, it is crucial to develop an intelligent web<br />

scraping algorithm which transforms semi-structured Web<br />

content in a representation accessible by the speech dialog.<br />

Attempts to access the World Wide Web by speech have<br />

been made in different ways. Poon et al.[5], amongst others,<br />

have introduced voice browsers which is not applicable in<br />

the automotive environment due to high cognitive load <strong>and</strong><br />

time consumption. In the SmartWeb project[1] the user can<br />

ask questions about the Soccer World Cup <strong>and</strong> the system<br />

provides answers retrieved from amongst others, a knowledge<br />

base containing information extracted from the FIFA<br />

web site. Nevertheless, this algorithm is only able to extract<br />

information from dedicated web sites with known web page<br />

structures.<br />

In this paper, we present a speech dialog system (SDS)<br />

designed in a driver convenient way which allows the user<br />

to retrieve topic related information from the World Wide<br />

Web which is available in an HTML structure. The core<br />

component of the proposed approach is an ontology which<br />

models the mentioned topics. The speech dialog is modelled<br />

according to the ontology <strong>and</strong> the information extraction<br />

(IE) component parses HTML code against the predefined<br />

semantic net. In this way, information from web sites with<br />

unknown web page structures can be extracted, understood<br />

<strong>and</strong> accessed by speech.<br />

2. INFORMATION EXTRACTION FROM<br />

SEMI-STRUCTURED WEB SITES<br />

Previous approaches to IE from web sites focus on the web<br />

page <strong>and</strong> try to extract relevant information only from the<br />

corresponding HTML DOM tree. In our approach, we first<br />

define an ontology to a certain topic <strong>and</strong> use topic-related<br />

web sites as our source to find matching information. The<br />

ontology the web site’s content is mapped on is defined in<br />

KL-ONE[2] because of its simplicity <strong>and</strong> sufficient modeling<br />

abilities. The IE <strong>and</strong> the semantic analysis of a web site is<br />

illustrated in Figure 1 <strong>and</strong> explained in the following.<br />

First, the HTML parser analyzes the web site <strong>and</strong> generates<br />

a preliminary internal graph representation. Hereby,<br />

dynamic contents are processed, embedded frames are loaded<br />

<strong>and</strong> referenced web pages are taken into consideration.<br />

In the second step, the text parser analyzes the textual<br />

content. By applying topic-oriented grammar definitions,<br />

the text parser generates initial concept hypotheses of the<br />

semantic model for each textual content.<br />

Since the following matching algorithm is computationally<br />

intensive, the current cyclic graph needs to be transformed

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!