13.07.2015 Views

Dictionary Alignment for Context-sensitive Word Glossing

Dictionary Alignment for Context-sensitive Word Glossing

Dictionary Alignment for Context-sensitive Word Glossing

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Figure 5: The output of word-translating Japanese definition sentences to English<strong>Word</strong>Net sense pairing, we extract the hypernymsof the respective senses and expand the definitionsentences with the definition sentences from the hypernyms.The term vectors are then based on thisexpanded term set, similar to query expansion in in<strong>for</strong>mationretrieval.5 Experimental Setup5.1 Gold-standard dataTo evaluate the per<strong>for</strong>mance of our system, werandomly selected 100 words from Lexeed, extractedout the Lexeed–<strong>Word</strong>Net sense pairings asdescribed above, and manually selected the goldstandardalignments from amongst them. The 100words were associated with a total of 268 Lexeedsenses and 772 <strong>Word</strong>Net senses, creating a total of206,896 possible alignment pairs. Of these, 259alignments were selected as our gold-standard.We encountered a number of partial matches thatwere caused by the Japanese word being more specificthan its English counterparts (as identified byour <strong>Word</strong>Net matching method). For example,kakkazan is translated as “active volcano”. Since<strong>Word</strong>Net does not have any entry <strong>for</strong> active volcano,the longest right word substring that matchesin <strong>Word</strong>Net is simply volcano. The definition sentencesreturned by Lexeed describe kakkazan as “avolcano which still can erupt” and “a volcano thatwill soon erupt”, while volcano is described as “afissure in the earth’s crust (or in the surface of someother planet) through which molten lava and gaseserupt” and “a mountain <strong>for</strong>med by volcanic material”.Although there is some similarity betweenthese definitions (namely key words such as eruptand volcano), we do not include this pairing in ourgold-standard alignment data.5.2 BaselineAs a baseline, we take the most-frequent sense ofeach of the 100 random words from Lexeed, andmatch it with the synset with the highest SemCorfrequency count out of all the candidate synsets.5.3 ThresholdingAll our calculations are based on cosine similarity,which returns a similarity between 0 and 1, with 1being an exact match. In its simplest <strong>for</strong>m, we wouldidentify the unique <strong>Word</strong>Net sense with highest similarityto each Lexeed sense, irrespective of the magnitudeof the similarity. This has the dual disadvantageof allowing only one <strong>Word</strong>Net sense <strong>for</strong> eachLexeed sense, and potentially <strong>for</strong>cing alignments tobe made on low similarity values. A more reasonableapproach is to apply a threshold x, and treatall <strong>Word</strong>Net senses with similarity greater than x asbeing aligned with the Lexeed sense. Thresholdingalso gives us more flexibility in terms of tuning theper<strong>for</strong>mance of our method: at higher threshold values,we can hope to increase precision at the expenseof recall, and at lower threshold values, we can hopeto increase recall at the expense of precision.5.4 Evaluation metricsTo evaluate the per<strong>for</strong>mance of our system, we useprecision, recall and F-score. In an alignment context,precision is defined as the proportion of correctalignments to all alignments returned by the system,and recall is defined as the proportion of the correctalignments returned by our system to all the align-130

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!