29.04.2015 Views

ツール班協力者:東京工業大学大学院情報理工 - 奈良先端科学技術 ...

ツール班協力者:東京工業大学大学院情報理工 - 奈良先端科学技術 ...

ツール班協力者:東京工業大学大学院情報理工 - 奈良先端科学技術 ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

BCCWJ <br />

<br />

∗<br />

<br />

Annotating Predicate-Argument Structure and Anaphoric Relations<br />

to BCCWJ<br />

Mamoru Komachi (Nara Institute of Science and Technology)<br />

Ryu Iida (Tokyo Institute of Technology)<br />

1 <br />

<br />

Gildea and Jurafsky (2002) <br />

(Semantic role labeling) Fillmore and Baker (2000) <br />

PropBank (Palmer et al., 2005) <br />

<br />

CoNLL (Conference on Computational Natural Language Learning)<br />

2004–2005, 2008–2009<br />

CoNLL 2011 PropBank <br />

OntoNotes (Hovy et al., 2006) <br />

<br />

(Harabagiu et al., 2005; Surdeanu et al., 2003) <br />

<br />

<br />

Meyers et al. (2004a) NomBank (Meyers et al., 2004c,b) <br />

<br />

(Gerber and Chai, 2010) ACL <br />

<br />

4.0 (Kawahara et al., 2002) <br />

<br />

GDA (Hasida, 2005) agent theme <br />

NAIST (, 2010) <br />

3.0 <br />

KNB (, 2009) <br />

<br />

(2009) <br />

BCCWJ <br />

(, 2010) NAIST <br />

(, 2004) <br />

∗ komachi@is.naist.jp


1: BCCWJ <br />

<br />

<br />

V V <br />

V V V <br />

V V <br />

V V <br />

V <br />

2 BCCWJ <br />

2.1 <br />

<br />

<br />

BCCWJ NAIST <br />

1.5β(, 2010) 1 <br />

2 <br />

agent theme PropBank <br />

ARG0 ARG1 NAIST<br />

<br />

<br />

(1) <br />

4.0 =, =<br />

, ==<br />

, =<br />

<br />

3 =<br />

1 <br />

(2) <br />

<br />

=, =<br />

NAIST FrameNet NOMLEX (Macleod<br />

et al., 1997, 1998) <br />

(2007); (2008) <br />

4 (, 2004) <br />

<br />

(3) <br />

1 http://cl.naist.jp/ ∼ ryu-i/coreference tag.html<br />

2 https://sites.google.com/site/naistcorpus/predicate tag<br />

3 <br />

4 http://cl.it.okayama-u.ac.jp/rsc/lcs


3 <br />

==<br />

<br />

(4) <br />

<br />

0.902 5 (, 2007) <br />

<br />

<br />

2.2 <br />

<br />

<br />

<br />

<br />

<br />

(5) <br />

6 <br />

<br />

(6) <br />

<br />

<br />

(2010) /<br />

4 <br />

2.3 <br />

BCCWJ <br />

<br />

2 <br />

<br />

(7) iPhone <br />

<br />

(8) iPad <br />

iPad <br />

Mitkov (2002) identity-ofreference<br />

anaphora (IRA) identityof-sense<br />

anaphora (ISA) <br />

5 http://cl.it.okayama-u.ac.jp/rsc/data/index.html<br />

6 +


2: BCCWJ 4 <br />

<br />

PN 478 5,730 127,077 A <br />

PB 55 4,691 113,399 A <br />

OW 30 2,414 100,396 A <br />

OC 938 6,402 103,188 B <br />

BCCWJ NAIST (, 2010) <br />

<br />

IRA <br />

NAIST bridging reference (Inoue et al., 2010) <br />

BCCWJ NAIST <br />

<br />

2.4 <br />

2011 2 7 BCCWJ <br />

2 UniDic 1.3.12 7 MeCab 0.98 8 <br />

<br />

<br />

<br />

9 90 1,653 (2010) <br />

<br />

3 A NAIST <br />

B,C <br />

A-B (2010) <br />

8 A-C <br />

A-B C <br />

<br />

A-C <br />

<br />

<br />

<br />

<br />

NomBank <br />

(, 2010) <br />

<br />

7 https://www.tokuteicorpus.jp/dist/index.php<br />

8 http://mecab.sourceforge.net/


3: <br />

A-B <br />

A-C <br />

<br />

87.1 (155/178) 89.6 (155/173) 80.1 (133/166) 76.9 (133/173)<br />

80.4 (123/153) 100.0 (123/123) 96.2 (101/105) 100.0 (101/101)<br />

96.3 (77/80) 98.7 (77/80) 85.3 (58/68) 96.7 (58/60)<br />

82.1 (23/28) 88.5 (23/26) 90.5 (19/21) 59.4 (19/32)<br />

93.9 (93/99) 84.5 (93/110) 72.9 (70/96) 63.6 (70/110)<br />

83.5 (76/91) 100.0 (76/76) 80.0 (32/40) 100.0 (32/32)<br />

59.0 (13/22) 100.0 (13/13) 52.4 (11/21) 73.3 (11/15)<br />

77.8 (7/9) 100.0 (7/7) 50.0 (4/8) 100.0 (4/4)<br />

3 <br />

NAIST <br />

BCCWJ NAIST <br />

<br />

<br />

<br />

OntoNotes (Hovy et al., 2006) BCCWJ <br />

<br />

90%<br />

BCCWJ NAIST <br />

90%Slate (<br />

, 2010) <br />

<br />

<br />

<br />

<br />

<br />

<br />

Fillmore, Charles J. and Collin F. Baker (2000) “FrameNet: Frame semantics meets the corpus,” in Proceedings<br />

of the 74th Annual Meeting of the Linguistic Society of America.<br />

Gerber, Matthew and Joyce Y. Chai (2010) “Beyond NomBank: a Study of Implicit Arguments for Nominal<br />

Predicates,” in Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics,<br />

pp. 1583–1592.<br />

Gildea, Daniel and Daniel Jurafsky (2002) “Automatic Labeling of Semantic Roles,” Computational Linguistics,<br />

Vol. 28, No. 3, pp. 245-288.<br />

Harabagiu, Sanda, Cosmin Adrian Bejan, and Paul Morarescu (2005) “Shallow Semantics for Relation<br />

Extraction,” in Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence<br />

(IJCAI ’05), pp. 1061–1066.<br />

Hasida, Koiti (2005) GDA 0.74 <br />

http://i-content.org/gda/tagman.html


Hovy, Eduard, Mitchell Marcus, Martha Palmer, Lance Ramshaw, and Ralph Weischedel (2006)<br />

“OntoNotes: The 90% Solution,” in Proceedings of the Human Language Technology Conference of<br />

the North American Chapter of the ACL, pp. 57–60.<br />

Inoue, Naoya, Ryu Iida, Kentaro Inui, and Yuji Matsumoto (2010) “Resolving Direct and Indirect Anaphora<br />

for Japanese Definite Noun Phrases,” Journal of Natural Language Processing, Vol. 17, No. 1, pp. 221–<br />

246.<br />

Kawahara, Daisuke, Sadao Kurohashi, and Koiti Hasida (2002) “Construction of a Japanese Relevancetagged<br />

Corpus,” in Proceedings of the 3rd International Conference on Language Resources and Evaluation<br />

(LREC), pp. 2008–2013.<br />

Macleod, Catherine, Ralph Grishman, Adam Meyers, Leslie Barrett, and Ruth Reeves (1998) “NOMLEX:<br />

A Lexicon of Nominalizations,” in Proceedings of Euralex98, pp. 187–193.<br />

Macleod, Cathrine, Adam Meyers, Ralph Grishman, Leslie Barret, and Ruth Reeves (1997) “Designing a<br />

Dictionary of Derived Nominals,” in Proceedings of Recent Advances in Natural Language Processing,<br />

pp. 142–151.<br />

Meyers, Adam, Ruth Reeves, Catherine Macleod, Rachel Szekely, Veronika Zielinska, Brian Young, and<br />

Ralph Grishman (2004a) “Annotating Noun Argument Structure for NomBank,” in Proceedings of the<br />

4th International Conference on Language Resources and Evaluation (LREC), pp. 803–806.<br />

Meyers, Adam, Ruth Reeves, and Catherine Macleod (2004b) “NP-External Arguments: A Study of Argument<br />

Sharing in English,” in Proceedings of the ACL 2004 Workshop on Multiword Expressions:<br />

Integrating Processing, pp. 96-103.<br />

Meyers, Adam, Ruth Reeves, Catherine Macleod, Rachel Szekely, Veronika Zielinska, Brian Young,<br />

and Ralph Grishman (2004c) “The NomBank Project: An Interim Report,” in Proceedings of the<br />

HLT/NAACL 2004 Workshop Frontiers in Corpus Annotation, pp. 24–31.<br />

Mitkov, Ruslan ed. (2002) Anaphora Resolution, Studies in Language and Linguistics: Peason Education.<br />

Palmer, Martha, Paul Kingsbury, and Daniel Gildea (2005) “The Proposition Bank: An Annotated Corpus<br />

of Semantic Roles,” Computational Linguistics, Vol. 31, No. 1, pp. 71–106.<br />

Surdeanu, Mihai, Sanda Harabagiu, John Williams, and Paul Aaseth (2003) “Using Predicate-Argument<br />

Structures for Information Extraction,” in Proceedings of the 41st Annual Meeting of the Association for<br />

Computational Linguistics (ACL), pp. 8–15.<br />

(2009) <br />

21 <br />

<br />

(2009) <br />

15 614–617 http://nlp.kuee.kyoto-u.ac.jp/<br />

∼ hasimoto/KNBC v1.0 090925.tar.bz2 <br />

(2010) <br />

17 1 141–159 <br />

(2007) <br />

13 286–289 <br />

(2008) <br />

14 1152–1155 <br />

(2004) 10 <br />

576–579 <br />

(2007) <br />

13 859–862 <br />

Dain Kaplan (2010) Slate<br />

2010-NL-199 1–10 <br />

(2010) <br />

: NAIST 17 2 25–50

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!