ツール班協力者:東京工業大学大学院情報理工 - 奈良先端科学技術 ...
ツール班協力者:東京工業大学大学院情報理工 - 奈良先端科学技術 ...
ツール班協力者:東京工業大学大学院情報理工 - 奈良先端科学技術 ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
BCCWJ <br />
<br />
∗<br />
<br />
Annotating Predicate-Argument Structure and Anaphoric Relations<br />
to BCCWJ<br />
Mamoru Komachi (Nara Institute of Science and Technology)<br />
Ryu Iida (Tokyo Institute of Technology)<br />
1 <br />
<br />
Gildea and Jurafsky (2002) <br />
(Semantic role labeling) Fillmore and Baker (2000) <br />
PropBank (Palmer et al., 2005) <br />
<br />
CoNLL (Conference on Computational Natural Language Learning)<br />
2004–2005, 2008–2009<br />
CoNLL 2011 PropBank <br />
OntoNotes (Hovy et al., 2006) <br />
<br />
(Harabagiu et al., 2005; Surdeanu et al., 2003) <br />
<br />
<br />
Meyers et al. (2004a) NomBank (Meyers et al., 2004c,b) <br />
<br />
(Gerber and Chai, 2010) ACL <br />
<br />
4.0 (Kawahara et al., 2002) <br />
<br />
GDA (Hasida, 2005) agent theme <br />
NAIST (, 2010) <br />
3.0 <br />
KNB (, 2009) <br />
<br />
(2009) <br />
BCCWJ <br />
(, 2010) NAIST <br />
(, 2004) <br />
∗ komachi@is.naist.jp
1: BCCWJ <br />
<br />
<br />
V V <br />
V V V <br />
V V <br />
V V <br />
V <br />
2 BCCWJ <br />
2.1 <br />
<br />
<br />
BCCWJ NAIST <br />
1.5β(, 2010) 1 <br />
2 <br />
agent theme PropBank <br />
ARG0 ARG1 NAIST<br />
<br />
<br />
(1) <br />
4.0 =, =<br />
, ==<br />
, =<br />
<br />
3 =<br />
1 <br />
(2) <br />
<br />
=, =<br />
NAIST FrameNet NOMLEX (Macleod<br />
et al., 1997, 1998) <br />
(2007); (2008) <br />
4 (, 2004) <br />
<br />
(3) <br />
1 http://cl.naist.jp/ ∼ ryu-i/coreference tag.html<br />
2 https://sites.google.com/site/naistcorpus/predicate tag<br />
3 <br />
4 http://cl.it.okayama-u.ac.jp/rsc/lcs
3 <br />
==<br />
<br />
(4) <br />
<br />
0.902 5 (, 2007) <br />
<br />
<br />
2.2 <br />
<br />
<br />
<br />
<br />
<br />
(5) <br />
6 <br />
<br />
(6) <br />
<br />
<br />
(2010) /<br />
4 <br />
2.3 <br />
BCCWJ <br />
<br />
2 <br />
<br />
(7) iPhone <br />
<br />
(8) iPad <br />
iPad <br />
Mitkov (2002) identity-ofreference<br />
anaphora (IRA) identityof-sense<br />
anaphora (ISA) <br />
5 http://cl.it.okayama-u.ac.jp/rsc/data/index.html<br />
6 +
2: BCCWJ 4 <br />
<br />
PN 478 5,730 127,077 A <br />
PB 55 4,691 113,399 A <br />
OW 30 2,414 100,396 A <br />
OC 938 6,402 103,188 B <br />
BCCWJ NAIST (, 2010) <br />
<br />
IRA <br />
NAIST bridging reference (Inoue et al., 2010) <br />
BCCWJ NAIST <br />
<br />
2.4 <br />
2011 2 7 BCCWJ <br />
2 UniDic 1.3.12 7 MeCab 0.98 8 <br />
<br />
<br />
<br />
9 90 1,653 (2010) <br />
<br />
3 A NAIST <br />
B,C <br />
A-B (2010) <br />
8 A-C <br />
A-B C <br />
<br />
A-C <br />
<br />
<br />
<br />
<br />
NomBank <br />
(, 2010) <br />
<br />
7 https://www.tokuteicorpus.jp/dist/index.php<br />
8 http://mecab.sourceforge.net/
3: <br />
A-B <br />
A-C <br />
<br />
87.1 (155/178) 89.6 (155/173) 80.1 (133/166) 76.9 (133/173)<br />
80.4 (123/153) 100.0 (123/123) 96.2 (101/105) 100.0 (101/101)<br />
96.3 (77/80) 98.7 (77/80) 85.3 (58/68) 96.7 (58/60)<br />
82.1 (23/28) 88.5 (23/26) 90.5 (19/21) 59.4 (19/32)<br />
93.9 (93/99) 84.5 (93/110) 72.9 (70/96) 63.6 (70/110)<br />
83.5 (76/91) 100.0 (76/76) 80.0 (32/40) 100.0 (32/32)<br />
59.0 (13/22) 100.0 (13/13) 52.4 (11/21) 73.3 (11/15)<br />
77.8 (7/9) 100.0 (7/7) 50.0 (4/8) 100.0 (4/4)<br />
3 <br />
NAIST <br />
BCCWJ NAIST <br />
<br />
<br />
<br />
OntoNotes (Hovy et al., 2006) BCCWJ <br />
<br />
90%<br />
BCCWJ NAIST <br />
90%Slate (<br />
, 2010) <br />
<br />
<br />
<br />
<br />
<br />
<br />
Fillmore, Charles J. and Collin F. Baker (2000) “FrameNet: Frame semantics meets the corpus,” in Proceedings<br />
of the 74th Annual Meeting of the Linguistic Society of America.<br />
Gerber, Matthew and Joyce Y. Chai (2010) “Beyond NomBank: a Study of Implicit Arguments for Nominal<br />
Predicates,” in Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics,<br />
pp. 1583–1592.<br />
Gildea, Daniel and Daniel Jurafsky (2002) “Automatic Labeling of Semantic Roles,” Computational Linguistics,<br />
Vol. 28, No. 3, pp. 245-288.<br />
Harabagiu, Sanda, Cosmin Adrian Bejan, and Paul Morarescu (2005) “Shallow Semantics for Relation<br />
Extraction,” in Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence<br />
(IJCAI ’05), pp. 1061–1066.<br />
Hasida, Koiti (2005) GDA 0.74 <br />
http://i-content.org/gda/tagman.html
Hovy, Eduard, Mitchell Marcus, Martha Palmer, Lance Ramshaw, and Ralph Weischedel (2006)<br />
“OntoNotes: The 90% Solution,” in Proceedings of the Human Language Technology Conference of<br />
the North American Chapter of the ACL, pp. 57–60.<br />
Inoue, Naoya, Ryu Iida, Kentaro Inui, and Yuji Matsumoto (2010) “Resolving Direct and Indirect Anaphora<br />
for Japanese Definite Noun Phrases,” Journal of Natural Language Processing, Vol. 17, No. 1, pp. 221–<br />
246.<br />
Kawahara, Daisuke, Sadao Kurohashi, and Koiti Hasida (2002) “Construction of a Japanese Relevancetagged<br />
Corpus,” in Proceedings of the 3rd International Conference on Language Resources and Evaluation<br />
(LREC), pp. 2008–2013.<br />
Macleod, Catherine, Ralph Grishman, Adam Meyers, Leslie Barrett, and Ruth Reeves (1998) “NOMLEX:<br />
A Lexicon of Nominalizations,” in Proceedings of Euralex98, pp. 187–193.<br />
Macleod, Cathrine, Adam Meyers, Ralph Grishman, Leslie Barret, and Ruth Reeves (1997) “Designing a<br />
Dictionary of Derived Nominals,” in Proceedings of Recent Advances in Natural Language Processing,<br />
pp. 142–151.<br />
Meyers, Adam, Ruth Reeves, Catherine Macleod, Rachel Szekely, Veronika Zielinska, Brian Young, and<br />
Ralph Grishman (2004a) “Annotating Noun Argument Structure for NomBank,” in Proceedings of the<br />
4th International Conference on Language Resources and Evaluation (LREC), pp. 803–806.<br />
Meyers, Adam, Ruth Reeves, and Catherine Macleod (2004b) “NP-External Arguments: A Study of Argument<br />
Sharing in English,” in Proceedings of the ACL 2004 Workshop on Multiword Expressions:<br />
Integrating Processing, pp. 96-103.<br />
Meyers, Adam, Ruth Reeves, Catherine Macleod, Rachel Szekely, Veronika Zielinska, Brian Young,<br />
and Ralph Grishman (2004c) “The NomBank Project: An Interim Report,” in Proceedings of the<br />
HLT/NAACL 2004 Workshop Frontiers in Corpus Annotation, pp. 24–31.<br />
Mitkov, Ruslan ed. (2002) Anaphora Resolution, Studies in Language and Linguistics: Peason Education.<br />
Palmer, Martha, Paul Kingsbury, and Daniel Gildea (2005) “The Proposition Bank: An Annotated Corpus<br />
of Semantic Roles,” Computational Linguistics, Vol. 31, No. 1, pp. 71–106.<br />
Surdeanu, Mihai, Sanda Harabagiu, John Williams, and Paul Aaseth (2003) “Using Predicate-Argument<br />
Structures for Information Extraction,” in Proceedings of the 41st Annual Meeting of the Association for<br />
Computational Linguistics (ACL), pp. 8–15.<br />
(2009) <br />
21 <br />
<br />
(2009) <br />
15 614–617 http://nlp.kuee.kyoto-u.ac.jp/<br />
∼ hasimoto/KNBC v1.0 090925.tar.bz2 <br />
(2010) <br />
17 1 141–159 <br />
(2007) <br />
13 286–289 <br />
(2008) <br />
14 1152–1155 <br />
(2004) 10 <br />
576–579 <br />
(2007) <br />
13 859–862 <br />
Dain Kaplan (2010) Slate<br />
2010-NL-199 1–10 <br />
(2010) <br />
: NAIST 17 2 25–50