Skip Navigation

IEICE Transactions on Information and Systems 2008 E91-D(4):969-975; doi:10.1093/ietisy/e91-d.4.969
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Request Permissions
Google Scholar
Right arrow Articles by OH, H.-J.
Right arrow Articles by YUN, B.-H.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Copyright © 2008 The Institute of Electronics, Information and Communication Engineers

Special Section on Knowledge-Based Software Engineering -- Papers -- Knowledge Engineering

Sentence Topics Based Knowledge Acquisition for Question Answering

Hyo-Jung OH1 and Bo-Hyun YUN2

1 The author is with ETRI, 161 Gajeong-dong, Yuseong-gu, Daejeon, 350–700, Korea., 2 The author is with Mokwon University, Mokwon Gil 21, Seo-gu, Daejeon, 302–318, Korea. E-mail: ybh{at}mokwon.ac.kr

This paper presents a knowledge acquisition method using sentence topics for question answering. We define templates for information extraction by the Korean concept network semi-automatically. Moreover, we propose the two-phase information extraction model by the hybrid machine learning such as maximum entropy and conditional random fields. In our experiments, we examined the role of sentence topics in the template-filling task for information extraction. Our experimental result shows the improvement of 18% in F-score and 434% in training speed over the plain CRF-based method for the extraction task. In addition, our result shows the improvement of 8% in F-score for the subsequent QA task.

Key Words: knowledge acquisition, machine learning, question answering


Manuscript received July 2, 2007. Manuscript revised September 28, 2007.

Reference

[1] J. Kupiec, "MURAX: A robust linguistic approach for question answering using on-line encyclopedia," Proc. ACM SIGIR, 1993.

[2] W. Li and R.K. Srihari, Extracting Exact Answers to Questions Based Structural Links, Proc. Coling, 2002.

[3] C.K. Lee, J.H. Wang, H.J. Kim, and M.G. Jang, "Extracting template for knowledge-based question-answering using conditional random fields," Proc. 28th Annual International ACM SIGIR Workshop on MF/IR, 2005.

[4] A.L. Berger, S.A. Pietar, and V.J. Pietra, "Maximum entropy approach to natural language processing," Computational Linguistics, vol.22, no.1, pp.39–71, 1996.

[5] H. Christensen, B. Kolluru, Y. Gotoh, and S. Renals, "Maximum entropy segmentation on broadcast news," Proc. IEEE International Conference on Acoustic, Speech, and Signal Processing (ICASSP-05), pp.1029–1032, 2005.

[6] S. Della Pietra, V. Della Pietra, and J. Lafferty, "Inducing features of random fields," IEEE Trans. Pattern Anal. Mach. Intell., vol.19, no.4, pp.380–393, 1997.

[7] J. Lafferty, A. McCallum, and F. Pereira, "Conditional random fields: Probabilistic models for segmenting and labeling sequence data," ICML, 2001.

[8] C.K. Lee, H.J. Oh, Y.G. Hwang, C.H. Lee, J.H. Wang, S.J. Lim, and M.G. Jang,"Fine-grained named entity recognition using conditional random fields for question answering," Proc. Asia Information Retrieval Symposium (AIRS-06), LNCS vol.4182, pp.581–587, 2006.

[9] J. Darroch and D. Ratcliff, "Generalized iterative scaling for log-linear models," Annals of Mathematical Statistics, vol.43, no.5, pp.1470–1480, 1972.

[10] K. Nigam, J. Lafferty, and A. McCallum, "Using maximum entropy for text classification," Proc. IJCAI-99 Workshop on Machine Learning for Information Filtering, pp.61–67, 1999.

[11] R. Malouf, "A comparison of algorithms for maximum entropy parameter estimation," Proc. 6th Conference on Natural Language Learning, pp.49–55, 2002.

[12] M.R. Choi, J. Hur, and M.G. Jang, "Constructing Korean lexical concept network for encyclopedia question-answering system," Proc. IECON 2004 – 30th Annual Conference of IEEE Industrial Electronics Society, pp.3115–3119, 2004.

[13] B.Y. Kang and S.H. Myaeng, "Theme assignment for sentences based on head-driven patterns," IEICE Trans. Inf. & Syst., vol.E89-D, no.1, pp.377–380, Jan. 2006.

[14] C. Fellbaum, WordNet, an electronic lexical database, The MIT Press, 1998.

[15] Y. Yang and X. Liu, "A re-examination of text categorization methods," Proc. 22nd Annual International ACM-SIGIR, pp.42–49, 1999.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Request Permissions
Google Scholar
Right arrow Articles by OH, H.-J.
Right arrow Articles by YUN, B.-H.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?