Towards the Methodology for Extending Princeton WordNet

Authors

  • Ewa Katarzyna Rudnicka Politechnika Wrocławska [Wrocław University of Technology], Wrocław , Politechnika Wrocławska [Wrocław University of Technology], Wrocław
  • Wojciech Witkowski Uniwersytet Wrocławski [University of Wrocław], Wrocław , Uniwersytet Wrocławski [University of Wrocław], Wrocław
  • Michał Kaliński Politechnika Wrocławska [Wrocław University of Technology], Wrocław , Politechnika Wrocławska [Wrocław University of Technology], Wrocław

DOI:

https://doi.org/10.11649/cs.2015.023

Keywords:

wordnet extension, lexico-semantic relations, I-hyponymy links, rule-based algorithms, lexical gaps

Abstract

The paper presents the methodology and results of the first, pilot stage of the extension of Princeton WordNet, a huge electronic English language thesaurus and lexico-semantic network based on synsets, ie. sets of synonymous lexical units, or lemma sense pairs. The necessity for such extension arose in the course of mapping plWordNet (Polish WordNet — Słowosieć) onto Princeton WordNet, which produced a large number of inter-lingual hyponymy links signalling differences in the structure and lexical coverage of the two networks. The proposed strategy uses I-hyponymy links as pointers to presumed gaps in the lexical coverage of PrincetonWordNet and offers strategies of filling them in with new lexical units and synsets.

References

Fellbaum, C. (1998). Wordnet: An electronic lexical database. Cambridge, Mass.: MIT Press. DOI: https://doi.org/10.7551/mitpress/7287.001.0001

Hamp, B. & Feldweg, H. (1997). Germanet: A lexical-semantic net for German. In Proceedings of the Proceedings of ACL workshop Automatic Information Extraction and Building of Lexical Semantic Resources for NLP Applications.

Kędzia, P., Piasecki, M., & Przybycień, E. R. K. (2013). Automatic prompt system in the process of mapping plWordNet on Princeton WordNet. Cognitive Studies | Études cognitives, (13), 123– 141. DOI: https://doi.org/10.11649/cs.2013.008

Maziarz, M., Piasecki, M., Rudnicka, E., & Szpakowicz, S. (2013). Beyond the transfer-and-merge wordnet construction: plWordNet and a comparison with Wordnet. In Proc. RANLP (pp. 443– 452).

Maziarz, M., Piasecki, M., & Szpakowicz, S. (2012). Approaching plWordNet 2.0. In Proceedings of the 6th Global Wordnet Conference.

Miller, G. A., Beckwith, R., Fellbaum, C., Gross, D., & Miller, K. (1990). Introduction to WordNet: An on-line lexical database. International Journal of Lexicography, 3(4), 235– 244. http://doi.org/10.1093/ijl/3.4.235 DOI: https://doi.org/10.1093/ijl/3.4.235

Pianta, E., Bentivogli, L., & Girardi, C. (2002). Multiwordnet: Developing an aligned multilingual database. In Proceedings of the First International Conference on Global WordNet.

Piasecki, M., Maziarz, M., Szpakowicz, S., & Rudnicka, E. (2014). PlWordNet as the cornerstone of a toolkit of lexico-semantic resources. In Proc. 7th International Global Wordnet Conference (pp. 304– 312).

Piasecki, M., Szpakowicz, S., & Broda, B. (2009). A wordnet from the ground up. Wrocław: Oficyna Wydawnicza Politechniki Wrocławskiej.

Piotrowski, T. & Saloni, Z. (2002). Słownik angielsko-polski i polsko-angielski. Warszawa: Wydawnictwo Naukowe PWN.

Pushpak, B. (2010). Indowordnet. In Lexical Resources Engineering Conference 2010 (LREC 2010).

Rudnicka, E., Maziarz, M., Piasecki, M., & Szpakowicz, S. (2012). A strategy of mapping polish WordNet on Princeton Wordnet. In M. Maziarz, M. Piasecki, E. Rudnicka, & S. Szpakowicz (Eds.), Proceedings of COLING.

Svensen, B. (2009). A handbook of lexicography: The theory and practice dictionary-making. Cambridge: Cambridge University Press.

Vossen, P. (2002). EuroWordNet General Document.

Downloads

Published

2015-12-31

Issue

Section

Semantics, Corpus Linguistics and Computer Linguistics