Expanding WordNet with Gloss and Polysemy Links for Evocation Strength Recognition
DOI:
https://doi.org/10.11649/cs.2325Keywords:
evocation, WordNet, glosses, polysemy, evocation strength, semantic relationsAbstract
Evocation – a phenomenon of sense associations going beyond standard (lexico)-semantic relations – is difficult to recognise for natural language processing systems. Machine learning models give predictions which are only moderately correlated with the evocation strength. It is believed that ordinary graph measures are not as good at this task as methods based on vector representations. The paper proposes a new method of enriching the WordNet structure with weighted polysemy and gloss links, and proves that Dijkstra’s algorithm performs equally as well as other more sophisticated measures when set together with such expanded structures.
References
Agirre, E., Alfonseca, E., Hall, K., Kravalova, J., Pasca, M., & Soroa, A. (2009). A study on similarity and relatedness using distributional and WordNet-based approaches. In Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics on – NAACL ’09, 19. Association for Computational Linguistics. https://doi.org/10.3115/1620754.1620758 DOI: https://doi.org/10.3115/1620754.1620758
Agirre, E., & Edmonds, P. (2007). Word sense disambiguation: Algorithms and applications. Springer. https://doi.org/10.1007/1-4020-4809-2 DOI: https://doi.org/10.1007/978-1-4020-4809-8
Agirre, E., & Lopez de Lacalle, O. (2003). Clustering WordNet word senses. In N. Nicolov, K. Bontcheva, G. Angelova, & R. Mitkov (Eds.), Recent Advances in Natural Language Processing III: Selected papers from RANLP 2003 (pp. 121–130). https://doi.org/10.1075/cilt.260.13agi DOI: https://doi.org/10.1075/cilt.260.13agi
Allen, K. (2014). Linguistic meaning. Routledge.
Baker, M. C. (2003). Lexical categories: Verbs, nouns and adjectives. Cambridge University Press. https://doi.org/10.1017/CBO9780511615047 DOI: https://doi.org/10.1017/CBO9780511615047
Ballatore, A., Bertolotto, M., & Wilson, D. C. (2014). An evaluative baseline for geo-semantic relatedness and similarity. GeoInformatica, 18, 747–767. https://doi.org/10.1007/s10707-013-0197-8 DOI: https://doi.org/10.1007/s10707-013-0197-8
Boyd-Graber, J., Fellbaum, C., Osherson, D., & Schapire, R. (2006). Adding dense, weighted, connections to WordNet. In Proceedings of the Global WordNet Conference. http://umiacs.umd.edu/ jbg/docs/jbg-jeju.pdf
Cattle, A., & Ma, X. (2017). Predicting word association strengths. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (pp. 1283–1288). Association for Computational Linguistics. https://doi.org/10.18653/v1/D17-1132 DOI: https://doi.org/10.18653/v1/D17-1132
Chklovski, T., & Mihalcea, R. (2002). Building a sense tagged corpus with open mind word expert. In Proceedings of the ACL-02 workshop on Word sense disambiguation: Recent successes and future directions (Vol. 8, pp. 116–122). Association for Computational Linguistics. https://doi.org/10.3115/1118675.1118692 DOI: https://doi.org/10.3115/1118675.1118692
Cormen, T. H., Leiserson, C. E., Rivest, R. L., & Stein, C. (2001). Introduction to algorithms. MIT Press.
Cramer, I. (2008). How well do semantic relatedness measures perform? A meta-study. In Proceedings of the 2008 Conference on Semantics in Text Processing (pp. 59–70). Association for Computational Linguistics. https://doi.org/10.3115/1626481.1626487 DOI: https://doi.org/10.3115/1626481.1626487
Cruse, A. (2006). Glossary of semantics and pragmatics. Edinburgh University Press. DOI: https://doi.org/10.1515/9780748626892
Csardi, G., & Nepusz, T. (2006). The igraph software package for complex network research. InterJournal: Complex Systems, Article 1695. http://igraph.org
Edmonds, P. (2004). Lexical disambiguation. In Elsevier encyclopedia of language & linguistics (pp. 43–62). Elsevier.
Faruqui, M., Tsvetkov, Y., Rastogi, P., & Dyer, C. (2016). Problems with evaluation of word embeddings using word similarity tasks. In Proceedings of the 1st Workshop on Evaluating Vector-Space Representations for NLP (pp. 30–35). Association for Computational Linguistics. https://doi.org/10.18653/v1/W16-2506 DOI: https://doi.org/10.18653/v1/W16-2506
Fellbaum, C. (Ed.). (1998). WordNet: An electronic lexical database. MIT Press. https://doi.org/10.7551/mitpress/7287.001.0001 DOI: https://doi.org/10.7551/mitpress/7287.001.0001
Ge, J., & Qiu, Y. (2008). Concept similarity matching based on semantic distance. In 2008 Fourth International Conference on Semantics, Knowledge and Grid (pp. 380–383). IEEE. https://doi.org/10.1109/SKG.2008.24 DOI: https://doi.org/10.1109/SKG.2008.24
Hayashi, Y. (2016). Predicting the evocation relation between lexicalized concepts. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers (pp. 1657–1668). Association for Computational Linguistics.
Jackson, H. (2002). Lexicography: An introduction. Routledge.
Janz, A., & Maziarz, M. (in press). Chaining polysemous senses for evocation recognition. In Proceedings of the 12th International Conference on Computational Collective Intelligence. Springer.
Kacmajor, M., & Kelleher, J. D. (2019). Capturing and measuring thematic relatedness. Language Resources and Evaluation, 54(3), 645–682. https://doi.org/10.1007/s10579-019-09452-w DOI: https://doi.org/10.1007/s10579-019-09452-w
Lyons, J. (1977). Semantics (Vol. 1). Cambridge University Press. https://doi.org/10.1017/CBO9781139165693 DOI: https://doi.org/10.1017/CBO9781139165693
Lyons, J. (1995). Linguistic semantics: An introduction. Cambridge University Press. https://doi.org/10.1017/CBO9780511810213 DOI: https://doi.org/10.1017/CBO9780511810213
Ma, X. (2013). Evocation: Analyzing and propagating a semantic link based on free word association. Language Resources and Evaluation, 47(3), 819–837. https://doi.org/10.1007/s10579-013-9219-2 DOI: https://doi.org/10.1007/s10579-013-9219-2
Miller, G. A., & Fellbaum, C. (2007). WordNet then and now. Language Resources and Evaluation, 41(2), 209–214. https://doi.org/10.1007/s10579-007-9044-6 DOI: https://doi.org/10.1007/s10579-007-9044-6
Nikolova, S. S., Boyd-Graber, J., Fellbaum, C., & Cook, P. (2009). Better vocabularies for assistive communication aids: Connecting terms using semantic networks and untrained annotators. In ACM Conference on Computers and Accessibility. ACM Press. https://doi.org/10.1145/1639642.1639673 DOI: https://doi.org/10.1145/1639642.1639673
Saeed, J. (2003). Semantics (2nd ed.). Blackwell Publishing.
Sag, I. A., Baldwin, T., Bond, F., Copestake, A., & Flickinger, D. (2002). Multiword expressions: A pain in the neck for NLP. In A. Gelbukh (Ed.), Computational Linguistics and Intelligent Text Processing: CICLing 2002 (pp. 1–15). Springer. https://doi.org/10.1007/3-540-45715-1_1 DOI: https://doi.org/10.1007/3-540-45715-1_1
Schmid, H.-J. (2007). Laurie Bauer and Salvador Valera (eds.), Approaches to conversion/zero-derivation. Münster, New York, Munich, and Berlin: Waxmann, 2005. 175 pp., £19.90 (pb.), ISBN 3-8309-1456-3 [Review]. English Language & Linguistics, 11(3), 587–590. https://doi.org/10.1017/S1360674307002407 DOI: https://doi.org/10.1017/S1360674307002407
Schönefeld, D. (2005). Zero-derivation-functional change-metonymy. In L. Bauer & S. Valera (Eds.), Approaches to conversion/zero-derivation (pp. 131–160). Waxmann.
Small, S. L., Cottrell, G. W., & Tanenhaus, M. K. (1988). Preface. In S. L. Small, G. W. Cottrell, & M. K. Tanenhaus (Eds.), Lexical ambiguity resolution: Perspectives from psycholinguistics, neuropsychology, and artificial intelligence. Morgan Kaufmann. https://doi.org/10.1016/B978-0-08-051013-2.50004-5 DOI: https://doi.org/10.1016/B978-0-08-051013-2.50004-5
Suderman, K., & Ide, N. (2006). Layering and merging linguistic annotations. In Proceedings of the 5th Workshop on NLP and XML (NLPXML-2006): Multi-Dimensional Markup in Natural Language Processing (pp. 89–92). https://doi.org/10.3115/1621034.1621052 DOI: https://doi.org/10.3115/1621034.1621052
Svensen, B. (2009). A handbook of lexicography: The theory and practice of dictionary-making. Cambridge University Press.
Vicente, A., & Falkum, I. L. (2017). Polysemy. In Oxford Research Encyclopedia of Linguistics. Oxford University Press. https://doi.org/10.1093/acrefore/9780199384655.013.325 DOI: https://doi.org/10.1093/acrefore/9780199384655.013.325
Yang, Y. (2008). Multiple criteria third-order response surface design and comparison. https://www.researchgate.net/publication/254671895_Multiple_Criteria_Third-Order_Response_Surface_Design_and_Comparison
Zipf, G. K. (1945). The meaning-frequency relationship of words. The Journal of General Psychology, 33(2), 251–256. https://doi.org/10.1080/00221309.1945.10544509 DOI: https://doi.org/10.1080/00221309.1945.10544509
Downloads
Published
Issue
Section
License
Copyright (c) 2020 Marek Maziarz, Ewa Rudnicka

This work is licensed under a Creative Commons Attribution 3.0 Unported License.



