Removing grammar ambiguity of word forms by statistical methods

Authors

  • Ganna Karnaukh, Український мовно-інформаційний фонд Національної академії наук України [Ukrainian Lingua-Information Fund, National Academy of Sciences of Ukraine], Kyiv

DOI:

https://doi.org/10.11649/cs.2014.003

Keywords:

grammatical homonyms, linguistic processor, N-gram, trigram, word form, grammatical meaning, capital / small letters, statistical methods

Abstract

This research studies the behaviour of a linguistic processor when several supporting software functions are applied simultaneously: taking into account how word forms are written (capital / small letters), punctuation marks in trigrams, and the location of trigrams within a sentence. The article analyses the qualitative and quantitative characteristics of the results of removing grammatical homonymy of word forms by statistical methods, subject to these requirements. The research is based on the texts of normative legal documents.
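As an illustration only, the three features named in the abstract can be sketched in a few lines of Python. Everything in the sketch is an assumption made for the example, not the author's linguistic processor: the boundary markers `<s>` and `</s>`, the function names, the tag labels, and the two-sentence English toy corpus. It counts trigrams of the form (left neighbour, word form, right neighbour) from hand-tagged sentences, keeps letter case, treats punctuation marks as ordinary trigram members, and marks sentence edges so that the position of a trigram is visible.

```python
from collections import Counter

START, END = "<s>", "</s>"  # hypothetical sentence-boundary markers


def context_key(token, keep_case=True):
    """Return the token as used in a trigram; lower-cased if case is ignored."""
    return token if keep_case else token.lower()


def train(tagged_sentences, keep_case=True):
    """Count (left, form, tag, right) contexts from hand-tagged sentences.

    Punctuation marks are ordinary tokens, and START/END record whether a
    trigram sits at the beginning or end of a sentence.
    """
    counts = Counter()
    for sent in tagged_sentences:
        forms = [START] + [context_key(w, keep_case) for w, _ in sent] + [END]
        for i, (_, tag) in enumerate(sent, start=1):
            counts[(forms[i - 1], forms[i], tag, forms[i + 1])] += 1
    return counts


def disambiguate(counts, left, form, right, candidates, keep_case=True):
    """Pick the candidate tag seen most often in this trigram context.

    Returns None when the context was never observed in training.
    """
    l, f, r = (context_key(t, keep_case) for t in (left, form, right))

    def score(tag):
        return counts[(l, f, tag, r)]

    best = max(candidates, key=score)
    return best if score(best) > 0 else None


# Toy corpus (invented for the example): "March" vs "march".
corpus = [
    [("They", "PRON"), ("march", "VERB"), (".", "PUNCT")],
    [("In", "PREP"), ("March", "NOUN"), (",", "PUNCT"),
     ("we", "PRON"), ("met", "VERB"), (".", "PUNCT")],
]
counts = train(corpus)
noun_choice = disambiguate(counts, "In", "March", ",", ["NOUN", "VERB"])
verb_choice = disambiguate(counts, "They", "march", ".", ["NOUN", "VERB"])
unseen = disambiguate(counts, "We", "march", "!", ["NOUN", "VERB"])
```

In this toy setting the capitalised form after "In" and before a comma resolves to the noun reading, the lower-case form before a full stop resolves to the verb reading, and an unseen context yields `None`. A practical system would need a large tagged corpus and some fallback for unseen trigrams.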

Published

2014-09-04

Section

Semantics and Corpus Linguistics