Zawilinski: A library for studying grammar in wiktionary
|Zawilinski: A library for studying grammar in wiktionary|
|Published in||Proceedings of WikiSym 2010 - The 6th International Symposium on Wikis and Open Collaboration|
|Keyword(s)||inflection, language, MediaWiki, parse, wiki, Wiktionary (Extra: inflection, language, MediaWiki, parse, wiki, Wiktionary, Linguistics)|
|Article||BASE, CiteSeerX, Google Scholar|
|Web||Ask, Bing, Google (PDF), Yahoo!|
|Download and mirrors|
|Local copy||Not available|
|Remote mirror(s)||Not available|
|Export and share|
|BibTeX, CSV, RDF, JSON|
|Browse properties · List of conference papers|
Zawilinski: A library for studying grammar in wiktionary is a 2010 conference paper written in English by Kurmas Z. and published in Proceedings of WikiSym 2010 - The 6th International Symposium on Wikis and Open Collaboration.
We present Zawilinski, a Java library that supports the extraction and analysis of grammatical data in Wiktionary. Zawilinski can efficiently (1) filter Wiktionary for content pertaining to a specified language, and (2) extract a word's inflections from its Wiktionary entry. We have thus far used Zawilinski to (1) measure the correctness of the inflections for a subset of the Polish words in the English Wiktionary and to (2) show that this grammatical data is very stable. (Only 131 out of 4748 Polish words have had their inflection data corrected.) We also explain Zawilinski's key features and discuss how it can be used to simplify the development of additional grammar-based analyses.
- This section requires expansion. Please, help!