1 Answers
Automatic acquisition of lexicon is a computerized process used for the development of a complex morphological lexicon of a language. The lexicon is essential for the NLP , as well as a prerequisite to any wide-coverage parser.The two main requirements represent raw corpus and the morphological description of the language. The aim is to provide lemmas that will serve to the explanation of all the words that occur within the corpus. For the achievement of a quality lexicon it is necessary to manually validate the generated lemmas and iterate the whole process several times.The process is focused on the open word classes. Closed classes are excluded.This method is applicable to the languages with a rich morphology, such as Slovak, Russian or Croatian.
Applied to Slovak, being an inflectional language, the automatic acquisition focuses on the inflectional morphology as well as on the derivational morphology. This fact enables the users to find out the information about derivational relations in the lexicon. For example, Slovak word korpusový is an adjectivization of korpus.