LINGUISTIC PROBLEMS OF TEXT ANNOTATION IN CORPUS LINGUISTICS
DOI: https://doi.org/10.47390/SPR1342V4I4Y2024N48
Keywords: corpus, corpus linguistics, annotation, tagging, lemma, morphological tagging, semantic annotation, morphosyntactic annotation, syntactic annotation.
Abstract
This article examines problems of text annotation in corpus linguistics. The linguistic classification, or annotation, of large volumes of text has become a pressing task in the field, and linguistic annotation is one of its central concepts. Annotating the texts in a corpus means attaching linguistic and extralinguistic information to the texts and their components. Plain electronic text carries no such information about its own linguistic character, so good annotation is what lets a researcher easily find a desired word, word form, or construction. The article focuses on methods of morphological tagging of words and their importance in language learning.
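As a rough illustration of why annotated text is easier to search than plain text, the sketch below attaches a lemma and a part-of-speech tag to each word form of a toy sentence. The `Token` class, the helper functions, the sentence, and the tag labels are all assumptions made for this example; they are not taken from the article or from any particular corpus.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Token:
    form: str   # word form as it appears in the text
    lemma: str  # dictionary (base) form of the word
    pos: str    # part-of-speech tag; labels here are illustrative only


# A hand-annotated toy sentence (hypothetical data).
corpus = [
    Token("The", "the", "DET"),
    Token("children", "child", "NOUN"),
    Token("were", "be", "AUX"),
    Token("reading", "read", "VERB"),
    Token("books", "book", "NOUN"),
]


def find_by_lemma(tokens, lemma):
    """Return every word form whose lemma matches, whatever its inflection."""
    return [t.form for t in tokens if t.lemma == lemma]


def find_by_pos(tokens, pos):
    """Return every word form carrying the given part-of-speech tag."""
    return [t.form for t in tokens if t.pos == pos]


print(find_by_lemma(corpus, "be"))
print(find_by_pos(corpus, "NOUN"))
```

A plain-text search for "be" would miss the form "were"; the lemma layer is what makes the match possible, which is the practical point of morphological tagging.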