Abstract

This article discusses the use and tools of the spaCy library, written in the Python programming language, for Natural Language Processing (NLP), considered one of the main areas of computational linguistics. A text in a natural language consists of separate units (symbols) and can be divided into several interrelated parts belonging to different levels. The article therefore presents methods for tokenizing text with the spaCy library, as well as the lemma, POS, tag, dep, shape, alpha, and stop attributes generated during pipeline processing.
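The tokenization and token attributes mentioned above can be sketched with spaCy as follows. This is a minimal illustration, not taken from the article itself: it uses a blank English pipeline, which tokenizes text and exposes lexical attributes (shape, alpha, stop) without a trained model; the lemma, POS, tag, and dep attributes additionally require a trained pipeline such as the `en_core_web_sm` package, assumed to be installed separately.

```python
import spacy

# A blank English pipeline performs tokenization only: it splits the text
# into Token objects and fills in lexical attributes such as shape_,
# is_alpha, and is_stop. Attributes produced by trained components
# (lemma_, pos_, tag_, dep_) would require e.g. spacy.load("en_core_web_sm").
nlp = spacy.blank("en")
doc = nlp("Natural language processing splits a text into tokens.")

for token in doc:
    # text, orthographic shape, is it alphabetic, is it a stop word
    print(token.text, token.shape_, token.is_alpha, token.is_stop)
```

With a trained pipeline loaded in place of `spacy.blank("en")`, the same loop can also print `token.lemma_`, `token.pos_`, `token.tag_`, and `token.dep_`, which is the full attribute set discussed in the article.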
