Abstract

The paper presents an online system for clustering and classification of texts in the Polish language. It allows running complex workflows of language and machine learning tools. A high throughput and low latency was achieved by an asynchronous style of programming and a usage of message oriented middleware—RabbitMQ. Authors discuss the architecture assumptions, the language processing modelling notation for a workflow definition and the system architecture. Moreover, a sample Single Page Application is presented that clusters uploaded corpora and shows results online.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call