Our multilingual NLP Pipeline is based on a flexible API which enables effective end-to-end processing of text in the following languages:
Multilinguality is a key feature of our pipeline, with most modules available in 13 languages. Moreover, we feature:
Our multilingual Natural Language pipeline includes modules which perform the following tasks, which can be accessed separately and are integrated into the pipeline:
Babelscape’s NLP pipeline comes with several groundbreaking features. It is designed to work on a large scale in dozens of languages using the same interface for each language. Users can choose only the modules they need and can run dozens of tasks in parallel on the same CPU. The pipeline also integrates our flagship products as modules: WordAtlas, Comprehendo and Extraggo, thanks to which a full-fledged analysis of text can be performed, ranging from tokenization to semantic analysis and text analytics.
We compared the time performance of our multilingual pipeline with two strong competitors, namely the Stanford CoreNLP and the NLTK libraries, on gold-standard data. The results reported in the Table show that our pipeline is faster and more accurate than its alternatives.
|Named Entity Recognition||14.81ms||158.59ms||143.66ms|