Published on Tue Mar 30 2021

Collaborative construction of lexicographic and parallel datasets for African languages: first assessment

Elvis Mboning Tchiaze
0
0
0
Abstract

Faced with a considerable lack of resources in African languages to carry out work in Natural Language Processing (NLP), Natural Language Understanding (NLU) and artificial intelligence, the research teams of NTeALan association has set itself the objective of building open-source platforms for the collaborative construction of lexicographic data in African languages. In this article, we present our first reports after 2 years of collaborative construction of lexicographic resources useful for African NLP tools.

Fri Oct 23 2020
NLP
Evaluating Language Tools for Fifteen EU-official Under-resourced Languages
This article presents the results of the evaluation campaign of language tools available for fifteen EU-official under-resourced languages. The evaluation was conducted within the MSC ITN CLEOPATRA action that aims at building the cross-lingual event-centric knowledge processing.
0
0
0
Thu Jul 08 2021
NLP
A Review of Bangla Natural Language Processing Tasks and the Utility of Transformer Models
Bangla is ranked as the 6th most widely spoken language across the world. With 230 million native speakers, it is still considered as a low-resource language. With three decades of research, Bangla NLP is still lagging behind mainly due to the scarcity of resources.
2
2
3
Thu Jul 01 2021
NLP
Multilingual Central Repository: a Cross-lingual Framework for Developing Wordnets
The Multilingual Central Repository (MCR) is a multilingual knowledge base that includes wordnets of Basque, Catalan, English, Galician, Portuguese, Spanish.
0
0
0
Fri Oct 23 2020
NLP
Natural Language Processing Chains Inside a Cross-lingual Event-Centric Knowledge Pipeline for European Union Under-resourced Languages
This article presents the strategy for developing a platform containing Language Processing Chains for European Union languages. These chains are part of the first step of an event-centric knowledge processing pipeline. The aim is to process multilingual media information about major events.
0
0
0
Sun Aug 30 2009
NLP
An OLAC Extension for Dravidian Languages
This paper intends to review the bottom-up distributed character of the project. It proposes an extension of the architecture for Dravidian languages. An ontological structure is considered for effective natural language processing.
0
0
0
Wed Jun 05 2019
NLP
Survey on Publicly Available Sinhala Natural Language Processing Tools and Research
Sinhala is the native language of the Sinhalese people who make up the largest ethnic group of Sri Lanka. Due to poverty in both linguistic and economic capital, Sinhala remains a resource-poor language.
0
0
0