Published on Tue Dec 05 2017

One for All: Towards Language Independent Named Entity Linking

Avirup Sil, Radu Florian

LIEL is a Language Independent Entity Linking system. It works remarkably well on a number of different languages without change. LIEL makes a joint global prediction over the entire document.

Abstract

Entity linking (EL) is the task of disambiguating mentions in text by associating them with entries in a predefined database of mentions (persons, organizations, etc.). Most previous EL research has focused on a single language, English, with less attention paid to other languages such as Spanish or Chinese. In this paper, we introduce LIEL, a Language Independent Entity Linking system: an EL framework that, once trained on one language, works remarkably well on a number of different languages without change. LIEL makes a joint global prediction over the entire document, employing a discriminative reranking framework with many domain- and language-independent feature functions. Experiments on numerous benchmark datasets show that the proposed system, once trained on one language, English, outperforms several state-of-the-art systems in English (by 4 points), and the trained model also works very well on Spanish (14 points better than a competitor system), demonstrating the viability of the approach.

Mon Jul 17 2017
NLP
MAG: A Multilingual, Knowledge-base Agnostic and Deterministic Entity Linking Approach
The best-performing approaches rely on trained monolingual models. Porting these approaches to other languages is difficult. We present a novel knowledge-base agnostic and deterministic approach to entity linking. MAG is based on a combination of context-based retrieval and structured knowledge bases.
Thu Nov 05 2020
NLP
Entity Linking in 100 Languages
We propose a new formulation for multilingual entity linking. Language-specific mentions resolve to a language-agnostic Knowledge Base. We provide Mewsli-9, a large new multilingual dataset matched to our new setting.
Wed Apr 20 2016
NLP
Distributed Entity Disambiguation with Per-Mention Learning
Existing techniques based on global ranking models fail to capture the individual peculiarities of the words. We propose a new disambiguation system that learns specialized features and models. We train and validate hundreds of thousands of learning models using a Wikipedia hyperlink dataset.
Mon Aug 30 2021
NLP
Towards Consistent Document-level Entity Linking: Joint Models for Entity Linking and Coreference Resolution
We propose to join the EL task with that of coreference resolution. We cluster mentions that are linked via coreference, and enforce a single EL for all of the clustered mentions together. We formulate the coref+EL problem as a structured prediction task over directed trees.
Wed Apr 22 2020
Machine Learning
ParsEL 1.0: Unsupervised Entity Linking in Persian Social Media Texts
Social media is one of the largest data repositories in the world. A large portion of this social media data is natural language text. The proposed method achieves a score of 86.94% for the Persian language.
Mon Jan 04 2021
NLP
Reddit Entity Linking Dataset
We introduce and make publicly available an entity linking dataset from Reddit. The dataset contains 17,316 linked entities, each annotated by three human annotators. We analyze the different errors and disagreements made by annotators and suggest three types of corrections to the raw data.