Published on Wed Jul 22 2020

IITK at the FinSim Task: Hypernym Detection in Financial Domain via Context-Free and Contextualized Word Embeddings

Vishal Keswani, Sakshi Singh, Ashutosh Modi

The goal of this task is to classify financial terms into the most relevant hypernym (or top-level) concept. We leverage both context-dependent and context-independent word embeddings in our analysis.

0
0
0
Abstract

In this paper, we present our approaches for the FinSim 2020 shared task on "Learning Semantic Representations for the Financial Domain". The goal of this task is to classify financial terms into the most relevant hypernym (or top-level) concept in an external ontology. We leverage both context-dependent and context-independent word embeddings in our analysis. Our systems deploy Word2vec embeddings trained from scratch on the corpus (Financial Prospectus in English) along with pre-trained BERT embeddings. We divide the test dataset into two subsets based on a domain rule. For one subset, we use unsupervised distance measures to classify the term. For the second subset, we use simple supervised classifiers like Naive Bayes, on top of the embeddings, to arrive at a final prediction. Finally, we combine both the results. Our system ranks 1st based on both the metrics, i.e., mean rank and accuracy.

Thu Jul 29 2021
NLP
Term Expansion and FinBERT fine-tuning for Hypernym and Synonym Ranking of Financial Terms
Hypernym and synonym matching is one of the mainstream Natural Language Processing (NLP) tasks. We present systems that attempt to solve this problem. We designed these systems to participate in the FinSim-3, ashared task of FinNLP workshop at IJCAI-
5
0
0
Tue Jul 13 2021
NLP
Exploiting Network Structures to Improve Semantic Representation for the Financial Domain
This paper presents the participation of the MiniTrue team in the FinSim-3 task on learning semantic similarities for the financial domain in English. Our approach combines contextual embeddings learned by transformer-based language models with network structures embeddments extracted from external knowledge sources.
4
0
1
Sat Aug 21 2021
NLP
Yseop at FinSim-3 Shared Task 2021: Specializing Financial Domain Learning with Phrase Representations
The aim of this shared task is to correctly classify a list of given terms from the financial domain into the most relevant hypernym. Our system ranks 2nd overall on both metrics.
3
0
0
Tue Mar 02 2021
Machine Learning
FinMatcher at FinSim-2: Hypernym Detection in the Financial Services Domain using Knowledge Graphs
This paper presents the FinMatcher system and its results for the FinSim 2021shared task. The FinSim-2 shared task consists of a set of concept labels from the financial services domain. The goal is to find the most relevant top-level concept from a given set of concepts.
0
0
0
Wed Apr 19 2017
NLP
Predicting Role Relevance with Minimal Domain Expertise in a Financial Domain
Word embeddings have made enormous inroads in recent years in a wide variety of text mining applications. In this paper, we explore a word embedding-based architectures for predicting the relevance of a role between two financial entities.
0
0
0
Mon Oct 02 2017
NLP
Distributional Inclusion Vector Embedding for Unsupervised Hypernymy Detection
Modeling hypernymy, such as poodle is-a dog, is an important generalization. Existing unsupervised methods either do not scale to large vocabularies or yield unacceptably poor accuracy. This paper introduces distributional inclusion vector embedding.
0
0
0