Published on Wed Oct 07 2020

MuSeM: Detecting Incongruent News Headlines using Mutual Attentive Semantic Matching

Rahul Mishra, Piyush Yadav, Remi Calizzano, Markus Leippold
Abstract

Measuring the congruence between two texts has several useful applications, such as detecting the deceptive and misleading news headlines that are prevalent on the web. Many works have proposed machine-learning-based solutions, such as measuring text similarity between the headline and the body text, to detect incongruence. Text-similarity-based methods perform poorly because of inherent challenges such as the relative length mismatch between a news headline and its body content, and non-overlapping vocabulary. More recent works that use headline-guided attention to learn a headline-derived contextual representation of the news body also end up with a convoluted overall representation because of the body's length. This paper proposes a method that uses mutual attention-based semantic matching between the original and synthetically generated headlines, which utilizes the difference between all pairs of word embeddings of the words involved. We also investigate two variations of the method, which use concatenation and dot products of the word embeddings of the original and synthetic headlines. We observe that the proposed method significantly outperforms prior art on two publicly available datasets.
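
As a rough illustration of the three pairwise interactions named in the abstract (difference, concatenation, and dot product of word embeddings), the sketch below builds them for two short headlines and derives attention weights in both directions. It is not the authors' architecture: the function names, the use of negative Euclidean distance as the attention score, and the plain softmax are all assumptions made for illustration, and the embeddings are toy lists rather than learned vectors.

```python
import math


def pairwise(orig, synth, mode="difference"):
    """Combine every original-headline word with every synthetic-headline word.

    orig and synth are lists of equal-length embedding vectors (lists of floats).
    Returns an m x n grid, one entry per word pair.
    """
    grid = []
    for u in orig:
        row = []
        for v in synth:
            if mode == "difference":
                row.append([a - b for a, b in zip(u, v)])
            elif mode == "concat":
                row.append(list(u) + list(v))
            elif mode == "dot":
                row.append(sum(a * b for a, b in zip(u, v)))
            else:
                raise ValueError(f"unknown mode: {mode}")
        grid.append(row)
    return grid


def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    top = max(xs)
    exps = [math.exp(x - top) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def mutual_attention(orig, synth):
    """Attention weights in both directions from pairwise differences.

    Assumption: a pair scores higher when its difference vector is shorter.
    """
    diffs = pairwise(orig, synth, "difference")
    scores = [[-math.sqrt(sum(c * c for c in d)) for d in row] for row in diffs]
    # Each original word attends over all synthetic words (rows sum to 1).
    orig_to_synth = [softmax(row) for row in scores]
    # Each synthetic word attends over all original words (rows sum to 1).
    synth_to_orig = [softmax(list(col)) for col in zip(*scores)]
    return orig_to_synth, synth_to_orig
```

Calling `mutual_attention(orig, synth)` with an `m`-word original headline and an `n`-word synthetic headline returns an `m x n` and an `n x m` weight matrix; a real model would learn the scoring function rather than fix it as done here.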

Mon Mar 23 2020
NLP
BaitWatcher: A lightweight web interface for the detection of incongruent news headlines
BaitWatcher is a lightweight web interface that guides readers in estimating the likelihood of incongruence in news articles. It uses a hierarchical recurrent encoder that learns complex textual representations of a news headline and its associated body text.
Mon Nov 09 2020
NLP
Adversarial Semantic Collisions
Semantic collisions are texts that are semantically unrelated but judged as similar by NLP models. We develop gradient-based approaches for generating semantic collisions. We discuss how to evade perplexity-based filtering and outline potential mitigations.
Sat Nov 17 2018
NLP
Detecting Incongruity Between News Headline and Body Text via a Deep Hierarchical Encoder
Some news headlines mislead readers with overrated or false information. Identifying them in advance will better assist readers in choosing proper news stories to consume. This research introduces million-scale pairs of news headlines and body text.
Mon Aug 26 2019
NLP
The Limitations of Stylometry for Detecting Machine-Generated Fake News
Recent developments in neural language models (LMs) have raised concerns about their potential misuse for automatically spreading misinformation. Several studies have proposed to detect machine-generated fake news by capturing their stylistic differences from human-written text. We show that stylometry is limited against machine-generated misinformation.
Sat Mar 14 2020
NLP
Text Similarity Using Word Embeddings to Classify Misinformation
Fake news has been a growing problem in recent years, especially during elections. Technology can help optimize the fact-checking process. In this work, we address the challenge of finding similar content.
Tue Apr 13 2021
Computer Vision
NewsCLIPpings: Automatic Generation of Out-of-Context Multimodal Media