Published on Mon Feb 22 2016

Correlation Hashing Network for Efficient Cross-Modal Retrieval

Yue Cao, Mingsheng Long, Jianmin Wang, Philip S. Yu

Hashing is widely applied to approximate nearest neighbor search for large-scale multimodal retrieval with storage and computation efficiency. Existing cross-modal hashing methods transform data into low-dimensional feature vectors, and then generate binary codes by another separate quantization step.

0
0
0
Abstract

Hashing is widely applied to approximate nearest neighbor search for large-scale multimodal retrieval with storage and computation efficiency. Cross-modal hashing improves the quality of hash coding by exploiting semantic correlations across different modalities. Existing cross-modal hashing methods first transform data into low-dimensional feature vectors, and then generate binary codes by another separate quantization step. However, suboptimal hash codes may be generated since the quantization error is not explicitly minimized and the feature representation is not jointly optimized with the binary codes. This paper presents a Correlation Hashing Network (CHN) approach to cross-modal hashing, which jointly learns good data representation tailored to hash coding and formally controls the quantization error. The proposed CHN is a hybrid deep architecture that constitutes a convolutional neural network for learning good image representations, a multilayer perception for learning good text representations, two hashing layers for generating compact binary codes, and a structured max-margin loss that integrates all things together to enable learning similarity-preserving and high-quality hash codes. Extensive empirical study shows that CHN yields state of the art cross-modal retrieval performance on standard benchmarks.

Fri Jul 06 2012
Neural Networks
Multimodal similarity-preserving hashing
We introduce an efficient computational framework for hashing data belonging to multiple modalities into a single representation space. We show experimentally that our method significantly outperforms state-of-the-art hashing approaches.
0
0
0
Wed Jan 09 2019
Computer Vision
Deep Semantic Multimodal Hashing Network for Scalable Multimedia Retrieval
Hashing has been widely applied to multimodal retrieval on large-scale multimedia data due to its efficiency in computation and storage. Most of existing deep hashing methods learn binary hash codes by preserving the similarity of data points without exploiting the semantic labels. In this work, we propose a novel Deep Semantic Multimodal Hashing Network.
0
0
0
Sat Aug 01 2020
Computer Vision
Unsupervised Deep Cross-modality Spectral Hashing
This paper presents a novel framework, namely Deep Cross-modality Spectral Hashing (DCSH), to tackle the unsupervised learning problem of binary hash codes. The framework is a two-step hashing approach which decouples the optimization into (1) binary optimization and (2
0
0
0
Thu Aug 17 2017
Computer Vision
Deep Binary Reconstruction for Cross-modal Hashing
Cross-modal retrieval based on hashing technique has drawn much attention nowadays. It takes the binary codes of one modality as the query to retrieve the relevant hashing codes of another modality. We provide a Deep Binary Reconstruction (DBRC) network that can directly learn the hashing codes.
0
0
0
Wed Apr 01 2020
Machine Learning
Task-adaptive Asymmetric Deep Cross-modal Hashing
Supervised cross-modal hashing aims to embed the semantic correlations of heterogeneous modality data into the binary hash codes with discriminative semantic labels. The superiority of TA-ADCMH is proved on two standard datasets.
0
0
0
Sat May 15 2021
Computer Vision
FDDH: Fast Discriminative Discrete Hashing for Large-Scale Cross-Modal Retrieval
Fast Discriminative Discrete Hashing (FDDH) is an efficient cross-modal retrieval method. FDDH introduces an orthogonal basis to regress the targeted hash codes of training examples to their corresponding semantic labels.
2
0
0