Visual Word Embedding for Text Classification

Contributo in Atti di convegno

Data di Pubblicazione:

2021

Abstract:

The question we answer with this paper is: ‘can we convert a text document into an image to take advantage of image neural models to classify text documents?’ To answer this question we present a novel text classification method that converts a document into an encoded image, using word embedding. The proposed approach computes the Word2Vec word embedding of a text document, quantizes the embedding, and arranges it into a 2D visual representation, as an RGB image. Finally, visual embedding is categorized with state-of-the-art image classification models. We achieved competitive performance on well-known benchmark text classification datasets. In addition, we evaluated our proposed approach in a multimodal setting that allows text and image information in the same feature space.

Tipologia CRIS:

Relazione (in Volume)

Keywords:

Encoded text; Multimodal classification; Word embedding

Elenco autori:

Gallo, I.; Nawaz, S.; Landro, N.; La Grassa, R.

Autori di Ateneo:

GALLO IGNAZIO

LANDRO NICOLA

Link alla scheda completa:

https://irinsubria.uninsubria.it/handle/11383/2125887

Titolo del libro:

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Pubblicato in:

LECTURE NOTES IN ARTIFICIAL INTELLIGENCE

Journal

LECTURE NOTES IN ARTIFICIAL INTELLIGENCE

Series