Latest cl news

MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis. (arXiv:2005.03545v1 [cs.CL])

By arxiv.org
Published On ::

Multimodal Sentiment Analysis is an active area of research that leverages multimodal signals for affective understanding of user-generated videos. The predominant approach, addressing this task, has been to develop sophisticated fusion techniques. However, the heterogeneous nature of the signals creates distributional modality gaps that pose significant challenges. In this paper, we aim to learn effective modality representations to aid the process of fusion. We propose a novel framework, MISA, which projects each modality to two distinct subspaces. The first subspace is modality invariant, where the representations across modalities learn their commonalities and reduce the modality gap. The second subspace is modality-specific, which is private to each modality and captures their characteristic features. These representations provide a holistic view of the multimodal data, which is used for fusion that leads to task predictions. Our experiments on popular sentiment analysis benchmarks, MOSI and MOSEI, demonstrate significant gains over state-of-the-art models. We also consider the task of Multimodal Humor Detection and experiment on the recently proposed UR_FUNNY dataset. Here too, our model fares better than strong baselines, establishing MISA as a useful multimodal framework.

MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis. (arXiv:2005.03545v1 [cs.CL])

The Danish Gigaword Project. (arXiv:2005.03521v1 [cs.CL])

Practical Perspectives on Quality Estimation for Machine Translation. (arXiv:2005.03519v1 [cs.CL])

Computing with bricks and mortar: Classification of waveforms with a doped concrete blocks. (arXiv:2005.03498v1 [cs.ET])

Fine-Grained Analysis of Cross-Linguistic Syntactic Divergences. (arXiv:2005.03436v1 [cs.CL])

The Perceptimatic English Benchmark for Speech Perception Models. (arXiv:2005.03418v1 [cs.CL])

Scheduling with a processing time oracle. (arXiv:2005.03394v1 [cs.DS])

Does Multi-Encoder Help? A Case Study on Context-Aware Neural Machine Translation. (arXiv:2005.03393v1 [cs.CL])

2kenize: Tying Subword Sequences for Chinese Script Conversion. (arXiv:2005.03375v1 [cs.CL])

Playing Minecraft with Behavioural Cloning. (arXiv:2005.03374v1 [cs.AI])

JASS: Japanese-specific Sequence to Sequence Pre-training for Neural Machine Translation. (arXiv:2005.03361v1 [cs.CL])

DramaQA: Character-Centered Video Story Understanding with Hierarchical QA. (arXiv:2005.03356v1 [cs.CL])

Wavelet Integrated CNNs for Noise-Robust Image Classification. (arXiv:2005.03337v1 [cs.CV])

Boosting Cloud Data Analytics using Multi-Objective Optimization. (arXiv:2005.03314v1 [cs.DB])

Nakdan: Professional Hebrew Diacritizer. (arXiv:2005.03312v1 [cs.CL])

Adaptive Feature Selection Guided Deep Forest for COVID-19 Classification with Chest CT. (arXiv:2005.03264v1 [eess.IV])

Quda: Natural Language Queries for Visual Data Analytics. (arXiv:2005.03257v1 [cs.CL])

Multi-Target Deep Learning for Algal Detection and Classification. (arXiv:2005.03232v1 [cs.CV])

Conley's fundamental theorem for a class of hybrid systems. (arXiv:2005.03217v1 [math.DS])

A Dynamical Perspective on Point Cloud Registration. (arXiv:2005.03190v1 [cs.CV])

Fact-based Dialogue Generation with Convergent and Divergent Decoding. (arXiv:2005.03174v1 [cs.CL])

Unsupervised Multimodal Neural Machine Translation with Pseudo Visual Pivoting. (arXiv:2005.03119v1 [cs.CL])

AIOps for a Cloud Object Storage Service. (arXiv:2005.03094v1 [cs.DC])

Diagnosing the Environment Bias in Vision-and-Language Navigation. (arXiv:2005.03086v1 [cs.CL])

Categorical Vector Space Semantics for Lambek Calculus with a Relevant Modality. (arXiv:2005.03074v1 [cs.CL])

I Always Feel Like Somebody's Sensing Me! A Framework to Detect, Identify, and Localize Clandestine Wireless Sensors. (arXiv:2005.03068v1 [cs.CR])

Weakly-Supervised Neural Response Selection from an Ensemble of Task-Specialised Dialogue Agents. (arXiv:2005.03066v1 [cs.CL])

Extracting Headless MWEs from Dependency Parse Trees: Parsing, Tagging, and Joint Modeling Approaches. (arXiv:2005.03035v1 [cs.CL])

Evaluating text coherence based on the graph of the consistency of phrases to identify symptoms of schizophrenia. (arXiv:2005.03008v1 [cs.CL])

How Biofuels Can Cool Our Climate and Strengthen Our Ecosystems

Closure of Diablo Canyon Nuclear Plant

Docker Image for ASK and AWS CLI

Day Clock ( vue.js )

Clint Eastwood's true-life drama Richard Jewell takes aims at big targets, and misses

Spokane musician Eliza Johnson brought her quirky style — and tinned fish — to American Idol Sunday night. Watch the clip

It's no Pixar classic, but Onward continues the studio's penchant for intelligent, original animated entertainment

The Fox Theater cancels all events, including Spokane Symphony concerts, through April 10

Spokane Symphony launches Musicians' Relief Fund to help local classical stars survive the pandemic

Spokane Comedy Club bringing the laughs from Dan Cummins, Spokane's Kelsey Cook and more right to your computer this weekend

5 ways to entertain yourself online, from concerts and art shows to painting classes and story times

How climate change is contributing to skyrocketing rates of infectious disease

Regain control of your closet with some simple steps

North Idaho's Best Golf Course: Circling Raven

With ridership declining, we hop on the bus with one big question in mind: Where is the STA headed?

Combinatorial synthesis of libraries of macrocyclic compounds useful in drug discovery

Compound and organic light-emitting device including the same

Process for the conversion of aliphatic cyclic amines to aliphatic diamines

Techniques for evaluation, building and/or retraining of a classification model

Classifying unclassified samples

Multiple two-state classifier output fusion system and method

The Finish Line: Cleaning EIFS

Cloaked in Green?

New Gadget Analyzes Everything Including Building Industry

Climate Change

Exoskeleton in the Job Site Closet

Only 12 per cent of leading charities publicly recognise a trade union, analysis suggests

Companies' 'Green' Efforts Include Products’ Material Content

Viridian Introduces Engineered Reclaimed Hardwood Line

Reclaimé Collection by Quick-Step includes new White Washed Oak look

Good Morning, News: City Council to Vote on Clean & Safe Contract, Vision Zero Gets an Audit, and Trump Taps Elon Musk to Lead DOGE (Do You Even Want to Know?)

Portland’s Ranked Choice Voting Was a Success (Despite What the Oregonian Claims)

Who's powering nuclear energy's comeback?

What Trump's win means for electric vehicle manufacturers

Why working-class voters have been shifting toward the Republican Party

Exquisite bird fossil provides clues to the evolution of avian brains

Subscribe To Our Newsletter