Skip to content

Latest commit

 

History

History
84 lines (79 loc) · 17.3 KB

File metadata and controls

84 lines (79 loc) · 17.3 KB

ICASSP-2024-Papers

Application App
Previous Collections Conference

Speech Emotion Recognition and Analysis

Section Papers Preprint Papers Papers with Open Code Papers with Video

Title Repo Paper Video
Improving Multi-Modal Emotion Recognition using Entropy-based Fusion and Pruning-based Network Architecture Optimization IEEE Xplore
Improving Speaker-Independent Speech Emotion Recognition using Dynamic Joint Distribution Adaptation IEEE Xplore
arXiv
Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition GitHub IEEE Xplore
arXiv
Revealing Emotional Clusters in Speaker Embeddings: A Contrastive Learning Strategy for Speech Emotion Recognition IEEE Xplore
arXiv
Generalization of Self-Supervised Learning-based Representations for Cross-Domain Speech Emotion Recognition IEEE Xplore
Improving Speech Emotion Recognition with Unsupervised Speaking Style Transfer GitHub Page IEEE Xplore
arXiv
Foundation Model Assisted Automatic Speech Emotion Recognition: Transcribing, Annotating, and Augmenting IEEE Xplore
arXiv
CLAP4Emo: ChatGPT-Assisted Speech Emotion Retrieval with Natural Language Supervision GitHub IEEE Xplore
EMOCONV-Diff: Diffusion-based Speech Emotion Conversion for Non-Parallel and in-the-Wild Data WEB Page IEEE Xplore
arXiv
Large Language Model-based Emotional Speech Annotation using Context and Acoustic Feature for Speech Emotion Recognition IEEE Xplore
Leveraging Speech PTM, Text LLM, and Emotional TTS for Speech Emotion Recognition IEEE Xplore
arXiv
Customising General Large Language Models for Specialised Emotion Recognition Tasks IEEE Xplore
arXiv
RL-EMO: A Reinforcement Learning Framework for Multimodal Emotion Recognition GitHub IEEE Xplore
Zero Shot Audio to Audio Emotion Transfer with Speaker Disentanglement GitHub IEEE Xplore
arXiv
TRUST-SER: On the Trustworthiness of Fine-Tuning Pre-Trained Speech Embeddings for Speech Emotion Recognition GitHub IEEE Xplore
arXiv
STYLECAP: Automatic Speaking-Style Captioning from Speech based on Speech and Language Self-Supervised Learning Models GitHub Page IEEE Xplore
arXiv
Frame-Level Emotional State Alignment Method for Speech Emotion Recognition GitHub IEEE Xplore
arXiv
Gradient-based Dimensionality Reduction for Speech Emotion Recognition using Deep Networks GitHub IEEE Xplore
Disentanglement Network: Disentangle the Emotional Features from Acoustic Features for Speech Emotion Recognition IEEE Xplore
Balancing Speaker-Rater Fairness for Gender-Neutral Speech Emotion Recognition IEEE Xplore
Prompting Audios using Acoustic Properties for Emotion Representation IEEE Xplore
arXiv
Learning Arousal-Valence Representation from Categorical Emotion Labels of Speech GitHub IEEE Xplore
arXiv
A Robust Pitch-Fusion Model for Speech Emotion Recognition in Tonal Languages GitHub IEEE Xplore
Modeling Intrapersonal and Interpersonal Influences for Automatic Estimation of Therapist Empathy in Counseling Conversation IEEE Xplore
arXiv
Towards Improving Speech Emotion Recognition using Synthetic Data Augmentation from Emotion Conversion IEEE Xplore
Emohrnet: High-Resolution Neural Network based Speech Emotion Recognition IEEE Xplore
Fine-Grained Disentangled Representation Learning for Multimodal Emotion Recognition IEEE Xplore
arXiv
Investigating Salient Representations and Label Variance in Dimensional Speech Emotion Analysis IEEE Xplore
Adaptive Speech Emotion Representation Learning based on Dynamic Graph IEEE Xplore
Enhancing Two-Stage Finetuning for Speech Emotion Recognition using Adapters IEEE Xplore
Speech Swin-Transformer: Exploring a Hierarchical Transformer with Shifted Windows for Speech Emotion Recognition IEEE Xplore
arXiv
Emotion-Aware Contrastive Adaptation Network for Source-Free Cross-Corpus Speech Emotion Recognition IEEE Xplore
arXiv
Dynamic Speech Emotion Recognition using a Conditional Neural Process IEEE Xplore
MS-SENet: Enhancing Speech Emotion Recognition through Multi-Scale Feature Fusion with Squeeze-and-Excitation Blocks GitHub IEEE Xplore
arXiv
GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Accurate Speech Emotion Recognition IEEE Xplore
arXiv
Multi-Source Unsupervised Transfer Components Learning for Cross-Domain Speech Emotion Recognition IEEE Xplore
Self-Supervised Domain Exploration with an Optimal Transport Regularization for Open Set Cross-Domain Speech Emotion Recognition IEEE Xplore
Multi-Modal Emotion Recognition using Multiple Acoustic Features and Dual Cross-Modal Transformer IEEE Xplore
Speech Relationship Learning for Cross-Corpus Speech Emotion Recognition IEEE Xplore
Parameter Efficient Finetuning for Speech Emotion Recognition and Domain Adaptation IEEE Xplore
arXiv
MF-AED-AEC: Speech Emotion Recognition by Leveraging Multimodal Fusion, ASR Error Detection, and ASR Error Correction IEEE Xplore
arXiv
Improving Domain Generalization in Speech Emotion Recognition with Whisper IEEE Xplore
Comparing Data-Driven and Handcrafted Features for Dimensional Emotion Recognition GitHub IEEE Xplore
Speech Emotion Recognition with Distilled Prosodic and Linguistic Affect Representations IEEE Xplore
arXiv
MCM-CSD: Multi-Granularity Context Modeling with Contrastive Speaker Detection for Emotion Recognition in Real-Time Conversation GitHub IEEE Xplore