An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
Updated Oct 10, 2023 - Python
A word-level language identification tool that labels the language of each word in code-mixed text, e.g. text that mixes English with Hindi written in Roman script.
Implementation of meta-transfer-learning for ASR and LM (ACL 2020)
Multilingual Meta-Embeddings for Named Entity Recognition (RepL4NLP & EMNLP 2019)
PyTorch implementation of CS-Tacotron, an end-to-end generative TTS model for code-switching speech synthesis.
💬 MaskLID: Code-Switching Language Identification through Iterative Masking -- ACL 2024
Code-Switching Language Modeling using Syntax-Aware Multi-Task Learning (CALCS 2018, ACL)
Code-switching patterns can be an effective route to improve performance of downstream NLP applications: A case study of humour, sarcasm and hate speech detection
Repository containing code for Abusive Tweet Detection, Location Detection and Gender Detection
POSIT aims to segment and tag mixed text that contains English and C-like code, so that the user knows both what a token is and what role it serves within the language it is used in, such as an AST tag or PoS tag.
A sequence tagging model with active learning
Implementation of a deep learning model (BiLSTM) to detect code-switching
Code for "CoVoSwitch: Machine Translation of Synthetic Code-Switched Text Based on Intonation Units" (Accepted at ACL-SRW 2024) 🇹🇭
A socket script to obtain a Chinese phone sequence for any English word
Official repository for the paper titled "From Machine Translation to Code-Switching: Generating High-Quality Code-Switched Text" accepted at ACL 2021
Chrome extension for translating highlighted English text into Chinglish (a Chinese + English hybrid)
A word-level Language Identification (LID) tool for Tagalog-English (Taglish) text.
Code-Switching Sentence Generation by Generative Adversarial Networks and its Application to Data Augmentation. (Interspeech 2019)
Code-switching detection for Pacific languages
This project uses existing open-source eye-tracking data from Mandarin Chinese-English bilinguals to train a machine learning model that predicts L2 English proficiency from reading behavior.