Skip to content
View BingYang-20's full-sized avatar

Organizations

@Audio-WestlakeU

Block or report BingYang-20

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Multi-Channel Conformer” [TASLP 2024]

Python 34 1 Updated Oct 11, 2024

A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurIPS 2024]

Python 112 12 Updated Dec 11, 2024

Impulse response generation based on state-of-the-art geometric sound propagation engine.

C++ 154 20 Updated Jan 17, 2023

End-to-End Neural Diarization

Python 395 59 Updated Aug 30, 2021

Some comprehensive papers about speaker diarization

250 5 Updated Nov 12, 2024

Training data simulation

Python 47 7 Updated May 6, 2024

The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based attractors". [ICASSP 2024] and "LS-EEND: long-form streaming…

Python 115 5 Updated Feb 18, 2025

The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation

Python 255 31 Updated Jan 1, 2025

Awesome Papers related to Mamba.

1,310 68 Updated Oct 17, 2024

The missing star history graph of GitHub repos - https://star-history.com

TypeScript 7,012 269 Updated Feb 19, 2025

Deep-learning-based implementation of the popular Hungarian algorithm that helps solve the assignment problem.

Python 25 2 Updated Aug 31, 2023

A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022]

Python 42 14 Updated Sep 28, 2024

The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]

Python 102 11 Updated Dec 9, 2024

Translating Synthetic RIRs to Real RIRs

Python 41 9 Updated Sep 15, 2023

A library that contains a rich collection of performant PyTorch model metrics, a simple interface to create new metrics, a toolkit to facilitate metric computation in distributed training and tools…

Python 226 54 Updated Jan 17, 2025

This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.

Python 1,157 420 Updated Jul 25, 2024

A list of publicly available room impulse response datasets and scripts to download them.

Shell 440 39 Updated Oct 7, 2024

Gammatone-based spectrograms, using gammatone filterbanks or Fourier transform weightings.

MATLAB 221 67 Updated Jun 29, 2023

Da - ECHO - RetrievAl - daTasEt

Jupyter Notebook 25 4 Updated Jul 7, 2024

Measuring room impulse responses with python and sounddevice

Python 73 18 Updated Jun 30, 2019

👫 Joint Discriminative and Generative Learning for Person Re-identification. CVPR'19 (Oral) 👫

Python 1,286 226 Updated Jul 9, 2023

End-to-End Object Detection with Transformers

Python 13,997 2,514 Updated Mar 12, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 12,382 1,575 Updated Feb 29, 2024

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Python 8,886 1,093 Updated Oct 9, 2024

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1,220 226 Updated May 21, 2023
Python 188 28 Updated Dec 4, 2023

BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation

Python 209 36 Updated Apr 26, 2023

Transformer seq2seq model, program that can build a language translator from parallel corpus

Python 1,373 349 Updated May 19, 2023

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,519 441 Updated Jan 3, 2025
Next
Showing results