These are torch.utils.data.Dataset classes for the Deep Evaluation of Acoustic Representations (DEAR) dataset. The corresponding paper is available on arXiv and the data on Zenodo. Copy the dear directory into your source repository, then instantiate a Dataset for the desired evaluation task using the corresponding class:
environment_eval_dataset = EnvironmentDEARDataset(
base_path=Path("/data/evaluation/dear"),
split=DatasetType.TRAIN,
target_variable_type=TargetVariableType.DISCRETE,
)
indoor_or_outdoor_eval_dataset = IndoorOutdoorDEARDataset(
base_path=Path("/data/evaluation/dear"),
split=DatasetType.TRAIN,
target_variable_type=TargetVariableType.DISCRETE,
)
noise_eval_dataset = StationaryTransientNoiseDEARDataset(
base_path=Path("/data/evaluation/dear"),
split=DatasetType.TRAIN,
target_variable_type=TargetVariableType.DISCRETE,
)
snr_eval_dataset = SNRDEARDataset(
base_path=Path("/data/evaluation/dear"),
split=DatasetType.TRAIN,
target_variable_type=TargetVariableType.CONTINUOUS,
)
speech_present_eval_dataset = SpeechDEARDataset(
base_path=Path("/data/evaluation/dear"),
split=DatasetType.TRAIN,
speech_present=True,
target_variable_type=TargetVariableType.DISCRETE,
)
speakers_active_eval_dataset = SpeechDEARDataset(
base_path=Path("/data/evaluation/dear"),
split=DatasetType.TRAIN,
speech_present=False,
target_variable_type=TargetVariableType.CONTINUOUS,
)
drr_eval_dataset = DRRDEARDataset(
base_path=Path("/data/evaluation/dear"),
split=DatasetType.TRAIN,
target_variable_type=TargetVariableType.CONTINUOUS,
)
rt60_eval_dataset = RT60DEARDataset(
base_path=Path("/data/evaluation/dear"),
split=DatasetType.TRAIN,
target_variable_type=TargetVariableType.CONTINUOUS,
)
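For batched evaluation, any of the datasets above can be wrapped in a standard torch.utils.data.DataLoader. A minimal sketch, using a TensorDataset as a stand-in for one of the DEAR datasets so it runs end to end (shapes and batch_size here are illustrative, not taken from the repository):

```python
import torch
from torch.utils.data import DataLoader, TensorDataset

# Stand-in for a DEAR evaluation dataset: 8 one-second segments
# at a hypothetical 16 kHz sample rate, with one label each.
toy_dataset = TensorDataset(torch.randn(8, 16000), torch.randn(8))

# shuffle=False keeps evaluation order deterministic.
loader = DataLoader(toy_dataset, batch_size=4, shuffle=False)

for segments, labels in loader:
    # Each batch stacks segments and labels along a new leading dim.
    print(segments.shape, labels.shape)
```

The same pattern applies to the real datasets, provided __getitem__ returns tensors (or arrays) that the default collate function can stack.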
Use the standard PyTorch pattern to run the evaluation, e.g.

model = Wav2Vec2Model()
for segments, labels in rt60_eval_dataset:
    predicted_labels = model(segments)
    score = metric(labels, predicted_labels)
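The metric in the loop above is left abstract. One reasonable choice (an assumption on my part, not prescribed by the DEAR paper) is mean absolute error for the continuous targets (RT60, DRR, SNR) and plain accuracy for the discrete ones; the function names below are mine:

```python
import torch

def mae_metric(labels, predicted_labels):
    # Mean absolute error for continuous targets such as RT60.
    return torch.mean(torch.abs(labels - predicted_labels)).item()

def accuracy_metric(labels, predicted_labels):
    # Fraction of exact matches for discrete targets.
    return (labels == predicted_labels).float().mean().item()
```

Either function can be dropped in as `metric` in the evaluation loop, depending on the task's target_variable_type.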