[BUG]: the lengths of the features after FACodecEncoderV2 is not match #188

Mahaotian1 · 2024-04-19T02:52:20Z

bug of FACodecEncoderV2

I have extracted prosody_feature and encoder_output from FACodecEncoderV2. It raise wrong when I use fa_decoder_v2 to extract vq codecs becaucse the lengths of prosody_feature(torch.Size([1, 20, 281])) and encoder_output(torch.Size([1, 256, 282])) is not same.

my code

wav_b = librosa.load(wav_b, sr=16000)[0]
wav_b = torch.from_numpy(wav_b).float()
wav_b = wav_b.unsqueeze(0).unsqueeze(0)
enc_out_b = fa_encoder_v2(wav_b)
prosody_b = fa_encoder_v2.get_prosody_feature(wav_b)
vq_post_emb_b, vq_id_b, _, quantized, spk_embs_b = fa_decoder_v2(
enc_out_b, prosody_b, eval_vq=False, vq=True
)

bug

File "/home/data/mahaotian/Amphion/models/codec/ns3_codec/inference_codc.py", line 129, in
vq_post_emb_a, vq_id_a, _, quantized, spk_embs_a = fa_decoder_v2(
File "/home/data/mahaotian/anaconda3/envs/vallex/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1518, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "/home/data/mahaotian/anaconda3/envs/vallex/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1527, in _call_impl
return forward_call(*args, **kwargs)
File "/home/data/mahaotian/Amphion/models/codec/ns3_codec/facodec.py", line 1086, in forward
outs, qs, commit_loss, quantized_buf = self.quantize(
File "/home/data/mahaotian/Amphion/models/codec/ns3_codec/facodec.py", line 1048, in quantize
outs += out
RuntimeError: The size of tensor a (281) must match the size of tensor b (282) at non-singleton dimension 2

HeCheng0625 · 2024-04-26T07:28:02Z

Hi, you need padding your wav length to multiples of 200 (hopsize)

Mahaotian1 added the bug Something isn't working label Apr 19, 2024

RMSnow assigned HeCheng0625 Apr 19, 2024

norabai mentioned this issue Apr 20, 2024

Added data padding to 'forward', 'inference', and 'get_prosody_featur… #189

Open

9 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG]: the lengths of the features after FACodecEncoderV2 is not match #188

[BUG]: the lengths of the features after FACodecEncoderV2 is not match #188

Mahaotian1 commented Apr 19, 2024

HeCheng0625 commented Apr 26, 2024

[BUG]: the lengths of the features after FACodecEncoderV2 is not match #188

[BUG]: the lengths of the features after FACodecEncoderV2 is not match #188

Comments

Mahaotian1 commented Apr 19, 2024

bug of FACodecEncoderV2

my code

bug

HeCheng0625 commented Apr 26, 2024