Need help in infer.py #32

Open
malgabri opened this issue Jun 21, 2020 · 0 comments


First of all, I would like to thank you for sharing this great work.

I successfully trained the model for articulatory detection on TIMIT by following the instructions in the README.

However, I ran into some problems with infer.py.

  1. The function "segs_phones_to_frame_binf" raises the error "operands could not be broadcast together with shapes (62,) (42,)". This happens because there are 62 attributes and 42 phonemes [39 (+UNK, SOS, EOS)], while the length of logits_binf is only 42.

  2. Also, could you provide the notebook for computing the accuracy of attribute detection at the frame level?

  3. Finally, the paper reports 28 place and manner attributes. Which mapping file did you use to calculate the detection accuracy for only these 28?
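
For reference, the error message in item 1 can be reproduced outside the repository with a minimal NumPy sketch. This is not the code from infer.py: the elementwise product below is only a stand-in for whatever operation "segs_phones_to_frame_binf" performs, and the names try_combine, attr_vec, NUM_ATTRIBUTES and NUM_PHONEMES are hypothetical. It only assumes that a length-62 attribute vector is combined elementwise with a length-42 logits_binf.

```python
import numpy as np

NUM_ATTRIBUTES = 62  # attribute count reported in item 1
NUM_PHONEMES = 42    # 39 phonemes + UNK, SOS, EOS

def try_combine(attr_vec, logits):
    """Return the elementwise product, or NumPy's error message on a shape mismatch.

    Hypothetical helper: the product stands in for the real operation in infer.py.
    """
    try:
        return attr_vec * logits
    except ValueError as err:
        return str(err)

# Hypothetical stand-ins for the two arrays named in the error.
attr_vec = np.zeros(NUM_ATTRIBUTES)
logits_binf = np.zeros(NUM_PHONEMES)

# Mismatched lengths (62 vs 42) cannot be broadcast, so a message is returned.
message = try_combine(attr_vec, logits_binf)
print(message)

# With matching lengths the same operation succeeds.
matched = try_combine(attr_vec, np.zeros(NUM_ATTRIBUTES))
print(matched.shape)
```

If this matches what happens in infer.py, the inference output would need to have one value per attribute (62) rather than one per phoneme (42), but I am not sure which side is expected to change.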

Thanks a lot.
