[QUESTION] About Z-score #238

Open
moore3930 opened this issue Dec 18, 2024 · 1 comment
Labels
question Further information is requested

Comments

@moore3930

Hi, I see that COMET is trained on z-scores of DA (direct assessment) scores. However, I am not sure how this is implemented.

Are the scores rescaled at the translation-direction level, or at some other level?

@moore3930 moore3930 added the question Further information is requested label Dec 18, 2024
@vince62s

In essence, the z-score is rescaled per annotator to make sure there are no big discrepancies between annotators. It should therefore make scores consistent across languages, but comparing scores between languages is still not very meaningful.
However, I used the same method as COMET in my estimator for EuroLLM (see https://medium.com/p/7dccfe167814), and since WMT24 provides the same English source for most language pairs, you can see that scores do not differ much across traditional pairs. The real question is: do we really trust the DA scores year after year?
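
To make the per-annotator rescaling mentioned above concrete, here is a minimal sketch in plain Python. It is not COMET's or WMT's actual preprocessing code: the function name `zscore_by_annotator`, the `(annotator_id, raw_score)` input layout, the use of the population standard deviation, and the fallback to 0.0 for annotators with no score variance are all assumptions made for illustration.

```python
from collections import defaultdict
from statistics import mean, pstdev


def zscore_by_annotator(ratings):
    """Standardize raw DA scores separately for each annotator.

    ratings: list of (annotator_id, raw_score) pairs (hypothetical layout).
    Returns a list of z-scores in the same order as the input.
    """
    # Group raw scores by the annotator who produced them.
    by_annotator = defaultdict(list)
    for annotator, score in ratings:
        by_annotator[annotator].append(score)

    # Per-annotator mean and (population) standard deviation.
    stats = {a: (mean(s), pstdev(s)) for a, s in by_annotator.items()}

    z_scores = []
    for annotator, score in ratings:
        mu, sigma = stats[annotator]
        # Assumption: an annotator with zero variance gets a z-score of 0.0.
        z_scores.append(0.0 if sigma == 0 else (score - mu) / sigma)
    return z_scores


# A lenient annotator (a1) and a strict one (a2) rating three segments each.
ratings = [("a1", 90), ("a1", 70), ("a1", 80),
           ("a2", 40), ("a2", 20), ("a2", 30)]
z = zscore_by_annotator(ratings)
```

In this toy example both annotators end up with identical z-scores for their best, worst, and middle segment, even though their raw scores differ by 50 points. That is the discrepancy the rescaling is meant to remove. Whether the real data is grouped by annotator alone or by annotator within a language pair is not stated in this thread.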
