[QUESTION] About Z-score #238

moore3930 · 2024-12-18T23:05:12Z

Hi, I find that COMET is trained on the z-score of DA. However, I am not sure about the implementation.

Is it rescaled on the translation direction level or something else?

vince62s · 2024-12-19T07:17:47Z

By essence the z-score is rescaled by annotator to make sure there are no big discrepancies. Therefore it should make scores consistent across languages but it is not so meaningfull to compare scores between languages.
However since I used the same method as COMET in my estimator for EuroLLM here: https://medium.com/p/7dccfe167814
and as wmt24 provides the same English source for most language pair, you can see that scores are not so much different across traditional pairs. The question is more about: do we really trust the DA scores year after year ....

moore3930 added the question Further information is requested label Dec 18, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[QUESTION] About Z-score #238

[QUESTION] About Z-score #238

moore3930 commented Dec 18, 2024

vince62s commented Dec 19, 2024

[QUESTION] About Z-score #238

[QUESTION] About Z-score #238

Comments

moore3930 commented Dec 18, 2024

vince62s commented Dec 19, 2024