TF: XLA repetition penalty #16879
Conversation
The documentation is not available anymore as the PR was closed or merged.
This looks like a good port of the original numpy code!
(But I still think logit penalties should be additive rather than multiplicative)
Thinking about it more, a multiplicative logit penalty really doesn't work, right? Even if we use the reciprocal when the logit is negative, the scale of the penalty depends on the logit's distance from 0. For example, a logit in the range -0.1 to +0.1 will barely be moved by the penalty term, but such logits usually have quite a high probability of being chosen, because most logits are large and negative.
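To make the scale dependence concrete, here is a minimal sketch of the two alternatives being debated, assuming the CTRL-style multiplicative rule (divide positive logits by the penalty, multiply negative ones by it); the function names are illustrative, not this PR's code:

```python
import tensorflow as tf

def apply_multiplicative_penalty(logits: tf.Tensor, penalty: float) -> tf.Tensor:
    # CTRL-style rule: positive logits shrink toward 0, negative logits are
    # pushed further below 0. Either way the shift is proportional to
    # |logit|, so logits near 0 are barely penalized.
    return tf.where(logits > 0, logits / penalty, logits * penalty)

def apply_additive_penalty(logits: tf.Tensor, penalty: float) -> tf.Tensor:
    # An additive penalty shifts every logit by the same amount,
    # independent of its magnitude.
    return logits - penalty

logits = tf.constant([5.0, 0.05, -0.05, -5.0])
print(apply_multiplicative_penalty(logits, 1.2).numpy())
# -> [ 4.1667  0.0417 -0.06   -6.    ]  (the near-zero logits barely move)
print(apply_additive_penalty(logits, 1.2).numpy())
# -> [ 3.8   -1.15  -1.25  -6.2  ]      (every logit shifts by the same 1.2)
```

The near-zero logits, which the comment above argues are often the likeliest candidates, are almost untouched by the multiplicative rule but get the full shift under the additive one.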
(merging as the main goal was to port to XLA but, by all means, continue the discussion :) )
What does this PR do?
This PR adds our first XLA-compatible TF logit processor, as well as corresponding tests. Since this is the first of a series of small (but similar) PRs, I'd like to request a more thorough review, so that the remaining ones can be reviewed quickly.
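As a rough illustration of what "XLA-compatible" means here, the sketch below implements a multiplicative repetition penalty with gather/scatter ops that have XLA kernels and static output shapes, and compiles it with `jit_compile=True`. It is a stand-in written under those assumptions, not this PR's actual implementation, and all names are hypothetical:

```python
import tensorflow as tf

@tf.function(jit_compile=True)
def repetition_penalty(input_ids: tf.Tensor, logits: tf.Tensor, penalty: float) -> tf.Tensor:
    batch_size, seq_len = tf.shape(input_ids)[0], tf.shape(input_ids)[1]
    # Build (batch, token_id) index pairs for every previously generated token.
    batch_idx = tf.repeat(tf.range(batch_size), seq_len)
    indices = tf.stack([batch_idx, tf.reshape(input_ids, [-1])], axis=1)
    # Gather the logits of seen tokens, apply the multiplicative rule, and
    # scatter the penalized values back in place of the originals.
    token_logits = tf.gather_nd(logits, indices)
    penalized = tf.where(token_logits > 0, token_logits / penalty, token_logits * penalty)
    return tf.tensor_scatter_nd_update(logits, indices, penalized)

logits = tf.constant([[1.0, -1.0, 0.5]])    # batch of 1, vocab of 3
input_ids = tf.constant([[0, 2]])           # tokens 0 and 2 were generated
print(repetition_penalty(input_ids, logits, 2.0).numpy())  # [[0.5, -1.0, 0.25]]
```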
More specifically, this PR makes three changes: