feat(te): add te2 for upper bound on avg batch width #2

Merged
merged 1 commit into from
Aug 16, 2024

aryamaanthakur
Contributor

TE2 provides a realistic upper bound on the average batch width.

Given a dataset and a batch size b, TE2 is calculated by first sorting the dataset in descending order of sequence length, then creating batches by selecting the top 1 and bottom (b-1) sequences without replacement. Each batch's width is therefore the length of its single top sequence.

Average Batch Width = mean(lengths of the top [len(dataset)/b] sequences)
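
To illustrate the batching rule described above, here is a minimal sketch in Python. It is not the code merged in this PR: the function names `te2_batches` and `te2_avg_batch_width` are hypothetical, the dataset is represented only by a list of sequence lengths, and keeping a final partial batch when len(dataset) is not divisible by b is an assumption the description does not settle.

```python
from collections import deque
from statistics import mean


def te2_batches(lengths, b):
    """Group lengths into batches of the longest remaining item plus the (b - 1) shortest.

    Hypothetical helper for illustration; not taken from the PR's code.
    """
    if b < 1:
        raise ValueError("batch size must be at least 1")
    remaining = deque(sorted(lengths, reverse=True))  # longest first
    batches = []
    while remaining:
        batch = [remaining.popleft()]  # top 1
        for _ in range(b - 1):  # bottom (b - 1), without replacement
            if not remaining:
                break  # assumption: a final partial batch is kept
            batch.append(remaining.pop())
        batches.append(batch)
    return batches


def te2_avg_batch_width(lengths, b):
    """Average over batches of the batch width (the longest sequence in each batch)."""
    return mean(max(batch) for batch in te2_batches(lengths, b))


lengths = [10, 3, 8, 1, 6, 2, 9, 4]
batches = te2_batches(lengths, 4)
avg_width = te2_avg_batch_width(lengths, 4)
```

With these eight lengths and b = 4, the two batches are led by the sequences of length 10 and 9, so the simulated average matches the mean of the top len(dataset)/b = 2 lengths.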

Signed-off-by: aryamaanthakur <aryamaan.thakur@outlook.com>
@ChanderG ChanderG mentioned this pull request Aug 8, 2024
3 tasks
@aryamaanthakur aryamaanthakur changed the base branch from main to feat-token-te2 August 13, 2024 06:25
Collaborator

@ChanderG ChanderG left a comment

LGTM

@ChanderG ChanderG merged commit 83ed89e into foundation-model-stack:feat-token-te2 Aug 16, 2024
1 check passed
2 participants