adding packed bert from optimum-main #71

arsalanu · 2023-03-09T13:21:27Z

Adding PackedBERT notebooks/ models/ utils folder into Paperspace from HF Optimum:

------ copied from the original PR description for Hugging Face Optimum Graphcore (now merged):

Contents:
Simplified notebooks for all three supported Packed BERT tasks for easy implementation
Adds all of the necessary utils/model heads imported into notebooks - preprocessing, postprocessing, model changes

Notes: For the time being, the models/ and utils/ are in this folder that goes into notebooks/ but ideally it would be nice to have the utils put into optimum/graphcore/ so they could be easily importable with the package - and the models/modeling_bert_packed.py could just be options within the default modeling_bert, and packing could be enabled through the AutoConfig (some tweaks would be needed for that, but nothing extensive) This also gives us a structure to add future packing tasks/notebooks

Fixes
I've removed the model classes and packing algorithm/dataset creation utils from the notebooks, noted that they were too complex and large as notebook code blocks requiring too much explanation and would be hard to maintain here. The intention of these notebooks is to give brief explanations of the differences between unpacked and packed at each stage and allows users to easily implement it using the importable methods.
A more in depth explanation of the packing/preproc/postproc/model change process will get its own notebooks/blog in future so we don't need to cover it for this notebook
I've used the env variables for pod type and executable dir
Rewritten most of these notebooks to not be as detailed/complex and use more active language - some of it is copied from existing notebooks for the same (unpacked) tasks - happy to change stuff

…erge

…eadjusted paths in notebooks

arsalanu · 2023-03-21T12:24:11Z

I think this is ready

arsalanu added 4 commits March 9, 2023 13:15

adding packed bert from optimum-main

1c39b2f

link fixes and images

32603da

updating notebook tests

c2041d1

removing notebook-saved local path to fix test

b08c363

anjleeg-gcai marked this pull request as draft March 10, 2023 11:48

arsalanu added 6 commits March 16, 2023 15:06

Updating with batched inference pipeline update from latest Optimum m…

91c74f9

…erge

changing default checkpoint for ml seq cls for tests

a5405e9

name change for var in comment

a308ad1

commenting push to hub

b1156ae

changing checkpoint directory to env var

0d76256

updated .gradient files with symlinks and prepare-datasets config + r…

a676a13

…eadjusted paths in notebooks

Ian Hales and others added 2 commits March 23, 2023 15:48

Update environment vars to match consistency PR.

f96eb96

Merge branch 'main' into packed-bert

0a35019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

adding packed bert from optimum-main #71

adding packed bert from optimum-main #71

arsalanu commented Mar 9, 2023

arsalanu commented Mar 21, 2023

adding packed bert from optimum-main #71

Are you sure you want to change the base?

adding packed bert from optimum-main #71

Conversation

arsalanu commented Mar 9, 2023

arsalanu commented Mar 21, 2023