-
Notifications
You must be signed in to change notification settings - Fork 43
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
Audioset DataPanel #229
Audioset DataPanel #229
Conversation
Codecov Report
@@ Coverage Diff @@
## dev #229 +/- ##
=======================================
Coverage 85.69% 85.69%
=======================================
Files 50 50
Lines 3174 3174
Branches 617 617
=======================================
Hits 2720 2720
Misses 335 335
Partials 119 119
Flags with carried forward coverage won't be shown. Click here to find out more.
Continue to review full report at Codecov.
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM!
* delete nn * Add support for loading train and test set in cifar10" (#193) * Fix issue where tensor columns can't be indexed with pandas series (#195) * Update cifar10 to support test set too (#196) * Fix bacckwards compat issue with base_dir and gcs_image_column (#197) * Support backwards compatibility with nn (#198) * Bump version (#199) * Update contributing to support new dev main structure (#203) * Add args, kwargs to ColumnIOMixin._read_data (#204) Co-authored-by: Jesse Vig <45317205+jessevig@users.noreply.github.com> * Fix from_huggingface and add tests (#205) closes #201 * allow_pickle=true when loading numpy block (#206) * Add downloader to ImageColumn (#207) * Remove default addition of index (#208) * Remove default addition of index * Fix provenance tests * Add DEW contrib to registry (#209) * Catch ConnectionResetError (#210) * Add inaturalist to contrib (#211) * Add inaturalist to contrib * Add annotations to intarualist * Fix issue where arraycolumns can't be saved with jsonlines (#214) * Update the docs and add user guide. (#215) * Add contrib for enron (#217) * Fix PIL attribute error on list column representation (#218) * mmap path bug fix (#219) * Downgrade pytorch dependency bound (#220) * Fix issue with subclassing datapanel _state_keys (#224) * Use multiple slices instead of pa.Table.take in ArrowBlock (#226) * Fix issue where boolean list can't index (#227) * Add support for AudioColumn (#222) * Add waterbirds (#228) * Add use guide to indexing and stubs for remaining sections (#225) * Docs/build fix (#230) * Bump version (#231) * Audioset DataPanel (#229) * Add the audioset dataset * Add AudioColumn to audioset datapanel * Fix issue where old datapanels didn't have formatter state (#233) * Make audioset datapanels relational (#235) * Add coco, mir, and pascal (#239) * Make write only write columns in datapanel (#240) * Enforce contiguous index in pandas columns (#244) * Fix issue where ray pickle fails on lazy loader (#245) * Add support for groupby operation * Reorganize the implementation of datasets (#246) * Add support for persistent configuration (#247) * Implement sort for data panel and columns (#237) * Add emb module (#249) * clusterby stuff * Add clusterby * clusterby stuff * Add clusterby * Add embed op (#248) * Autoformat Co-authored-by: Sam Randall <1billionmore@gmail.com> * Reorganize ops code (#250) * Update CI to include 3.9 and 3.10 and to drop 3.7 * Add sample (#251) * Update ci.yml * Add several HAPI datasets (#252) * Update styling of docs (#253) * Bump version (#254) * Remove fastbpe Co-authored-by: Karan Goel <kgoel93@gmail.com> Co-authored-by: Karan Goel <kgoel@cs.stanford.edu> Co-authored-by: Jesse Vig <45317205+jessevig@users.noreply.github.com> Co-authored-by: Khaled Saab <36782882+khaledsaab@users.noreply.github.com> Co-authored-by: Priya2698 <52657555+Priya2698@users.noreply.github.com> Co-authored-by: sam-randall <38796503+sam-randall@users.noreply.github.com> Co-authored-by: Hannah Kim <61199762+hannahkim24@users.noreply.github.com> Co-authored-by: Sam Randall <1billionmore@gmail.com>
Utility functions for loading AudioSet dataset into Meerkat DataPanel.