Skip to content

SLR 500 Dataset #73

New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Open
5 of 13 tasks
cleong110 opened this issue Jun 10, 2024 · 1 comment · May be fixed by #111
Open
5 of 13 tasks

SLR 500 Dataset #73

cleong110 opened this issue Jun 10, 2024 · 1 comment · May be fixed by #111

Comments

@cleong110
Copy link
Contributor

cleong110 commented Jun 10, 2024

https://ustc-slr.github.io/datasets/2015_csl/
https://ieeexplore.ieee.org/document/8466903
Attention-Based 3D-CNNs for Large-Vocabulary Sign Language Recognition introduces it

Checklist for datasets:

  • Fork the repo
  • sync forks
  • git checkout master
  • git pull
  • New branch: dataset/something
  • Create a JSON along the lines of the schema below. e.g. FOO.json
  • Add the JSON to src/datasets. e.g. src/datasets/FOO.json
    • "language" field should not need "sign language". No need to say "American Sign Language", "American" will do.
    • Very concise "samples" field, the table does not have a lot of space to display it.
  • Add BibTex to src/references.bib.
    • prepend the citation key with dataset. e.g. dataset:sehyr2021asl
  • Commit/push the changes
  • Make a pull request!

Schema:

{
  "pub": {
    "name": string, # this gets used as the name of the dataset, e.g. "WLASL"
    "year": integer or null,
    "publication":string or null, # this matches a key in references.bib, e.g. "dataset:joshiISLTranslateDatasetTranslating2023"
    "url": string or null # URL to access it. e.g. "https://www.sign-lang.uni-hamburg.de/dgs-korpus/index.php/welcome.html"
  },
  "#loader": string or null, # the key you would use in the sign language datasets library. e.g. "dgs_corpus". Website will auto-link
  "#items": integer or null, # this is the number of unique signs in the column
  "#samples": string or null, # e.g. "1100 videos" or "8,257 Sentences"
  "#signers": integer or string or null, # number of unique signers
  "features": array of strings, ["feature1","feature2"], # I've seen things like "mouthing", "video:RGB", "pose:Kinect", "pose:OpenPose","text:Polish", "gloss:ASL", "writing:HamNoSys", etc.
  "language": string, # the Sign language or languages, e.g. "American" for American Sign Language (ASL)
  "license": string or null,
  "licenseUrl": string or null
}
@cleong110 cleong110 mentioned this issue Jun 10, 2024
10 tasks
@cleong110
Copy link
Contributor Author

cleong110 commented Jun 27, 2024

branch: cleong110:dataset/SLR500

@cleong110 cleong110 linked a pull request Jun 27, 2024 that will close this issue
# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant