Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

No such file or directory: '/root/.cache/huggingface/datasets/squad_multitask/highlight_qg_format/1.0.0/dataset_info.json' #98

Open
tonyken12345 opened this issue Mar 24, 2022 · 5 comments

Comments

@tonyken12345
Copy link

When i do python prepare_data.py,Why run out of this message?

@binggoml
Copy link

binggoml commented Apr 8, 2022

me too,this code has many errors

@JingxinLee
Copy link

nlp.utils.info_utils.NonMatchingSplitsSizesError: [{'expected': SplitInfo(name='train', num_bytes=226286197, num_examples=253276, dataset_name='squad_multitask'), 'recorded': SplitInfo(name='train', num_bytes=226284739, num_examples=253275, dataset_name='squad_multitask')}]

@XiaohanBi-Hub
Copy link

nlp.utils.info_utils.NonMatchingSplitsSizesError: [{'expected': SplitInfo(name='train', num_bytes=226286197, num_examples=253276, dataset_name='squad_multitask'), 'recorded': SplitInfo(name='train', num_bytes=226284739, num_examples=253275, dataset_name='squad_multitask')}]

I am facing the same error, did u solved it dude?

@ZihaoLin0123
Copy link

If you check the file data/squad_multitask/dataset_infos.json, you will find:
"splits": {"train": {"name": "train", "num_bytes": 226286197, "num_examples": 253276,......

You can change "226286197" to "226284739" and change "253276" to "253275", then you can fix the error.

@William9Baker
Copy link

I am facing the same error, do you solved it now?

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants