Skip to content
New issue

Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? # to your account

Data cleaning, preprocessing, transformation, and tokenization #37

Open
sxaxmz opened this issue May 26, 2024 · 0 comments
Open

Data cleaning, preprocessing, transformation, and tokenization #37

sxaxmz opened this issue May 26, 2024 · 0 comments

Comments

@sxaxmz
Copy link

sxaxmz commented May 26, 2024

Hi, great work. I have a few questions as it seems that it was unclear how the data was preprocessed, cleaned, transformed, and tokenized for the purpose of fine-tuning, could you please elaborate or provide more information. Thanks.

# for free to join this conversation on GitHub. Already have an account? # to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant