
How to do data augmentation after introducing new characters, to meet fine-tuning needs #1

Open
yangpyoung opened this issue Sep 27, 2023 · 1 comment

Comments

@yangpyoung

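Hello. After adding a new novel, I want to fine-tune a local model (chatGLM2, baichuan2, etc.). Two questions. First, if I only care about dialogue for the new novel's characters, is it enough to mix the new novel's character dialogue data with general data in some proportion, without the existing 54K dialogue samples? Second, how was the data augmentation based on existing novel character dialogue done? Following the pipeline in the code, I extracted the non-protagonist dialogue and generated questions, but generating new dialogue from those questions requires configuring config.ini, and that configuration is not documented...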

@J1shen
Owner

J1shen commented Sep 27, 2023

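For fine-tuning, you only need to mix the new novel's character dialogue data with general data in proportion; the knowledge base should be the one built from the new novel. The data augmentation based on existing novel character dialogue happens in the dataset generation stage; run the same steps to generate the new dataset.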
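
As a rough illustration of the "mix in proportion" step, here is a minimal sketch in plain Python. It is not the repository's own code: the function name `mix_datasets`, the `general_fraction` parameter, and its default of 0.5 are all hypothetical, since the thread does not state which ratio was used.

```python
import random


def mix_datasets(role_samples, general_samples, general_fraction=0.5, seed=0):
    """Return a shuffled training list in which about `general_fraction`
    of the samples come from the general corpus (hypothetical helper)."""
    if not 0 <= general_fraction < 1:
        raise ValueError("general_fraction must be in [0, 1)")
    rng = random.Random(seed)
    # How many general samples are needed so they form the requested share.
    n_general = round(len(role_samples) * general_fraction / (1 - general_fraction))
    # Never ask for more general samples than are available.
    n_general = min(n_general, len(general_samples))
    mixed = list(role_samples) + rng.sample(list(general_samples), n_general)
    rng.shuffle(mixed)
    return mixed


# Toy data standing in for the new novel's dialogue and a general corpus.
role = [{"source": "role", "id": i} for i in range(30)]
general = [{"source": "general", "id": i} for i in range(100)]
train = mix_datasets(role, general, general_fraction=0.25)
```

All role samples are kept and the general corpus is subsampled, on the assumption that the new-character dialogue is the scarcer of the two. The right fraction is something to tune against held-out conversations.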
