-
Notifications
You must be signed in to change notification settings - Fork 5.7k
New issue
Have a question about this project? # for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “#”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? # to your account
使用RAG对话时,当询问几轮后,显存溢出,请问有人知道怎么释放对话历史吗?谢谢 #5226
Comments
@azhe1234 你好!我是Dosu,一个可以帮助你解决错误、回答问题并成为贡献者的机器人。在等待人类维护者时,我会尽力帮助你。 I found a similar closed issue regarding GPU memory overflow when using the Langchain-Chatchat repository. However, no solution was found in the comments of that issue [1]. For managing GPU memory during multi-turn conversations, you might want to consider the following:
Additionally, you can try using the To continue talking to Dosu, mention @dosu. Help Dosu learn! Give it feedback: Great Response | Irrelevant Answer | Incorrect Sources | Too Verbose | Hallucination | Other | Bug Report |
这个历史对话轮数是需要改代码吗,我的历史对话轮数不起作用 |
部署的是deepseek-r1:32b的模型,当进行两三轮对话时,显存一直增加,最后溢出,我在创建任务后,释放了内存,但是显存并没有被释放,请问有人知道怎么修改代码吗,谢谢!
The text was updated successfully, but these errors were encountered: