
When chatting with RAG, GPU memory overflows after a few rounds of questions. Does anyone know how to release the conversation history? Thanks #5226

Open
azhe1234 opened this issue Feb 13, 2025 · 5 comments

Comments

@azhe1234

[screenshot]
I deployed the deepseek-r1:32b model. After two or three rounds of conversation, GPU memory keeps growing until it overflows. I release memory after creating the task, but the GPU memory is not freed. Does anyone know how to modify the code? Thanks!

[screenshot]


dosubot bot commented Feb 13, 2025

Hello @azhe1234! I'm Dosu, a bot that can help you fix bugs, answer questions, and become a contributor. While you wait for a human maintainer, I'll do my best to help.

I found a similar closed issue regarding GPU memory overflow when using the Langchain-Chatchat repository. However, no solution was found in the comments of that issue [1].

For managing GPU memory during multi-turn conversations, you might want to consider the following:

  1. PagedAttention and FlashAttention: These techniques help optimize memory usage by reducing memory waste and avoiding large intermediate attention matrices [2].

  2. Quantized KV Cache and Model Parallelism: These methods can help reduce memory usage and accelerate computation [2].

  3. Dynamic Batching and Inflight Batching: These strategies improve GPU utilization by adjusting batch sizes dynamically and scheduling at the step level [2].
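Since the issue title asks how to release the conversation history, the most direct fix is to cap the history passed to the model at the last N turns so the prompt (and the KV cache it produces) stops growing without bound. The sketch below is a minimal, illustrative example; the message format and function name are assumptions, not Langchain-Chatchat's actual API:

```python
def trim_history(history, max_turns=3):
    """Keep only the most recent `max_turns` (user, assistant) message pairs.

    `history` is assumed to be an OpenAI-style list of
    {"role": ..., "content": ...} dicts in chronological order.
    """
    if max_turns <= 0:
        return []
    # Each turn is one user message plus one assistant message.
    return history[-2 * max_turns:]

# Example: a 10-message history trimmed to the last 3 turns (6 messages).
history = [
    {"role": "user" if i % 2 == 0 else "assistant", "content": f"msg {i}"}
    for i in range(10)
]
trimmed = trim_history(history, max_turns=3)
# trimmed keeps only messages 4 through 9.
```

Applying a cap like this before every model call bounds prompt length, which in turn bounds per-request GPU memory growth regardless of how long the session runs.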

Additionally, you can call torch.cuda.empty_cache() and torch.cuda.ipc_collect() in your script to clear cached GPU memory, as shown in your screenshot. Note that empty_cache() only releases memory that is no longer referenced by live tensors, so references to old conversation tensors must be dropped first. If these methods are not effective, you may need to explore more advanced memory management techniques or further optimize the model's memory usage.
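The two calls above can be combined into a small helper. This is a sketch, not a guaranteed fix: empty_cache() returns only memory the allocator has cached for tensors that are no longer referenced, so it must run after old references are dropped. The import guard is just to keep the snippet self-contained on machines without torch:

```python
import gc


def free_gpu_memory():
    """Try to release cached GPU memory after dropping stale references.

    Returns True if CUDA caches were cleared, False if torch is missing
    or no CUDA device is available.
    """
    gc.collect()  # drop unreachable Python objects (and their tensors) first
    try:
        import torch
        if torch.cuda.is_available():
            torch.cuda.empty_cache()  # return cached allocator blocks to the driver
            torch.cuda.ipc_collect()  # reclaim memory held by dead IPC handles
            return True
    except ImportError:
        pass  # torch not installed; nothing GPU-side to clear
    return False
```

Calling this after each completed turn (together with trimming the history) is a reasonable first mitigation before reaching for serving-level techniques like paged KV caches.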

To continue talking to Dosu, mention @dosu.



@wanmuchun0

Does changing the number of history turns require code changes? My history-turns setting has no effect.

@wanmuchun0

> I deployed the deepseek-r1:32b model. After two or three rounds of conversation, GPU memory keeps growing until it overflows...

Hey, how did you get the conversation history working? Mine doesn't work at all.

@azhe1234
Author

> Hey, how did you get the conversation history working? Mine doesn't work at all.

Mine doesn't work either; history is empty. I'm still trying to modify the code.

@wanmuchun0

> Mine doesn't work either; history is empty. I'm still trying to modify the code.

Hey, did you get it fixed?
