How to run on A100 40G? #31

Open
TopIdiot opened this issue Dec 20, 2024 · 2 comments

Comments

@TopIdiot

TopIdiot commented Dec 20, 2024

When I run ./test_compute ../config_all/llama3-8B/1024.json directly, I get "Got bad cuda status: out of memory at line: 27/root/Nanoflow/pipeline/src/vortexData.cu".
After changing model_configs.allocate_kv_data_batch to 100, I get Segmentation fault (core dumped). Lowering the values in pipeline_configs also ends in Segmentation fault (core dumped).

Are there any rules for how to configure this when running on different kinds of GPUs?
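
For what it's worth, a first sanity check before tuning the config is whether the weights plus KV cache can fit on the card at all. The sketch below queries free device memory with cudaMemGetInfo and divides the remainder by the per-token KV footprint. The Llama-3-8B shape numbers (32 layers, 8 KV heads, head dim 128) are from the published model card; the FP16 storage and the idea that allocate_kv_data_batch scales the KV pool linearly are assumptions on my part, not confirmed NanoFlow behavior.

```cpp
// Rough feasibility check: how many KV-cache tokens fit in free GPU memory?
// Assumptions (not confirmed against NanoFlow's code): Llama-3-8B shape
// (32 layers, 8 KV heads, head dim 128), FP16 KV storage, FP16 weights
// taking ~16 GiB, and allocate_kv_data_batch scaling the KV pool linearly.
#include <cuda_runtime.h>
#include <cstdio>

int main() {
    size_t free_bytes = 0, total_bytes = 0;
    cudaError_t st = cudaMemGetInfo(&free_bytes, &total_bytes);
    if (st != cudaSuccess) {
        std::fprintf(stderr, "cudaMemGetInfo failed: %s\n",
                     cudaGetErrorString(st));
        return 1;
    }

    // Per-token KV bytes = 2 (K and V) * layers * kv_heads * head_dim * 2 bytes (FP16).
    const size_t layers = 32, kv_heads = 8, head_dim = 128;
    const size_t bytes_per_token = 2 * layers * kv_heads * head_dim * 2;  // 128 KiB

    // Reserve room for the weights (8B params * 2 bytes ~= 16 GiB).
    const size_t weight_bytes = 16ull << 30;
    const size_t kv_budget = free_bytes > weight_bytes ? free_bytes - weight_bytes : 0;

    std::printf("free: %.1f GiB, total: %.1f GiB\n",
                free_bytes / 1073741824.0, total_bytes / 1073741824.0);
    std::printf("approx. KV tokens that fit after weights: %zu\n",
                kv_budget / bytes_per_token);
    return 0;
}
```

On a 40 GB A100 this leaves roughly 24 GiB for KV data after the weights, so if the shipped 1024.json was sized for 80 GB cards, allocate_kv_data_batch and the batch sizes in pipeline_configs would probably need to come down together; shrinking only one of them might explain the segfault if the other still indexes past the smaller KV pool.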

@durant1999

The same question...

@fangbaolei

Got bad cuda status: out of memory at line: 27/ai/zhiyi/w/multimodal/openbmb/Nanoflow/pipeline/src/vortexData.cu. A 4090 24G reports the same error.
