
GPU isn't used #2506

Closed

abrahimzaman360 opened this issue Feb 5, 2024 · 5 comments
Labels: bug (Something isn't working)

Comments

@abrahimzaman360

GPU is not utilized during the process!

@abrahimzaman360 abrahimzaman360 added the bug Something isn't working label Feb 5, 2024
@muazhari

muazhari commented Apr 19, 2024

NEED IT TOO!!!

@javier-cohere

javier-cohere commented Apr 26, 2024

Same here. Running in Colab and getting the warning that the GPU is not being utilised.

@hahazei

hahazei commented Apr 30, 2024

I need it too.

@javier-cohere

javier-cohere commented Apr 30, 2024

I did a bit of investigation/debugging and here is what I learned. From what I can see, there are two types of layout detection models in Unstructured:

  • Models that run with ONNXRuntime: YoloX, Detectron_ONNX, and several others use this method.
  • Native models: Detectron2, whose weights are downloaded from HF and loaded into memory.

For the models that run with ONNXRuntime

ONNXRuntime has a series of providers available that it uses to run inference. In order to use the GPU, the TensorrtExecutionProvider and CUDAExecutionProvider need to be available. You can check this by adding the following to your code:

import logging

from onnxruntime.capi import _pybind_state as C

logger = logging.getLogger(__name__)
# The public onnxruntime.get_available_providers() returns the same list.
logger.info(f"Available ONNXRT providers: {C.get_available_providers()}")

In my case, I was getting Available ONNXRT providers: ['AzureExecutionProvider', 'CPUExecutionProvider'], which means that GPU wasn't being used.

To utilise the GPU, you need to install onnxruntime-gpu:

  • pip install onnxruntime-gpu if your CUDA drivers are <12.
  • pip install onnxruntime-gpu --extra-index-url https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/onnxruntime-cuda-12/pypi/simple/ otherwise.
    See https://onnxruntime.ai/docs/install/#python-installs

After installing this library, I could see ['TensorrtExecutionProvider', 'CUDAExecutionProvider', 'CPUExecutionProvider']. I think ONNXRT uses the providers in order of preference, so first it will try to use Tensorrt, then CUDA.
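
If you'd rather not rely on the implicit ordering, ONNX Runtime also lets you pass an explicit provider list when creating a session. Here's a minimal sketch (the "model.onnx" path is a placeholder, not a file shipped by Unstructured):

import onnxruntime as ort

# Providers are tried in the order given; the session falls back to the next
# entry if a provider isn't available on the machine.
session = ort.InferenceSession(
    "model.onnx",  # placeholder: path to any ONNX model
    providers=[
        "TensorrtExecutionProvider",
        "CUDAExecutionProvider",
        "CPUExecutionProvider",
    ],
)
print(session.get_providers())  # the providers the session actually resolved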

For models that do not use ONNXRT

In the case of Detectron2, I could verify in the Unstructured code that the detectron2 model does not correctly receive the device parameter it needs to use CUDA. This can be worked around by leveraging the fact that unstructured first tries to load the model config from the environment variable UNSTRUCTURED_DEFAULT_MODEL_INITIALIZE_PARAMS_JSON_PATH, and only falls back to the default model config if it is not set. See https://github.com/Unstructured-IO/unstructured-inference/blob/main/unstructured_inference/models/base.py#L67

You can load the default Detectron2 model config in your code, add device: "cuda", dump it into a temporary file, and point unstructured at it with os.environ["UNSTRUCTURED_DEFAULT_MODEL_INITIALIZE_PARAMS_JSON_PATH"] = "your_config_file".
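
Here is a minimal sketch of that workaround. The params dict below is a stand-in (device is the only field we actually care about here); copy the remaining fields from the real default Detectron2 config in your installed unstructured-inference version:

import json
import os
import tempfile

# Stand-in params for illustration only; start from the actual default
# Detectron2 config shipped with your unstructured-inference install.
params = {
    "device": "cuda",  # the key addition: tell the model to run on the GPU
}

# Dump the edited config to a temporary file...
with tempfile.NamedTemporaryFile(mode="w", suffix=".json", delete=False) as f:
    json.dump(params, f)

# ...and point unstructured at it before any model is loaded.
os.environ["UNSTRUCTURED_DEFAULT_MODEL_INITIALIZE_PARAMS_JSON_PATH"] = f.name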

However, I do not recommend this approach, since Detectron2 already has an ONNX flavour and you don't need any of this to use it. Moreover, YoloX works better as a layout model.

@MthwRobinson
Contributor

Thanks for the write-up @javier-cohere! The detectron2 model @javier-cohere mentioned at the end is no longer supported as of 0.14.1. If you'd still like to use detectron2, you can use the ONNX version, though we recommend yolox.
