
[WIP] Integration with DeepLabCut 3.0 - PyTorch Engine #121


Open · wants to merge 24 commits into master

Conversation

n-poulsen
Collaborator

This pull request updates DeepLabCut-Live for models exported with DeepLabCut 3.0. TensorFlow models can still be used, and the code is siloed so that only the engine used to run the code is required as a package (i.e. no need to install TensorFlow if you want to run live pose estimation with PyTorch models).

If you want to give this PR a try, you can install the code in your local conda environment by running:

```shell
pip install "git+https://github.com/DeepLabCut/DeepLabCut-live.git@dlclive3"
```

Comment on lines +58 to +59
- Feb 2021: DeepLabCut-Live! was featured in **Nature Methods**: [
"Real-time behavioral analysis"](https://www.nature.com/articles/s41592-021-01072-z)


Suggested change:

```diff
-- Feb 2021: DeepLabCut-Live! was featured in **Nature Methods**: [
-"Real-time behavioral analysis"](https://www.nature.com/articles/s41592-021-01072-z)
+- Feb 2021: DeepLabCut-Live! was featured in **Nature Methods**:
+["Real-time behavioral analysis"](https://www.nature.com/articles/s41592-021-01072-z)
```

```python
resize=resize,
cropping=cropping,
dynamic=dynamic,
display=display,
max_detections=max_detections,
```


DLCLive's constructor doesn't have a max_detections argument

```python
iterator = range(n_frames) if (print_rate) or (display) else tqdm(range(n_frames))
for i in iterator:
iterator = range(n_frames)
if print_rate or display:
```


Shouldn't we check if has_tqdm here?
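A minimal sketch of what that guard could look like (the `make_iterator` helper and `has_tqdm` flag are hypothetical, not code from this PR): fall back to a plain `range` when per-frame output is requested or when `tqdm` isn't installed.

```python
def make_iterator(n_frames, print_rate, display, has_tqdm):
    """Return a plain range when per-frame printing/display is requested
    or tqdm is unavailable; otherwise wrap the range in a progress bar."""
    if print_rate or display or not has_tqdm:
        return range(n_frames)
    from tqdm import tqdm  # imported lazily, only when actually used
    return tqdm(range(n_frames))
```

This keeps the `has_tqdm` check in one place instead of repeating it at every loop site.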

```python
frame_width = int(cap.get(cv2.CAP_PROP_FRAME_WIDTH))
frame_height = int(cap.get(cv2.CAP_PROP_FRAME_HEIGHT))

vwriter = cv2.VideoWriter(
```


Let's name it vid_writer, as in benchmark()? I find vid_writer better.

@sneakers-the-rat
Collaborator

am only a peripheral maintainer here at this point, but i see that we have copied and pasted this whole directory? https://github.com/DeepLabCut/DeepLabCut/tree/main/deeplabcut/pose_estimation_pytorch

that sounds pretty fragile to me! if we're already copy/pasting it once, we might as well just make that a cut/paste into a separate deeplabcut-pytorch package that both deeplabcut and deeplabcut-live can depend on, right?

@MMathisLab
Member

Hey @sneakers-the-rat afaik it's not strictly the same though; this is the same reason we don't keep dlc-live in DeepLabCut main. We also used to have core, but it becomes yet another package to maintain and ultimately was more work for dev team. But I would leave it to @n-poulsen to give a better technical answer (as I haven't been deeply involved in this addition yet) ❤️ - thanks for your engagement!!!

@sneakers-the-rat
Collaborator

oh ya i don't see a reason to have dlc-live in main dlc, i would think that the dep graph would look something like this

```mermaid
flowchart TB
    deeplabcut --> dlc-pytorch
    deeplabcut --> dlc-tensorflow
    dlc-pytorch --> dlclibrary
    dlc-tensorflow --> dlclibrary
    dlc-live --> dlc-pytorch
    dlc-live --> dlc-tensorflow
    dlc-live --> dlclibrary
    deeplabcut --> dlclibrary
```

where dlclibrary has stuff that is boilerplate or super lightweight and common to all the packages (like exceptions, utility functions, and so on), and dlc-pytorch and dlc-tensorflow have all the framework-specific stuff and models in them. this would also give a single source of truth for dependency compatibility ranges, which i understand to be like an infinite problem. and then dlc-live and deeplabcut are the application layer that wraps the raw models etc. up into user-facing tools.

> We also used to have core, but it becomes yet another package to maintain and ultimately was more work for dev team

I believe y'all have your own processes and flow and etc. and don't mean to step on that. the thing that works is the thing that works. i would suspect that having multiple copies of what was initially the same code drift apart (or require active duplication as changes are made) would be more maintenance labor in all but the short term, but i also don't deal with tf or torch directly and am aware they introduce their own packaging nightmares wherever they go, so up to y'all ofc.

The thing that we did for the tensorflow stuff is just operate on serialized versions of the models, so the model code for tf isn't in the package at all - does torch not have a similar system for serializing and deserializing models? that worked out really well, and i partly credit that design decision with why this package has needed relatively little maintenance over its lifespan. that is a third way that would avoid a new package while also not requiring a copy/paste :)

@n-poulsen
Collaborator Author

Hi @sneakers-the-rat! Thanks for the input! When it comes to duplicating code, only the models from the deeplabcut/pose_estimation_pytorch folder were copied over. This is absolutely a sub-optimal solution, and requires maintenance when new models are added to DeepLabCut or bug fixes are made. This was done to have a prototype ready, but I'm still exploring better solutions. The issue is that:

- PyTorch recommends exporting the `state_dict` for the model, which requires the model class to run inference
- Serialization does exist with PyTorch through TorchScript, but
  1. TorchScript is no longer in active development
  2. It will be replaced by torch.export, but that is a prototype under active development and there WILL BE BREAKING CHANGES in the future. I've seen they want to use it to replace TorchScript in 2025, but this makes it harder for me to want to use it. It's also currently not well integrated with torchvision models, so object detectors cannot be exported. This will likely lead to issues with other operations we have in our models.
  3. TorchScript can have the same issue with exporting models (when some non-standard operations are used). I'd need to test all of our model architectures to see which ones break - this will just take a bit of time but should be feasible
  4. Not all pose models implemented for the PyTorch engine output heatmaps: to go from the model output to predicted pose, we may require different predictors. I want to experiment with exporting predictor functions alongside the models (so they are fully serializable, taking an image as input and outputting an array of shape (num_individuals, num_keypoints, 3) containing the predicted pose), but that may take some time as well. An example of a model returning a different type of output is RTMPose, which uses "coordinate classification".
- Another option would be to serialize through ONNX, but the same issue with the predictors remains. I'm exploring this as well.
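To make the predictor idea above concrete, here is a hedged sketch (the function name and pure-Python argmax are hypothetical, for illustration only) of the interface such an exported predictor could satisfy: a callable that decodes raw model output into a pose array of shape (num_individuals, num_keypoints, 3), i.e. (x, y, confidence) per keypoint.

```python
def heatmap_predictor(heatmaps):
    """Hypothetical heatmap predictor for a single individual: decode one
    2-D heatmap per keypoint into an (x, y, score) triple via a plain
    argmax. A real exported predictor would be serialized alongside the
    model and could implement other decoding schemes instead (e.g.
    RTMPose's coordinate classification)."""
    pose = []
    for hm in heatmaps:
        best, bx, by = float("-inf"), 0, 0
        for y, row in enumerate(hm):
            for x, v in enumerate(row):
                if v > best:
                    best, bx, by = v, x, y
        pose.append((float(bx), float(by), float(best)))
    # outer list = individuals, inner list = keypoints, triple = (x, y, score)
    return [pose]
```

Bundling such a callable with the serialized weights would make the export self-contained regardless of whether the backbone outputs heatmaps.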

If we can't find a good way to serialize models, it's true that the best solution may be to have a separate package containing the models and predictors - but that's something I still want to explore.

If you have any opinions on the best way to do this, I'm glad to get any other input!

@sneakers-the-rat
Collaborator

aha, that is extremely annoying that there is no good way to serialize pytorch models, or at least that serialization is in a transitional period right now.

I'm not gonna be maintaining it, and have said my piece re: splitting off into a separate package, so by all means! good to see this update getting ported over to dlc live, glhf with the rest of the impl, lmk if a review would be helpful <3
