GitHub - HeartWise-AI/Orion: Orion

Orion Video Training and Evaluation

Orion is a Python-based application designed for video training and evaluation tasks. It supports both regression and classification tasks and can be used with a variety of models.

Setup

Ensure you have Python 3.10 or later installed.
Clone the repository and navigate to the root directory.

git clone https://github.com/HeartWise-AI/Orion.git
cd Orion

Create a new environment and install the dependencies using uv.

python -m venv orion-venv
source orion-venv/bin/activate
pip install --upgrade pip
pip install uv
uv pip install -r requirements.txt

Ensure you have an account on wandb.ai and that you've logged in using wandb login

Usage

You can use Orion either through a Python script

python run_sweep.py
python run_inference.py

or build your own Jupyter notebook inspired by code samples in the notebooks/ folder.

Configuration file example

See notebooks/config/ for different config.YAML file examples. Here are some key configuration parameters:

[... keep the existing parameter descriptions ...]

Model-Specific Configurations

Different video models have specific requirements and recommendations for optimal performance. The table below summarizes the recommended frame sizes and learning rates for supported models:

Model	Recommended Frame Sizes	Recommended Learning Rate
x3d	Multiples of 8	1e-3 to 1e-4
swin3d	24 or 32	1e-4 to 1e-5
mvit	16	3e-5 (0.00003)

Additional Model-Specific Notes:

x3d:
- Supports various frame counts as long as they are multiples of 8.
- The x3d_m variant specifically supports video sizes of either 224x224 or 256x256.
swin3d:
- Only supports 24 or 32 frames.
- Ensure your configuration matches one of these frame counts.
mvit:
- Strictly requires 16 frames.
- Adjust your data preprocessing to match this requirement.

When configuring your model in the YAML file, ensure that the frames and resize parameters align with these recommendations. The lr (learning rate) parameter should be set within the recommended range for best results.

Using a Python Script

Prepare your configuration file in YAML format.

Run the main script with torchrun:

torchrun --standalone --nnodes=1 --nproc-per-node=2 orion/utils/video_training_and_eval.py --config_path=notebooks/config/config.yaml

This command runs the script in a distributed manner with 2 GPU processes on a single node.

Using a Jupyter Notebook

Prepare your configuration file in YAML format.
In your Jupyter notebook, import necessary modules and set up environment variables.
Define a class for your command-line arguments.
Load your configuration, create transforms, initialize a wandb run, and run the main process.

Using Weights & Biases Sweeps

Prepare your configuration and sweep configuration files in YAML format.
Run the run_sweep.py script to start the sweep agent (no argument needed - update the sweep_conf_file_path variable in the script to point to your sweep configuration file).
```
python run_sweep.py
```

Inference and Evaluation

To run inference and evaluation, use the run_inference.py script.

python run_inference.py --config_path notebooks/config/config_x3d_multi_output_classification_eval.yaml --splits inference --model_path outputs/outputs_folder_id/best.pt

The other arguments are optional: the data path (usually defined in the config file), the output directory (usually defined in the config file), the wandb id & resume flag (both not needed for inference).

Note

This application uses distributed training, so it is designed to be run on multiple GPUs. If you are running it on a machine with a single GPU or CPU, you may need to modify the code accordingly.

Validation of Model Configurations

The build_model function in video_training_and_eval.py includes checks to ensure that the frame and resize parameters in your configuration file are compatible with the chosen model. If these parameters don't meet the model-specific requirements, the function will raise a ValueError with an appropriate error message.

For example:

For swin3d models, it checks if the frame count is either 24 or 32.
For x3d models, it verifies that the frame count is a multiple of 8.
For x3d_m specifically, it checks if the resize value is either 224 or 256.
For mvit models, it ensures the frame count is exactly 16.

These checks help prevent configuration errors and ensure that your model is set up correctly for training or inference.

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
notebooks		notebooks
orion		orion
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
README.md		README.md
dockerfile		dockerfile
poetry.lock		poetry.lock
precommit-hook.yaml		precommit-hook.yaml
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
run_inference.py		run_inference.py
run_sweep.py		run_sweep.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Orion Video Training and Evaluation

Setup

Usage

Configuration file example

Model-Specific Configurations

Additional Model-Specific Notes:

Using a Python Script

Using a Jupyter Notebook

Using Weights & Biases Sweeps

Inference and Evaluation

Note

Validation of Model Configurations

About

Releases

Packages

Contributors 5

Languages

HeartWise-AI/Orion

Folders and files

Latest commit

History

Repository files navigation

Orion Video Training and Evaluation

Setup

Usage

Configuration file example

Model-Specific Configurations

Additional Model-Specific Notes:

Using a Python Script

Using a Jupyter Notebook

Using Weights & Biases Sweeps

Inference and Evaluation

Note

Validation of Model Configurations

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Languages

Packages