# HyenaDNA Training Pipeline
A pipeline for training HyenaDNA models on genomic sequences using AWS SageMaker and HealthOmics.
## Setup
1. Install dependencies:
```bash
pip install -r requirements.txt
-
Configure AWS credentials and region
-
Open the notebook in SageMaker Studio:
- Navigate to notebooks/train_hyena_dna.ipynb
- Follow the instructions in the notebook
notebooks/
: Contains the main training notebookscripts/
: Helper modules for AWS sessions, data handling, and trainingsession_handler.py
: AWS/SageMaker session managementdata_handler.py
: HealthOmics data accesstraining_handler.py
: Training configuration and execution
See notebooks/train_hyena_dna.ipynb
for detailed usage instructions.