Jinyoung Park, Minseong Bae, Dohwan Ko, Hyunwoo J. Kim.

Official PyTorch implementation of "LLaMo: Large Language Model-based Molecular Graph Assistant" (NeurIPS 2024).

TODO:
- Release the code.
- Refactor the code to incorporate Hugging Face.
- Release the pre-trained Hugging Face model.
To install the requirements, run:

```bash
git clone https://github.com/mlvlab/LLaMo.git
cd LLaMo
conda create -n llamo python==3.9
conda activate llamo
pip install torch==2.3.1 torchvision==0.18.1 torchaudio==2.3.1 --index-url https://download.pytorch.org/whl/cu118
pip install -r requirements.txt
```
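After installation, a quick sanity check can confirm that the CUDA build of PyTorch is visible; this is a minimal sketch, assuming the cu118 wheels installed above:

```python
# Optional sanity check for the PyTorch install and GPU visibility.
import torch

print(torch.__version__)          # expected: 2.3.1+cu118 with the wheels above
print(torch.cuda.is_available())  # True if the CUDA 11.8 runtime is visible
print(torch.cuda.device_count())  # should cover the GPUs passed via --devices
```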
We use the pre-trained graph encoder checkpoint from the MoleculeSTM repository. Place the downloaded graph model in the `MoleculeSTM/` folder.
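To confirm the checkpoint is in place before training, a small check like the following can help; note that `molecule_model.pth` is a placeholder name, not necessarily the filename MoleculeSTM ships:

```python
# Hypothetical placement check; "molecule_model.pth" is a placeholder,
# substitute the actual filename from the MoleculeSTM release.
from pathlib import Path

import torch

ckpt_path = Path("MoleculeSTM/molecule_model.pth")
assert ckpt_path.exists(), f"graph encoder checkpoint not found at {ckpt_path}"

state_dict = torch.load(ckpt_path, map_location="cpu")
print(f"loaded checkpoint with {len(state_dict)} top-level entries")
```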
You can update the training configuration in the `config_file/` folder.
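If you want to inspect a stage config before launching a run, a minimal sketch (assuming PyYAML is available in the environment; the keys and values depend on the repo's YAML files):

```python
# Print the stage-1 training config referenced by the command below.
import yaml

with open("config_file/stage1.yaml") as f:
    cfg = yaml.safe_load(f)

for key, value in cfg.items():
    print(f"{key}: {value}")
```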
Stage 1 training:

```bash
python train.py --root_train 'data/MoleculeDesc/' --root_eval 'data/MoleculeDesc/' --devices '0,1,2,3' --filename "stage1" --max_epochs 3 --mode train --inference_batch_size 16 --batch_size 4 --config_file config_file/stage1.yaml --accumulate_grad_batches 4
```
Stage 2 training:

```bash
python train.py --root_train 'data/instruction_tuning/' --root_eval 'data/MoleculeDesc/' --devices '0,1,2,3' --filename "stage2" --max_epochs 3 --mode train --inference_batch_size 16 --batch_size 4 --config_file config_file/stage2.yaml --accumulate_grad_batches 4 --stage_path "./all_checkpoints/stage1/last.ckpt"
```
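Note that `--stage_path` warm-starts the stage 2 run from the stage 1 checkpoint; the path `./all_checkpoints/stage1/last.ckpt` is consistent with checkpoints being written under `all_checkpoints/<filename>/` by the `--filename "stage1"` run above.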
If you have any questions, please create an issue in this repository or contact us at lpmn678@korea.ac.kr.
If you find our work interesting, please consider giving it a ⭐ and citing our paper:
```bibtex
@inproceedings{park2024llamo,
  title={LLaMo: Large Language Model-based Molecular Graph Assistant},
  author={Park, Jinyoung and Bae, Minseong and Ko, Dohwan and Kim, Hyunwoo J},
  booktitle={NeurIPS},
  year={2024}
}
```