ImagePulse project aims to provide dataset support for the next generation of image understanding and generation models, by atomizing the capabilities of these models and constructing atomic capability datasets.
- Dataset: https://www.modelscope.cn/datasets/DiffSynth-Studio/ImagePulse-ChangeAddRemove
- Dataset Construction Script: ./scripts/change_add_remove.py
- Dataset: https://www.modelscope.cn/datasets/DiffSynth-Studio/ImagePulse-ZoominZoomout
- Dataset Construction Script: ./scripts/zoomin_zoomout.py
- Dataset: https://www.modelscope.cn/datasets/DiffSynth-Studio/ImagePulse-StyleTransfer
- Dataset Construction Script: ./scripts/style_transfer.py
- Dataset: https://www.modelscope.cn/datasets/DiffSynth-Studio/ImagePulse-FaceID
- Dataset Construction Script: ./scripts/faceid.py
pip install -r requirements.txt
python change_add_remove.py \
--target_dir "data/dataset" \
--cache_dir "data/cache" \
--dashscope_api_key "sk-xxxxxxxxxxxxxxxx" \
--qwenvl_model_id "qwen-vl-max" \
--modelscope_access_token "xxxxxxxxxxxxxxx" \
--modelscope_dataset_id "DiffSynth-Studio/ImagePulse-ChangeAddRemove" \
--num_data 1000000 \
--max_num_files_per_folder 1000
target_dir
: Path to store the datasetcache_dir
: Cache pathdashscope_api_key
: DashScope API Key, required when calling DashScope APIqwenvl_model_id
: ID of the Qwen-VL model on DashScope, required when calling DashScope APImodelscope_access_token
: Access token from ModelScope, required when uploading datasets to ModelScopemodelscope_dataset_id
: Dataset ID on ModelScope, required when uploading datasets to ModelScopenum_data
: Total number of data samplesmax_num_files_per_folder
: Number of files per packaged folder
- DiffSynth-Studio: Provided Diffusion model inference support for this project
- ModelScope: Provided storage and download support for models and datasets in this project
- DashScope: Provided inference API support for large language models in this project