ImagePulse-图律脉动

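The ImagePulse (图律脉动) project aims to provide dataset support for the next generation of image understanding and generation models. It breaks model capabilities down into atomic capabilities and builds a dataset for each one.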


Atomic Capability Datasets

1. Change, Add, Remove

| image_1 | image_2 | mask | editing_instruction | reverse_editing_instruction |
| --- | --- | --- | --- | --- |
| (image) | (image) | (image) | Remove the mustache and beard, change the white shirt to a blue turtleneck sweater, and remove the glass of milk. | Add a mustache and beard, change the blue turtleneck sweater to a white shirt, and add a glass of milk. |
| (image) | (image) | (image) | Add a silver butterfly to the glowing golden lace on her face. | Remove the silver butterfly from the glowing golden lace on her face. |
| (image) | (image) | (image) | Remove the necklace. | Add a necklace. |
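
Each sample pairs a forward instruction with its reverse, so one record can be read in either direction. The sketch below illustrates that idea only. The field names follow the table header, while the `EditPair` class, the `reversed_pair` helper, and the file paths are hypothetical and are not taken from the project's code. It also assumes the same mask applies in both directions.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class EditPair:
    """One sample; field names follow the table header (class itself is hypothetical)."""
    image_1: str  # path to the source image
    image_2: str  # path to the edited image
    mask: str     # path to the mask of the edited region
    editing_instruction: str          # turns image_1 into image_2
    reverse_editing_instruction: str  # turns image_2 back into image_1


def reversed_pair(pair: EditPair) -> EditPair:
    """Swap the images and the instructions to get the opposite editing direction."""
    return replace(
        pair,
        image_1=pair.image_2,
        image_2=pair.image_1,
        editing_instruction=pair.reverse_editing_instruction,
        reverse_editing_instruction=pair.editing_instruction,
    )


# Hypothetical paths; the instructions are copied from the last table row.
sample = EditPair(
    image_1="example/image_1.jpg",
    image_2="example/image_2.jpg",
    mask="example/mask.png",
    editing_instruction="Remove the necklace.",
    reverse_editing_instruction="Add a necklace.",
)

flipped = reversed_pair(sample)
print(flipped.editing_instruction)  # prints "Add a necklace."
```

Reversing a pair twice returns the original record, which makes it cheap to double the number of training directions without storing extra images.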

2. Zoom In, Zoom Out

| image_1 | image_2 | image_cropped | mask | editing_instruction | reverse_editing_instruction |
| --- | --- | --- | --- | --- | --- |
| (image) | (image) | (image) | (image) | Zoom in to focus on the headband. | Zoom out to show the full view of the anime girl. |
| (image) | (image) | (image) | (image) | Remove the superhero costume and replace it with a red shirt. Adjust the lighting to highlight the man's face. | Add a superhero costume with a red and yellow emblem on the chest and a red cape. Adjust the lighting to emphasize the costume. |
| (image) | (image) | (image) | (image) | Remove the elephant and replace it with a large rock. | Replace the large rock with an elephant. |

3. Style Transfer

| image_1 | image_2 | image_3 | image_4 | editing_instruction | reverse_editing_instruction |
| --- | --- | --- | --- | --- | --- |
| (image) | (image) | (image) | (image) | transform the image into a cartoon style with vibrant colors and a confident expression. | transform the image into a realistic portrait with a serious expression and subtle lighting. |
| (image) | (image) | (image) | (image) | transform the image to have a brighter, more colorful palette and a clear blue sky. | transform the image to have a more muted color palette and an overcast sky. |
| (image) | (image) | (image) | (image) | transform the style of the image to an anime illustration, change the jacket to red, and add a cityscape background. | transform the style of the image to a digital painting, change the jacket to black, and remove the cityscape background. |

4. Face Preservation

| image_face | image_1 | image_2 | editing_instruction | reverse_editing_instruction |
| --- | --- | --- | --- | --- |
| (image) | (image) | (image) | Change the woman's white t-shirt to a white tank top. | Change the woman's white tank top to a white t-shirt. |
| (image) | (image) | (image) | Add a nighttime street scene with bokeh lights in the background. | Remove the nighttime street scene and bokeh lights from the background. |
| (image) | (image) | (image) | Change the background to a warmly lit room with lamps, change the suit to maroon, and add a sweater under the suit. | Change the background to a dimly lit room with red lighting, change the suit to black, and remove the sweater. |

Running Dataset Generation

python change_add_remove.py \
  --target_dir "data/dataset" \
  --cache_dir "data/cache" \
  --dashscope_api_key "sk-xxxxxxxxxxxxxxxx" \
  --qwenvl_model_id "qwen-vl-max" \
  --modelscope_access_token "xxxxxxxxxxxxxxx" \
  --modelscope_dataset_id "DiffSynth-Studio/ImagePulse-ChangeAddRemove" \
  --num_data 1000000 \
  --max_num_files_per_folder 1000
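  • target_dir: Path where the dataset is stored
  • cache_dir: Cache path
  • dashscope_api_key: Bailian API key, required when calling the Bailian API
  • qwenvl_model_id: ID of the Qwen-VL model on Bailian, required when calling the Bailian API
  • modelscope_access_token: ModelScope access token, required when uploading the dataset to ModelScope
  • modelscope_dataset_id: ModelScope dataset ID, required when uploading the dataset to ModelScope
  • num_data: Total number of data samples
  • max_num_files_per_folder: Number of files in each packed archive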
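
The command above passes the API key and access token as literal arguments. One way to keep them out of shell history and scripts is to read them from environment variables and assemble the argument list in Python. The following is a minimal sketch under stated assumptions: the flags and default values are the ones shown in the command, while the `build_command` helper and the environment variable names `DASHSCOPE_API_KEY` and `MODELSCOPE_ACCESS_TOKEN` are hypothetical choices made for this example.

```python
import os
import shlex
import subprocess
from typing import List, Mapping


def build_command(
    env: Mapping[str, str],
    num_data: int = 1000000,
    max_num_files_per_folder: int = 1000,
) -> List[str]:
    """Assemble the argument list for change_add_remove.py.

    Secrets come from `env`; the variable names are assumptions of this sketch.
    """
    return [
        "python", "change_add_remove.py",
        "--target_dir", "data/dataset",
        "--cache_dir", "data/cache",
        "--dashscope_api_key", env["DASHSCOPE_API_KEY"],
        "--qwenvl_model_id", "qwen-vl-max",
        "--modelscope_access_token", env["MODELSCOPE_ACCESS_TOKEN"],
        "--modelscope_dataset_id", "DiffSynth-Studio/ImagePulse-ChangeAddRemove",
        "--num_data", str(num_data),
        "--max_num_files_per_folder", str(max_num_files_per_folder),
    ]


def main() -> None:
    """Run the generation script with secrets taken from the real environment."""
    subprocess.run(build_command(os.environ), check=True)


# Dry run with placeholder values: show the command without executing it.
demo_env = {
    "DASHSCOPE_API_KEY": "sk-xxxxxxxxxxxxxxxx",
    "MODELSCOPE_ACCESS_TOKEN": "xxxxxxxxxxxxxxx",
}
print(shlex.join(build_command(demo_env)))
```

Calling `main()` launches the script for real and raises `KeyError` if either variable is unset, which fails early instead of sending an empty credential.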

Acknowledgments

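  • DiffSynth-Studio: provides Diffusion model inference support for this project
  • ModelScope: provides storage and download support for the project's models and datasets
  • Bailian: provides the large language model inference API for this project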