ViStoryBench introduces a comprehensive and diverse benchmark for story visualization, enabling thorough evaluation of models across narrative complexity, character consistency, and visual style.
- (Refining..) Release final code and complete project.
- [2025/06/02] Upload paper (arxiv).
- [2025/05/21] Init project and release code (semi-finished).
80 stories and 344 characters, both Chinese and English,
Each story included Plot Correspondence, Setting Description, Shot Perspective Design, Characters Appearing and Static Shot Description
Each character included at least one inference image and corresponding prompt description.
📥 Download our ViStory Datasets (🤗huggingface) and put it into your local path
Use our standardized loading script dataset_load.py or your own data loader
Based on dataset_load.py, convert ViStory/ViStory-lite dataset to your method's required input format (converted dataset will be saved to /data/dataset_processed/your_method_name/) and modify the inference code for story visualization (suggest generated results save to /data/outputs/your_method_name/).
Example of UNO:
Other Methods:
- adapt2seedstory.py,
- adapt2storyadapter.py,
- adapt2storydiffusion.py,
- adapt2storygen.py,
- adapt2vlogger.py,
- adapt2animdirector.py
Make sure your output results are organized according to the following folder structure:
- method_name: The model used (e.g., StoryDiffusion, UNO, GPT4o, etc.)
- dataset_name: The dataset used (e.g., ViStory_en)
- story_id: The story identifier (e.g., 01, 02, etc.)
- timestamp: Generation run timestamp (YYYYMMDD-HHMMSS)
- shot_XX.jpg: Generated image for the shot
When you run the evaluation code, it will automatically perform data reading (ensure both the ViStoryBench dataset and the generated results conform to the standard directory structure specified above). The generated-results reading code has been uniformly integrated into the following file: code/data_process/outputs_read/read_outputs.py
Example of UNO:
--cref --csd_cross --csd_self --aesthetic --prompt_align2 --diversity