VideoMemory: Toward Consistent Video Generation via Memory Integration

Jinsong Zhou^1,3*, Yihua Du^1*, Xinli Xu^1*†, Luozhou Wang¹, Zijie Zhuang¹, Yehang Zhang¹, Shuaibo Li¹, Xiaojun Hu³, Bolan Su³, Ying-Cong Chen^1,2‡

¹HKUST(GZ) ²HKUST ³ByteDance

^*Equal Contribution ^†Project Lead ^‡Corresponding Author

Official implementation of VideoMemory: Toward Consistent Video Generation via Memory Integration.

VideoMemory is a multi-agent video generation framework built on LangGraph that automatically transforms screenplay text into coherent video content. By constructing a Visual Memory Bank to maintain consistency of characters, scenes, and props, it enables a high-quality automated video production pipeline.

🚩 Features

[✅] Multi-Agent Collaboration: Three-stage pipeline architecture (Storyboard → Memory → Visualization)
[✅] Visual Memory Bank: Automatically manages character, scene, and prop assets to ensure cross-shot visual consistency
[✅] Structured Output: Strict output control based on Pydantic Schema
[✅] Flexible Generation Backend: Supports Replicate (Nano-Banana) for image generation and Sora-2 for video generation

⚙️ Dependencies and Installation

We recommend using Python>=3.11 and uv package manager.

# Clone the repository
git clone https://github.com/your-username/VideoMemory.git
cd VideoMemory

# Create virtual environment and install dependencies using uv
uv sync
source .venv/bin/activate

Environment Variables

cp env.example .env

Edit the .env file with your API keys:

OPENAI_API_KEY=your_openai_api_key

# Generation API
REPLICATE_API_TOKEN=your_replicate_token

# LangSmith (Optional, for tracing)
LANGSMITH_API_KEY=your_langsmith_key
LANGSMITH_TRACING=true
LANGSMITH_PROJECT=VideoMemory

💫 Run

Prepare Scripts

Place screenplay files in the scripts/ directory following standard screenplay format.

Run the Pipeline

source .venv/bin/activate
python main.py

📚 Citation

If you find this project helpful in your research or applications, please cite it as follows:

@article{zhou2026videomemory,
  title={VideoMemory: Toward Consistent Video Generation via Memory Integration},
  author={Zhou, Jinsong and Du, Yihua and Xu, Xinli and Wang, Luozhou and Zhuang, Zijie and Zhang, Yehang and Li, Shuaibo and Hu, Xiaojun and Su, Bolan and Chen, Ying-cong},
  journal={arXiv preprint arXiv:2601.03655},
  year={2026}
}

📄 License

This project is licensed under the CC BY-NC-SA 4.0 (Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License).

The code is provided for academic research purposes only.

For any questions, please contact jzhou945@connect.hkust-gz.edu.cn

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
asset		asset
scripts		scripts
src		src
test		test
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
concat_videos.py		concat_videos.py
env.example		env.example
main.py		main.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

VideoMemory: Toward Consistent Video Generation via Memory Integration

🚩 Features

⚙️ Dependencies and Installation

Environment Variables

💫 Run

Prepare Scripts

Run the Pipeline

📚 Citation

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

VideoMemory: Toward Consistent Video Generation via Memory Integration

🚩 Features

⚙️ Dependencies and Installation

Environment Variables

💫 Run

Prepare Scripts

Run the Pipeline

📚 Citation

📄 License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages