Paper • 2606.23610 • Published • 10
Search is not available for this dataset
video video |
|---|
End of preview. Expand in Data Studio
Dataset for Vera: A Layered Diffusion Model for Content-Preserving Video Editing
Hongkai Zheng¹²* · Ta-Ying Cheng² · Benjamin Klein² · Yisong Yue² · Zhuoning Yuan²†
¹California Institute of Technology ²Netflix, Inc.
*Work done during an internship at Netflix †Project Lead
👁 Project Page
👁 Paper
👁 Dataset
👁 License
TL;DR: A layered diffusion framework for video editing. Vera jointly generates an edit layer, an alpha matte, and a composite video, separating what to generate from what to preserve.
Disclaimer: This is a research prototype, not an official product.
📋 Dataset Description
- Curated by: Hongkai Zheng, Ta-Ying Cheng, Benjamin Klein, Yisong Yue, Zhuoning Yuan
- License: Apache License 2.0
- Paper: Vera: A Layered Diffusion Model for Content-Preserving Video Editing
📦 Dataset Structure
🎞️ Splits — 49 Frames (3 sec)
Note: The current Vera models are trained on 49-frame sequences.
| Split | Edit Type | # Samples |
|---|---|---|
| train / 49-frames / realistic-set1-bg-change | background_replace | 914 |
| train / 49-frames / realistic-set1-obj-add | obj_add | 470 |
| train / 49-frames / realistic-set2-obj-add | obj_add | 770 |
| train / 49-frames / synthetic-bg-change | background_replace | 4,994 |
| train / 49-frames / synthetic-obj-add | obj_add | 4,848 |
| 49-Frame Train Total | 11,996 |
🎞️ Splits — 81 Frames (5 sec)
| Split | Edit Type | # Samples |
|---|---|---|
| train / 81-frames / realistic-set1-bg-change | background_replace | 457 |
| train / 81-frames / realistic-set1-obj-add | obj_add | 235 |
| train / 81-frames / realistic-set2-obj-add | obj_add | 385 |
| train / 81-frames / synthetic-bg-change | background_replace | 2,497 |
| train / 81-frames / synthetic-obj-add | obj_add | 2,431 |
| 81-Frame Train Total | 6,005 |
🧪 Test Splits
| Split | Edit Type | # Samples |
|---|---|---|
| test / bg-change | background_replace | 69 |
| test / obj-add | obj_add | 72 |
| Test Total | 141 |
🗂️ Data Sources
🏋️ Training Set
| Source | License |
|---|---|
| Pexels | Pexels License |
| Mixkit | Mixkit License |
| VideoMatte240K | MIT License |
🧪 Test Set
The test set is sourced from the training sources above, plus:
| Source | License |
|---|---|
| DAVIS | CC BY-NC 4.0 |
| VACEBench | Apache License 2.0 |
📝 Citation
@article{zheng2026vera,
title = {Vera: A Layered Diffusion Model for Content-Preserving Video Editing},
author = {Zheng, Hongkai and Cheng, Ta-Ying and Klein, Benjamin and Yue, Yisong and Yuan, Zhuoning},
journal = {arXiv preprint arXiv:2606.23610},
year = {2026}
}
- Downloads last month
- 10,124
