Dataset preview (32 rows, all from one camera_control shard; truncated values are shown as in the viewer)
| Column | Type | Viewer statistics |
|---|---|---|
| json | dict | |
| 0.jpg | image | width 1.15k px |
| 1.jpg | image | width 1.15k px |
| __key__ | string | lengths 29 to 106 |
| __url__ | string | 224 classes |
First row:
{
"SAMPLE_ID": "6d91b468-1fb5-4a04-94a7-0ea368b2ea1a",
"conversations": [
{
"from": "human",
"value": "<image>\nMove the camera.\n- Camera rotation: Yaw 45.0°, Pitch 0.0°.\n- Camera zoom: unchanged.\n- Keep the 3D scene static; only change the viewpoint."
},
{
"from": "gpt",
"val...
__key__: 373a48a9-a38d-4a16-b269-27063281b7c1_asset_type_scene_Plane_098_Y_00001361
__url__: hf://datasets/EasonXiao-888/SpatialEdit-500K@e54a9c870f98487d63f789a5e1bde4d35937d227/camera_control/373a48a9-a38d-4a16-b269-27063281b7c1_asset_type_scene_Y-shard-000034.tar
The remaining 31 rows (keys ending 00001362 through 00001392) come from the same shard and use the same prompt template, differing in SAMPLE_ID, key suffix, and yaw (±45.0°, ±90.0°, ±135.0°, or 180.0°); pitch is 0.0° and zoom is unchanged in every row.
SpatialEdit-500K
SpatialEdit-500K is a synthetic training dataset for fine-grained image spatial editing. It is designed for learning geometry-aware edits such as moving an object, rotating an object, and changing the camera viewpoint.
The dataset was introduced in the paper "SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing" (arXiv:2604.04911). It is generated with a controllable rendering pipeline to provide structured spatial transformations at scale.
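The preview's column names (json, 0.jpg, 1.jpg, __key__, __url__) and the .tar shard URLs suggest a WebDataset-style layout, in which each sample is a group of tar members sharing a key. The sketch below shows how such members could be grouped using only the Python standard library. The member names inside the real shards are an assumption here, and the demo shard, its key, and its placeholder bytes are hypothetical.

```python
from __future__ import annotations

import io
import json
import tarfile
from collections import defaultdict


def split_key_ext(member_name: str) -> tuple[str, str]:
    """Split 'dir/key.ext' at the first dot of the basename (WebDataset convention)."""
    directory, _, base = member_name.rpartition("/")
    key, _, ext = base.partition(".")
    prefix = f"{directory}/" if directory else ""
    return prefix + key, ext


def group_samples(tar_bytes: bytes) -> dict[str, dict[str, bytes]]:
    """Group tar members into {key: {extension: raw bytes}}."""
    samples: dict[str, dict[str, bytes]] = defaultdict(dict)
    with tarfile.open(fileobj=io.BytesIO(tar_bytes), mode="r") as tar:
        for member in tar:
            if not member.isfile():
                continue
            key, ext = split_key_ext(member.name)
            samples[key][ext] = tar.extractfile(member).read()
    return dict(samples)


def make_demo_shard() -> bytes:
    """Build a tiny in-memory shard; names and payloads are hypothetical."""
    meta = {
        "SAMPLE_ID": "demo",
        "conversations": [{"from": "human", "value": "<image>\nMove the camera."}],
    }
    files = {
        "demo_00000001.json": json.dumps(meta).encode("utf-8"),
        "demo_00000001.0.jpg": b"placeholder-0",  # not a real JPEG
        "demo_00000001.1.jpg": b"placeholder-1",  # not a real JPEG
    }
    buffer = io.BytesIO()
    with tarfile.open(fileobj=buffer, mode="w") as tar:
        for name, payload in files.items():
            info = tarfile.TarInfo(name=name)
            info.size = len(payload)
            tar.addfile(info, io.BytesIO(payload))
    return buffer.getvalue()


if __name__ == "__main__":
    for key, fields in group_samples(make_demo_shard()).items():
        print(key, sorted(fields))
```

Splitting at the first dot of the basename keeps multi-part extensions such as 0.jpg and 1.jpg intact, which matches the column names shown in the preview.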
Project Resources
- GitHub Repository: EasonXiao-888/SpatialEdit
- Model: SpatialEdit-16B
- Benchmark: SpatialEdit-Bench
Highlights
- Large-scale synthetic data: 500,000 samples for spatially grounded image editing.
- Comprehensive transformations: Covers both object-centric transformations (movement, rotation) and camera-centric transformations (viewpoint change).
- High fidelity: Generated with a controllable Blender pipeline that renders objects across diverse backgrounds along systematic camera trajectories.
- Precise labels: Provides ground-truth transformations for spatial manipulation tasks.
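The camera-control prompts visible in the dataset preview follow a fixed template ("Camera rotation: Yaw 45.0°, Pitch 0.0°." followed by "Camera zoom: unchanged."). The following is a minimal sketch of how the yaw, pitch, and zoom values might be recovered from such a prompt. The CameraEdit class and parse_camera_prompt function are hypothetical names, and the regular expressions assume the template seen in the preview; other task types or templates in the dataset may differ.

```python
from __future__ import annotations

import re
from dataclasses import dataclass


@dataclass(frozen=True)
class CameraEdit:
    """Parsed camera instruction (hypothetical container, not part of the dataset)."""
    yaw_deg: float
    pitch_deg: float
    zoom: str


# Patterns assume the prompt template shown in the preview rows.
_ROTATION = re.compile(
    r"Camera rotation:\s*Yaw\s*(-?\d+(?:\.\d+)?)°,\s*Pitch\s*(-?\d+(?:\.\d+)?)°"
)
_ZOOM = re.compile(r"Camera zoom:\s*([^.\n]+)")


def parse_camera_prompt(prompt: str) -> CameraEdit | None:
    """Return the parsed camera edit, or None if the prompt does not match."""
    rotation = _ROTATION.search(prompt)
    zoom = _ZOOM.search(prompt)
    if rotation is None or zoom is None:
        return None
    return CameraEdit(
        yaw_deg=float(rotation.group(1)),
        pitch_deg=float(rotation.group(2)),
        zoom=zoom.group(1).strip(),
    )


if __name__ == "__main__":
    example = (
        "<image>\nMove the camera.\n"
        "- Camera rotation: Yaw -135.0°, Pitch 0.0°.\n"
        "- Camera zoom: unchanged.\n"
        "- Keep the 3D scene static; only change the viewpoint."
    )
    print(parse_camera_prompt(example))
```

Returning None for non-matching prompts lets a caller skip samples from other task types instead of raising an error.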
Citation
@article{xiao2026spatialedit,
  title   = {SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing},
  author  = {Xiao, Yicheng and Zhang, Wenhu and Song, Lin and Chen, Yukang and Li, Wenbo and Jiang, Nan and Ren, Tianhe and Lin, Haokun and Huang, Wei and Huang, Haoyang and Li, Xiu and Duan, Nan and Qi, Xiaojuan},
  journal = {arXiv preprint arXiv:2604.04911},
  year    = {2026}
}
