Video-Generation on Xu'Blog

Video-Generation on Xu'Bloghttps://xuquant.com/tags/video-generation/Recent content in Video-Generation on Xu'BlogXu'Bloghttps://xuquant.com/og-default.pnghttps://xuquant.com/og-default.pngHugo -- 0.152.2zhFri, 15 May 2026 10:00:00 +0800从预测未来到驱动行动：机器人世界模型的架构与评测https://xuquant.com/posts/world-models/world-model-robot-learning/Fri, 15 May 2026 10:00:00 +0800https://xuquant.com/posts/world-models/world-model-robot-learning/围绕 NTU/UC Berkeley/Stanford 联合综述 World Model for Robot Learning，从闭环动机、六范式对比、评测转向到一个关于 disentangled metric 的批判，把机器人世界模型放回本系列的正交视角之中。Wan2.2 and the Boundary of Video World Modelshttps://xuquant.com/posts/world-models/wan2.2-video-world-model-boundary/Sat, 14 Mar 2026 10:00:00 +0800https://xuquant.com/posts/world-models/wan2.2-video-world-model-boundary/Wan2.2 pushes video generation toward photorealistic world simulation, but where is the boundary between generating videos and understanding worlds? This article examines the architecture, training, and fundamental limits of video-based world models.InSpatio-World: Real-Time 4D World Simulation via Spatiotemporal Autoregressive Modelinghttps://xuquant.com/posts/foundation-models/inspatio-world-4d-simulator/Sat, 25 Oct 2025 10:00:00 +0800https://xuquant.com/posts/foundation-models/inspatio-world-4d-simulator/InSpatio-World 深度技术分析：一个 13 亿参数的实时 4D 世界模拟器，通过隐式时空缓存与显式几何约束的结合，实现从单目视频以 24 FPS 进行新视角合成。