<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>Computer-Vision on Xu'Blog</title><link>https://xuquant.com/tags/computer-vision/</link><description>Recent content in Computer-Vision on Xu'Blog</description><image><title>Xu'Blog</title><url>https://xuquant.com/images/profile.jpg</url><link>https://xuquant.com/images/profile.jpg</link></image><generator>Hugo -- 0.152.2</generator><language>en</language><lastBuildDate>Thu, 30 Apr 2026 18:00:00 +0800</lastBuildDate><atom:link href="https://xuquant.com/tags/computer-vision/index.xml" rel="self" type="application/rss+xml"/><item><title>SceneVerse++: Lifting Unlabeled Internet Videos into 3D Scene Understanding Training Data</title><link>https://xuquant.com/posts/sceneverse-plus-data-engine-for-3d-scene-understanding/</link><pubDate>Thu, 30 Apr 2026 18:00:00 +0800</pubDate><guid>https://xuquant.com/posts/sceneverse-plus-data-engine-for-3d-scene-understanding/</guid><description>Deep analysis of CVPR 2026 SceneVerse++: how to build the largest-scale real-world 3D scene dataset from unlabeled internet videos, covering detection, segmentation, spatial VQA, and vision-language navigation.</description></item></channel></rss>