ViewCrafter is an advanced video diffusion model that synthesizes high-fidelity novel views from single or few images, combining generative capabilities with point-based 3D representation for precise camera pose control.
What is ViewCrafter?
ViewCrafter is an advanced video diffusion model developed by Peking University, CUHK, and Tencent. It synthesizes high-fidelity novel views from single or few images by combining the generative capabilities of video diffusion models with point-based 3D representation. This allows for precise control over camera poses to generate high-quality video frames. Through iterative view synthesis strategies and camera trajectory planning, ViewCrafter gradually expands 3D cues to generate a broader range of novel views. It has demonstrated strong generalization and performance across multiple datasets, offering new possibilities for immersive real-time rendering and scene-level text-to-3D generation applications.

Main Features of ViewCrafter
- Novel View Synthesis: Synthesizes new views from single or few images, expanding the user's perspective.
- 3D Scene Reconstruction: Reconstructs the 3D structure of scenes, providing a geometric foundation for novel view generation.
- Content Creation: Supports text descriptions or other creative inputs to generate 3D scenes, enhancing flexibility in content creation.
- Real-Time Rendering: Optimizes 3D scene representation for real-time rendering, suitable for virtual and augmented reality applications.
- Dataset Generalization: Validates model performance across multiple datasets, ensuring generalization across different scenarios.
Technical Principles of ViewCrafter
- Point Cloud Reconstruction: Extracts depth information from input images using dense stereo vision algorithms to build a 3D point cloud model of the scene.
- Video Diffusion Model: Uses generative models, particularly diffusion models, to generate novel views by gradually recovering clear images from noisy ones.
- Iterative View Synthesis: Continuously optimizes novel view generation, with each iteration including the generation of new views and updates to the point cloud model.
- Camera Trajectory Planning: Automatically plans camera movement trajectories to capture scenes from different angles, generating more comprehensive views.
- 3D Scene Understanding: Combines point clouds with generative models to understand the 3D structure of scenes, generating novel views consistent with the original scene.
Project Links for ViewCrafter
Application Scenarios of ViewCrafter
- Film Production: Generates new perspectives for special effects shots, enhancing visual effects in post-production.
- Game Development: Creates realistic game environments and backgrounds, providing a more immersive gaming experience.
- Virtual Reality (VR): Generates 360-degree panoramic images in VR applications, enhancing user immersion.
- Augmented Reality (AR): Seamlessly integrates virtual objects into the real world, offering richer interactive experiences.
- Architectural Visualization: Helps designers showcase architectural models from different angles, providing more intuitive design evaluations.