💡
研究动机:为什么高光照场景特别难?
Motivation: Why Are High-Illumination Scenes Hard?
逆渲染在高光照下面临三大互相纠缠的技术挑战
Three intertwined technical challenges for inverse rendering under high illumination
⚠️
挑战一
Challenge 1
渲染-几何断裂Render-Geometry Break
无显式 surface → 阴影计算不可靠
Visibility 估计偏差大
🔍
挑战二
Challenge 2
光照-材质耦合Light-Material Coupling
高光反射下材质和光混在一起
Ill-posed 反问题
📉
挑战三
Challenge 3
辐射-优化冲突Radiance-Optimization Conflict
极亮高光"绑架"梯度
暗处细节被忽略
IR-HGP 方案
IR-HGP Solution
HVD + GIFP + PARC
三模块协同解耦
PSNR 33.61 SOTA
🔥 高光照场景的三大技术瓶颈
🔥 Three Technical Bottlenecks Under High Illumination
瓶颈一:渲染-几何断裂 Bottleneck 1: Render-Geometry Break
  • 3DGS 用高斯椭球表示场景,没有显式表面(surface)
  • 3DGS represents scenes with Gaussians, no explicit surface
  • 导致 visibility 计算和阴影投射不可靠
  • Causing unreliable visibility and shadow computation
瓶颈二:光照-材质耦合 Bottleneck 2: Light-Material Coupling
  • 强光下的镜面反射使材质颜色和光照亮度不可分
  • Specular highlights make albedo and illumination inseparable
  • 从稀疏视图同时估计两者是严重 ill-posed 问题
  • Simultaneous estimation from sparse views is severely ill-posed
瓶颈三:辐射-优化冲突 Bottleneck 3: Radiance-Optimization Conflict
  • 极亮高光区域主导梯度,暗部细节被忽略
  • Extremely bright highlights dominate gradients, dark details ignored
  • heuristic 正则化破坏物理一致性,产生 baked-in shadow
  • Heuristic regularization breaks physics, causes baked-in shadows
❓ 为什么现有方案不够?
❓ Why Existing Solutions Fall Short?
方向Direction代表工作Works局限Limitation
NeRF-basedTensoIR渲染速度慢 (<1 FPS)Slow rendering (<1 FPS)
Gaussian-IRGS-IR无 visibility 处理No visibility handling
SDF-basedR3DG, DiscretizedSDF高光场景 PSNR 不足Insufficient highlight PSNR
OursIR-HGP三模块协同,物理一致3-module, physics-consistent
💡 我们的核心洞察: 💡 Our Key Insight: 现有方法将三个问题分开处理或完全忽略。我们提出统一框架, 用 HVD 解决几何断裂、用 GIFP 分离光照材质、用 PARC 稳定优化——三者协同实现物理一致的逆渲染。 Existing methods address or ignore these issues separately. We propose a unified framework where HVD fixes geometry break, GIFP separates light/material, and PARC stabilizes optimization—working together for physics-consistent inverse rendering.
🔬
方法概览:IR-HGP 三模块框架
Method Overview: IR-HGP Three-Module Framework
从多视角照片到可重打光 3D 资产的完整流程
Complete pipeline from multi-view photos to relightable 3D assets
1 混合可见性分解Hybrid Visibility Decomp.
HVD — 渲染-几何断裂解决方案 HVD — Solving Render-Geometry Break
📍 输入:稀疏多视角图像Input: Sparse Multi-view Images
K 视图输入 COLMAP 初始化
3DGS Surfels 表示场景 3DGS surfels represent scene
🔄 双路径渲染架构Dual-Path Rendering
快速路径Fast Path
  • 2D Gaussian Surfels
  • 2D Gaussian surfels
  • 实时渲染
精确路径Precise Path
  • 提取三角网格 Mesh
  • Extract triangle mesh
  • 光线追踪
📐 辐射分解公式Radiance Decomposition
$L_o(p,\omega_o) = L_{dir} \cdot V(p) + L_{ind}$
直接光 × Visibility Direct × Visibility + 间接光 (SH) + Indirect (SH)
精确阴影 + 实时速度Accurate Shadows + Real-time
Visibility 准确Accurate Visibility 92 FPS 保持92 FPS Maintained
解决渲染-几何断裂
SOLVES RENDER-GEOMETRY BREAK
2 生成式光照先验Generative Illum. Prior
GIFP — 光照-材质耦合解决方案 GIFP — Solving Light-Material Coupling
🖼️ 粗略光照特征 $L_{Coarse}$Coarse Lighting Feature $L_{Coarse}$
从多视角输入提取环境光初步估计 Initial env-map estimation from multi-view inputs
HDR Environment Map
🎨 条件扩散模型先验Conditional Diffusion Prior
Pre-trained Conditional DM
  • 预训练于真实 HDR 环境图数据集
  • Pre-trained on real HDR env-map dataset
  • Condition: $L_{Coarse}$ 光照特征
  • Condition: $L_{Coarse}$ lighting features
📊 SDS 损失引导SDS Loss Guidance
$\mathcal{L}_{GIFP} = \mathbb{E}_{\epsilon,t}\left[\|\epsilon - \epsilon_\theta(E, t)\|^2\right]$
Score Distillation Sampling 拉住优化器 SDS pulls optimizer toward realistic distribution
物理合理的光照估计Physically-Plausible Lighting
防止模糊/偏色No blur/color shift 材质干净分离Clean albedo separation
解决光照-材质耦合
SOLVES LIGHT-MATERIAL COUPLING
3 物理感知辐射校正Physically-Aware Rad. Correction
PARC — 辐射-优化冲突解决方案 PARC — Solving Radiance-Opt Conflict
📷 HDR 渲染图 + 目标图HDR Rendered + Target Image
极端 luminance 差异 高光主导梯度
🎛️ ACES 色调映射校正ACES Tone Mapping Correction
$\hat{I} = T_{ACES}(I;\,\beta)$    $\beta$: 可学习全局曝光参数
基于 ACES 曲线的自适应曝光映射ACES curve-based adaptive exposure mapping
仅 +1 自由度Only +1 DOF
⚖️ 梯度友好空间 LossGradient-Friendly Loss Space
$\mathcal{L}_{total} = \|\hat{I}_{render} - \hat{I}_{target}\|_1$
所有区域获得均衡梯度Balanced gradients across all regions
🏆 干净的材质贴图Clean Albedo Maps
消除 Baked-in Shadow ✨Eliminate Baked-in Shadow ✨
+1.49 dB PSNR gain
解决辐射-优化冲突
SOLVES RADIANCE-OPTIMIZATION CONFLICT
Fig. IR-HGP 三模块流水线:HVD 通过双路径(Surfels 快速渲染 + Mesh 精确光线追踪)分解直接/间接光并准确计算可见性; GIFP 利用预训练条件扩散模型的 SDS 损失约束光照估计不偏离真实分布; PARC 基于 ACES 色调映射引入可学习曝光参数 β,将 HDR 值映射到梯度友好空间,彻底消除 baked-in shadow。
Fig. IR-HGP 3-Module Pipeline: HVD decomposes direct/indirect light via dual-path (fast surfels + accurate mesh ray-tracing); GIFP uses SDS loss from pre-trained conditional diffusion model to constrain lighting estimation; PARC introduces learnable exposure β via ACES tone-mapping, mapping HDR to gradient-friendly space, eliminating baked-in shadows.
📊
实验结果:全面领先的 SOTA 性能
Experiments: Full-Metric SOTA Performance
在所有基准测试上超越现有方法的量化对比
Quantitative comparison against state-of-the-art on all benchmarks
方法 / Method Method 类型 Type Mean PSNR ↑ Mean SSIM ↑ Mean LPIPS ↓ 训练时间 Time FPS
TensoIR NeRF 28.220.93530.08405.4h5.4h<1
GS-IR Gaussian 29.250.92780.08800.6h0.6h208
R3DG Gaussian 29.810.96450.04931.1h1.1h51
DiscretizedSDF Gaussian 32.120.97000.04531.2h1.2h139
IR-HGP (Ours) Gaussian 33.61 0.9761 0.0369 1.5h 1.5h 92
🧪
消融实验:每个模块都有贡献
Ablation Study: Every Module Contributes
逐一移除各模块验证其必要性
Removing each module verifies its necessity
A1
🚫
移除 HVD 可见性计算
Remove HVD Visibility
去掉 mesh 光线追踪的可见性约束后:
After removing mesh ray-tracing visibility constraints:
  • 阴影投射不准确,出现漏阴影或多重阴影伪影
  • Inaccurate shadow casting, leaking/multiple shadow artifacts
  • 直接光照项的 visibility 调制失效
  • Direct light visibility modulation fails
PSNR ↓ 明显下降 | 阴影质量差
A2
🚫
移除 GIFP 扩散先验
Remove GIFP Diffusion Prior
去掉条件扩散模型的 SDS 先验约束后:
After removing conditional diffusion model's SDS prior:
  • 光照估计退化为模糊、偏色的不合理结果
  • Lighting degrades to unrealistic blurry/biased results
  • 材质和光照重新耦合,albedo 不纯
  • Material and lighting re-couple, impure albedo
PSNR ↓ ~2dB | 光照模糊偏色
A3
🚫
移除 PARC 辐射校正
Remove PARC Radiometric Corr.
去掉 ACES 色调映射的辐射校正后:
After removing ACES tone-mapping correction:
  • 出现 baked-in shadow——阴影被错误烘焙进 albedo
  • Baked-in shadow appears — shadows incorrectly baked into albedo
  • 高光区域产生过曝伪影
  • Overexposure artifacts in highlight regions
  • 换光照后阴影依然存在(最致命)
  • Shadows persist under relighting (most critical)
PSNR ↓ | Baked-in Shadow 复现
🏆
核心优势:为什么 IR-HGP 更强?
Key Advantages: Why IR-HGP Wins?
从设计哲学到实验效果的全方位优势总结
Comprehensive advantages from design philosophy to experimental results
01
🔀
双路径渲染:又快又准
Dual-Path Rendering: Fast & Accurate
传统方法要么快但不准(纯 3DGS),要么准但不快(NeRF)。我们的 HVD 模块采用"粗活细干"策略: 日常渲染走 2D Gaussian Surfels 路径(保持实时),定期切换到 Mesh 光线追踪计算精确 visibility 和阴影。 两者互补而非替代,实现了精度与速度的最佳平衡
Traditional methods are either fast but inaccurate (pure 3DGS) or accurate but slow (NeRF). Our HVD uses "coarse-fast + precise-slow": daily rendering via 2D Gaussian surfels (real-time), periodically switching to mesh ray-tracing for exact visibility and shadows. Complementary rather than replacement — optimal speed-accuracy balance.
92 FPS + 精确阴影 ★
02
🎨
扩散先验做光照"守门员"
Diffusion Prior as "Lighting Gatekeeper"
GIFP 是第一个将预训练扩散模型引入 3DGS 逆渲染的工作。 扩散模型就像一位"光照专家",它见过海量真实 HDR 环境图,知道什么光照是合理的。 当优化器试图把光估成不合理的样子时,SDS 损失会把它拉回来。 结果:光照估计不再模糊偏色,材质贴图干净分离。
GIFP is the first to introduce pre-trained diffusion models into 3DGS inverse rendering. The diffusion model acts as a "lighting expert" that has seen thousands of real HDR env-maps, knowing what plausible lighting looks like. When the optimizer tries implausible estimates, SDS loss pulls it back. Result: no more blurry/biased lighting, clean albedo separation.
~2dB PSNR from GIFP alone
03
📷
1 个参数根治 Baked-in Shadow
1 Parameter Cures Baked-in Shadow
PARC 只引入了 1 个标量参数 β(可学习全局曝光),却从根本上解决了困扰领域多年的 baked-in shadow 问题。 基于 ACES 电影级色调映射曲线,将 HDR 值映射到梯度友好空间, 让优化器能同时看清亮部和暗部的细节。不是 heuristic hack,而是有物理依据的校正。
PARC introduces only 1 scalar parameter β (learnable global exposure), yet fundamentally cures the long-standing baked-in shadow problem. Based on cinematic ACES tone-mapping, it maps HDR values to gradient-friendly space, letting the optimizer see details in both bright and dark regions simultaneously. Not a heuristic hack — physically-grounded correction.
+1.49 dB | 仅 +1 DOF | 物理一致 ✦
04
高反射物体上的碾压级优势
Dominant Lead on Reflective Objects
IR-HGP 在高反射/高光照物体上的优势尤为惊人。 以 Shiny Blender 数据集的 Helmet 物体为例: 我们的 PSNR 达到 35.00,而第二名 DiscretizedSDF 只有 30.29—— 差距近 5dB!这正是我们针对高光照场景设计的威力所在。
IR-HGP's advantage is especially dramatic on high-reflective/high-illumination objects. For Shiny Blender's Helmet object: our PSNR reaches 35.00 vs. runner-up DiscretizedSDF's 30.29 — a nearly 5dB gap! This is exactly where our high-illumination-focused design shines.
+4.71 dB on Shiny Helmet ⭐
05
🔗
三模块协同 > 简单叠加
Synergy > Simple Stacking
HVD、GIFP、PARC 不是三个独立 patch,而是紧密协作的有机整体: HVD 提供准确的 visibility 让 GIFP 的光照分解有可靠基础; GIFP 提供合理的光照让 PARC 的校正有正确的目标; PARC 保证整体优化的稳定性让 HVD/GIFP 的效果不被破坏。 1+1+1 > 3 的系统级设计。
HVD, GIFP, PARC are not independent patches but a tightly-integrated organic system: HVD provides accurate visibility as foundation for GIFP's light decomposition; GIFP supplies plausible lighting as correct target for PARC's correction; PARC ensures overall optimization stability protecting HVD/GIFP effects. A synergistic 1+1+1 > 3 system-level design.
System-level co-design ✦
06
🎮
面向落地:离线生产 + 在线渲染
Production-Ready: Offline Gen + Online Render
IR-HGP 的 pipeline 天然契合工业界需求: 离线阶段(1.5h/scene)从一组照片生成高质量 3D 资产(形状+材质+光照); 在线阶段(92 FPS)支持任意重打光和自由视角浏览。 这正是游戏美术、电商 3D 展示、XR 内容生产的理想 workflow—— 拍照即建模,建模即可用。
IR-HGP naturally fits industry workflows: Offline phase (1.5h/scene): generate high-quality 3D assets (shape+material+lighting) from photos; Online phase (92 FPS): arbitrary relighting and free-viewpoint browsing. The ideal workflow for game art, e-commerce 3D showcases, XR content production — shoot-to-model, model-to-use.
Game/XR/E-commerce ready 🎯