IR-HGP — Physically-Aware Gaussian Inverse Rendering

💡

研究动机：为什么高光照场景特别难？

Motivation: Why Are High-Illumination Scenes Hard?

逆渲染在高光照下面临三大互相纠缠的技术挑战

Three intertwined technical challenges for inverse rendering under high illumination

⚠️

挑战一

Challenge 1

渲染-几何断裂Render-Geometry Break
无显式 surface → 阴影计算不可靠
Visibility 估计偏差大

➕

🔍

挑战二

Challenge 2

光照-材质耦合Light-Material Coupling
高光反射下材质和光混在一起
Ill-posed 反问题

➕

📉

挑战三

Challenge 3

辐射-优化冲突Radiance-Optimization Conflict
极亮高光"绑架"梯度
暗处细节被忽略

→

✅

IR-HGP 方案

IR-HGP Solution

HVD + GIFP + PARC
三模块协同解耦
PSNR 33.61 SOTA

🔥 高光照场景的三大技术瓶颈

🔥 Three Technical Bottlenecks Under High Illumination

瓶颈一：渲染-几何断裂 Bottleneck 1: Render-Geometry Break

3DGS 用高斯椭球表示场景，没有显式表面（surface）
3DGS represents scenes with Gaussians, no explicit surface
导致 visibility 计算和阴影投射不可靠
Causing unreliable visibility and shadow computation

瓶颈二：光照-材质耦合 Bottleneck 2: Light-Material Coupling

强光下的镜面反射使材质颜色和光照亮度不可分
Specular highlights make albedo and illumination inseparable
从稀疏视图同时估计两者是严重 ill-posed 问题
Simultaneous estimation from sparse views is severely ill-posed

瓶颈三：辐射-优化冲突 Bottleneck 3: Radiance-Optimization Conflict

极亮高光区域主导梯度，暗部细节被忽略
Extremely bright highlights dominate gradients, dark details ignored
heuristic 正则化破坏物理一致性，产生 baked-in shadow
Heuristic regularization breaks physics, causes baked-in shadows

❓ 为什么现有方案不够？

❓ Why Existing Solutions Fall Short?

方向Direction	代表工作Works	局限Limitation
NeRF-based	TensoIR	渲染速度慢 (<1 FPS)	Slow rendering (<1 FPS)
Gaussian-IR	GS-IR	无 visibility 处理	No visibility handling
SDF-based	R3DG, DiscretizedSDF	高光场景 PSNR 不足	Insufficient highlight PSNR
Ours	IR-HGP	三模块协同，物理一致	3-module, physics-consistent

💡 我们的核心洞察： 💡 Our Key Insight: 现有方法将三个问题分开处理或完全忽略。我们提出统一框架，用 HVD 解决几何断裂、用 GIFP 分离光照材质、用 PARC 稳定优化——三者协同实现物理一致的逆渲染。 Existing methods address or ignore these issues separately. We propose a unified framework where HVD fixes geometry break, GIFP separates light/material, and PARC stabilizes optimization—working together for physics-consistent inverse rendering.

🔬

方法概览：IR-HGP 三模块框架

Method Overview: IR-HGP Three-Module Framework

从多视角照片到可重打光 3D 资产的完整流程

Complete pipeline from multi-view photos to relightable 3D assets

1 混合可见性分解Hybrid Visibility Decomp.
HVD — 渲染-几何断裂解决方案 HVD — Solving Render-Geometry Break

📍 输入：稀疏多视角图像Input: Sparse Multi-view Images

K 视图输入 COLMAP 初始化
3DGS Surfels 表示场景 3DGS surfels represent scene

↓

🔄 双路径渲染架构Dual-Path Rendering

快速路径Fast Path

2D Gaussian Surfels
2D Gaussian surfels
实时渲染

精确路径Precise Path

提取三角网格 Mesh
Extract triangle mesh
光线追踪

↓

📐 辐射分解公式Radiance Decomposition

$L_o(p,\omega_o) = L_{dir} \cdot V(p) + L_{ind}$

直接光 × Visibility Direct × Visibility + 间接光 (SH) + Indirect (SH)

↓

✅ 精确阴影 + 实时速度Accurate Shadows + Real-time

Visibility 准确Accurate Visibility 92 FPS 保持92 FPS Maintained

解决渲染-几何断裂

SOLVES RENDER-GEOMETRY BREAK

2 生成式光照先验Generative Illum. Prior
GIFP — 光照-材质耦合解决方案 GIFP — Solving Light-Material Coupling

🖼️ 粗略光照特征 $L_{Coarse}$Coarse Lighting Feature $L_{Coarse}$

从多视角输入提取环境光初步估计 Initial env-map estimation from multi-view inputs
HDR Environment Map

↓

🎨 条件扩散模型先验Conditional Diffusion Prior

Pre-trained Conditional DM

预训练于真实 HDR 环境图数据集
Pre-trained on real HDR env-map dataset
Condition: $L_{Coarse}$ 光照特征
Condition: $L_{Coarse}$ lighting features

↓

📊 SDS 损失引导SDS Loss Guidance

$\mathcal{L}_{GIFP} = \mathbb{E}_{\epsilon,t}\left[\|\epsilon - \epsilon_\theta(E, t)\|^2\right]$

Score Distillation Sampling 拉住优化器 SDS pulls optimizer toward realistic distribution

↓

✅ 物理合理的光照估计Physically-Plausible Lighting

防止模糊/偏色No blur/color shift 材质干净分离Clean albedo separation

解决光照-材质耦合

SOLVES LIGHT-MATERIAL COUPLING

3 物理感知辐射校正Physically-Aware Rad. Correction
PARC — 辐射-优化冲突解决方案 PARC — Solving Radiance-Opt Conflict

📷 HDR 渲染图 + 目标图HDR Rendered + Target Image

极端 luminance 差异高光主导梯度

↓

🎛️ ACES 色调映射校正ACES Tone Mapping Correction

$\hat{I} = T_{ACES}(I;\,\beta)$ $\beta$: 可学习全局曝光参数

基于 ACES 曲线的自适应曝光映射ACES curve-based adaptive exposure mapping
仅 +1 自由度Only +1 DOF

↓

⚖️ 梯度友好空间 LossGradient-Friendly Loss Space

$\mathcal{L}_{total} = \|\hat{I}_{render} - \hat{I}_{target}\|_1$

所有区域获得均衡梯度Balanced gradients across all regions

↓

🏆 干净的材质贴图Clean Albedo Maps

消除 Baked-in Shadow ✨Eliminate Baked-in Shadow ✨
+1.49 dB PSNR gain

解决辐射-优化冲突

SOLVES RADIANCE-OPTIMIZATION CONFLICT

Fig. IR-HGP 三模块流水线：HVD 通过双路径（Surfels 快速渲染 + Mesh 精确光线追踪）分解直接/间接光并准确计算可见性； GIFP 利用预训练条件扩散模型的 SDS 损失约束光照估计不偏离真实分布； PARC 基于 ACES 色调映射引入可学习曝光参数 β，将 HDR 值映射到梯度友好空间，彻底消除 baked-in shadow。

Fig. IR-HGP 3-Module Pipeline: HVD decomposes direct/indirect light via dual-path (fast surfels + accurate mesh ray-tracing); GIFP uses SDS loss from pre-trained conditional diffusion model to constrain lighting estimation; PARC introduces learnable exposure β via ACES tone-mapping, mapping HDR to gradient-friendly space, eliminating baked-in shadows.

📊

实验结果：全面领先的 SOTA 性能

Experiments: Full-Metric SOTA Performance

在所有基准测试上超越现有方法的量化对比

Quantitative comparison against state-of-the-art on all benchmarks

方法 / Method	Method	类型	Type	Mean PSNR ↑	Mean SSIM ↑	Mean LPIPS ↓	训练时间
TensoIR	NeRF	28.22	0.9353	0.0840	5.4h	5.4h	<1
GS-IR	Gaussian	29.25	0.9278	0.0880	0.6h	0.6h	208
R3DG	Gaussian	29.81	0.9645	0.0493	1.1h	1.1h	51
DiscretizedSDF	Gaussian	32.12	0.9700	0.0453	1.2h	1.2h	139
★ IR-HGP (Ours)	Gaussian	33.61	0.9761	0.0369	1.5h	1.5h	92

🧪

消融实验：每个模块都有贡献

Ablation Study: Every Module Contributes

逐一移除各模块验证其必要性

Removing each module verifies its necessity

🚫

移除 HVD 可见性计算

Remove HVD Visibility

去掉 mesh 光线追踪的可见性约束后：

After removing mesh ray-tracing visibility constraints:

阴影投射不准确，出现漏阴影或多重阴影伪影
Inaccurate shadow casting, leaking/multiple shadow artifacts
直接光照项的 visibility 调制失效
Direct light visibility modulation fails

PSNR ↓ 明显下降 | 阴影质量差

🚫

移除 GIFP 扩散先验

Remove GIFP Diffusion Prior

去掉条件扩散模型的 SDS 先验约束后：

After removing conditional diffusion model's SDS prior:

光照估计退化为模糊、偏色的不合理结果
Lighting degrades to unrealistic blurry/biased results
材质和光照重新耦合，albedo 不纯
Material and lighting re-couple, impure albedo

PSNR ↓ ~2dB | 光照模糊偏色

🚫

移除 PARC 辐射校正

Remove PARC Radiometric Corr.

去掉 ACES 色调映射的辐射校正后：

After removing ACES tone-mapping correction:

出现 baked-in shadow——阴影被错误烘焙进 albedo
Baked-in shadow appears — shadows incorrectly baked into albedo
高光区域产生过曝伪影
Overexposure artifacts in highlight regions
换光照后阴影依然存在（最致命）
Shadows persist under relighting (most critical)

PSNR ↓ | Baked-in Shadow 复现

🏆

核心优势：为什么 IR-HGP 更强？

Key Advantages: Why IR-HGP Wins?

从设计哲学到实验效果的全方位优势总结

Comprehensive advantages from design philosophy to experimental results

🔀

双路径渲染：又快又准

Dual-Path Rendering: Fast & Accurate

传统方法要么快但不准（纯 3DGS），要么准但不快（NeRF）。我们的 HVD 模块采用"粗活细干"策略：日常渲染走 2D Gaussian Surfels 路径（保持实时），定期切换到 Mesh 光线追踪计算精确 visibility 和阴影。两者互补而非替代，实现了精度与速度的最佳平衡。

Traditional methods are either fast but inaccurate (pure 3DGS) or accurate but slow (NeRF). Our HVD uses "coarse-fast + precise-slow": daily rendering via 2D Gaussian surfels (real-time), periodically switching to mesh ray-tracing for exact visibility and shadows. Complementary rather than replacement — optimal speed-accuracy balance.

92 FPS + 精确阴影 ★

🎨

扩散先验做光照"守门员"

Diffusion Prior as "Lighting Gatekeeper"

GIFP 是第一个将预训练扩散模型引入 3DGS 逆渲染的工作。扩散模型就像一位"光照专家"，它见过海量真实 HDR 环境图，知道什么光照是合理的。当优化器试图把光估成不合理的样子时，SDS 损失会把它拉回来。结果：光照估计不再模糊偏色，材质贴图干净分离。

GIFP is the first to introduce pre-trained diffusion models into 3DGS inverse rendering. The diffusion model acts as a "lighting expert" that has seen thousands of real HDR env-maps, knowing what plausible lighting looks like. When the optimizer tries implausible estimates, SDS loss pulls it back. Result: no more blurry/biased lighting, clean albedo separation.

~2dB PSNR from GIFP alone

📷

1 个参数根治 Baked-in Shadow

1 Parameter Cures Baked-in Shadow

PARC 只引入了 1 个标量参数 β（可学习全局曝光），却从根本上解决了困扰领域多年的 baked-in shadow 问题。基于 ACES 电影级色调映射曲线，将 HDR 值映射到梯度友好空间，让优化器能同时看清亮部和暗部的细节。不是 heuristic hack，而是有物理依据的校正。

PARC introduces only 1 scalar parameter β (learnable global exposure), yet fundamentally cures the long-standing baked-in shadow problem. Based on cinematic ACES tone-mapping, it maps HDR values to gradient-friendly space, letting the optimizer see details in both bright and dark regions simultaneously. Not a heuristic hack — physically-grounded correction.

+1.49 dB | 仅 +1 DOF | 物理一致 ✦

⚡

高反射物体上的碾压级优势

Dominant Lead on Reflective Objects

IR-HGP 在高反射/高光照物体上的优势尤为惊人。以 Shiny Blender 数据集的 Helmet 物体为例：我们的 PSNR 达到 35.00，而第二名 DiscretizedSDF 只有 30.29—— 差距近 5dB！这正是我们针对高光照场景设计的威力所在。

IR-HGP's advantage is especially dramatic on high-reflective/high-illumination objects. For Shiny Blender's Helmet object: our PSNR reaches 35.00 vs. runner-up DiscretizedSDF's 30.29 — a nearly 5dB gap! This is exactly where our high-illumination-focused design shines.

+4.71 dB on Shiny Helmet ⭐

🔗

三模块协同 > 简单叠加

Synergy > Simple Stacking

HVD、GIFP、PARC 不是三个独立 patch，而是紧密协作的有机整体： HVD 提供准确的 visibility 让 GIFP 的光照分解有可靠基础； GIFP 提供合理的光照让 PARC 的校正有正确的目标； PARC 保证整体优化的稳定性让 HVD/GIFP 的效果不被破坏。 1+1+1 > 3 的系统级设计。

HVD, GIFP, PARC are not independent patches but a tightly-integrated organic system: HVD provides accurate visibility as foundation for GIFP's light decomposition; GIFP supplies plausible lighting as correct target for PARC's correction; PARC ensures overall optimization stability protecting HVD/GIFP effects. A synergistic 1+1+1 > 3 system-level design.

System-level co-design ✦

🎮

面向落地：离线生产 + 在线渲染

Production-Ready: Offline Gen + Online Render

IR-HGP 的 pipeline 天然契合工业界需求： 离线阶段（1.5h/scene）从一组照片生成高质量 3D 资产（形状+材质+光照）； 在线阶段（92 FPS）支持任意重打光和自由视角浏览。这正是游戏美术、电商 3D 展示、XR 内容生产的理想 workflow—— 拍照即建模，建模即可用。

IR-HGP naturally fits industry workflows: Offline phase (1.5h/scene): generate high-quality 3D assets (shape+material+lighting) from photos; Online phase (92 FPS): arbitrary relighting and free-viewpoint browsing. The ideal workflow for game art, e-commerce 3D showcases, XR content production — shoot-to-model, model-to-use.

Game/XR/E-commerce ready 🎯

IR-HGP：基于生成式先验的物理感知高斯逆渲染 IR-HGP: Physically-Aware Gaussian Inverse Rendering via Generative Priors