Low-Light Video Denoising

Low-light video denoising research. Currently under review at IEEE ICME as second author. Beating previous SOTA with 29.22 dB PSNR and 0.863 SSIM on BVI-RLV with bidirectional warping and adaptive occlusion masking.

Optical flow fields from DPFlow (base), DPFlow-Noisy (fine-tuned, middle), and RAFT. Fine-tuning cut temporal residual MSE from 2.94 to 2.75 — sharper edges, less bleed into occluded regions.

Impact

Under review at IEEE ICME as second author
Reached 29.22 dB PSNR, 0.863 SSIM, and 0.334 LPIPS on BVI-RLV, all above the pre-tuned baseline
Fine-tuned model cut temporal residual MSE from 2.9417 (DPFlow base) to 2.7472 — a 6.6% reduction

Stack

PythonPyTorchPTLFlowRAFTOpenCVLPIPS

Key Decisions

Occlusion handling: Confidence-weighted fusion of forward and backward warps reduced artifacts in ambiguous regions.
Robustness: Explicit fallback paths kept long sequence runs alive when bidirectional warping failed on edge cases.
Evaluation discipline: Sequence-level and averaged metrics made model regressions immediately visible.

Technical Approach

Each frame is denoised/enhanced while temporal hidden states are cached across a sequence.
Optical flow is computed in both directions using RAFT/PTLFlow to align temporal context.
Occlusion masks are derived from forward-backward consistency error with an adaptive threshold.
Forward/backward warped candidates are fused using photometric confidence weighting.
A pipeline runner orchestrates training and evaluation and exports per-sequence plus averaged metrics.

Visual Comparison

Project Links

Repo