Publications

2024

Recent Trends in 3D Reconstruction of General Non-Rigid Scenes

Raza Yunus, Jan Eric Lenssen, Michael Niemeyer, Yiyi Liao, Christian Rupprecht, Christian Theobalt, Gerard Pons-Moll, Jia-Bin Huang, Vladislav Golyanik, Eddy Ilg

Eurographics (STAR) and Computer Graphics Forum 2024

Reconstructing models of the real world, including 3D geometry, appearance, and motion of real scenes, is essential for computer graphics and computer vision. It enables the synthesizing of photorealistic novel views, useful for the movie industry and AR/VR applications. It also facilitates the content creation necessary in computer games and AR/VR by avoiding laborious manual design processes. Further, such models are fundamental for intelligent computing systems that need to interpret real-world scenes and actions to act and interact safely with the human world. Notably, the world surrounding us is dynamic, and reconstructing models of dynamic, non-rigidly moving scenes is a severely underconstrained and challenging problem. This state-of-the-art report (STAR) offers the reader a comprehensive summary of state-of-the-art techniques with monocular and multi-view inputs such as data from RGB and RGB-D sensors, among others, conveying an understanding of different approaches, their potential applications, and promising further research directions. The report covers 3D reconstruction of general non-rigid scenes and further addresses the techniques for scene decomposition, editing and controlling, and generalizable and generative modeling. More specifically, we first review the common and fundamental concepts necessary to understand and navigate the field and then discuss the state-of-the-art techniques by reviewing recent approaches that use traditional and machine-learning-based neural representations, including a discussion on the newly enabled applications. The STAR is concluded with a discussion of the remaining limitations and open challenges.

2024

Recent Trends in 3D Reconstruction of General Non-Rigid Scenes

Neural Point Cloud Diffusion for Disentangled 3D Shape and Appearance Generation

Neural Parametric Gaussians for Monocular Non-Rigid Object Reconstruction

Quantum-Hybrid Stereo Matching With Nonlinear Regularization and Spatial Pyramids

2023

SimNP: Learning Self-Similarity Priors Between Neural Points

2022

ERF: Explicit Radiance Field Reconstruction From Scratch

NinjaDesc: Content-Concealing Visual Descriptors via Adversarial Learning

2021

Mitigating Reverse Engineering Attacks on Local Feature Descriptors

2020

Deep Local Shapes: Learning Local SDF Priors for Detailed 3D Reconstruction

TLIO: Tight Learned Inertial Odometry

Domain Adaptation of Learned Features for Visual Localization

2019

Overcoming Limitations of Mixture Density Networks: A Sampling and Fitting Framework for Multimodal Future Prediction

2018

What Makes Good Synthetic Training Data for Learning Disparity and Optical Flow Estimation?

FusionNet and AugmentedFlowNet: Selective Proxy Ground Truth for Training on Unlabeled Images

Occlusions, Motion and Depth Boundaries with a Generic Network for Disparity, Optical Flow or Scene Flow Estimation

Uncertainty Estimates and Multi-Hypotheses Networks for Optical Flow

2017

End-to-End Learning of Video Super-Resolution with Motion Compensation

DeMoN: Depth and Motion Network for Learning Monocular Stereo

Lucid Data Dreaming for Object Tracking

FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks

2016

A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation

2015

FlowNet: Learning Optical Flow with Convolutional Networks

2014

Reconstruction of Rigid Body Models from Motion Distorted Laser Range Data Using Optical Flow