Matthew Tancik

Matthew Tancik*,Ethan Weber*,Evonne Ng*,Ruilong Li,Brent Yi,Justin Kerr,Terrance Wang,Alexander Kristoffersen,Jake Austin,Kamyar Salahi,Abhik Ahuja,David McAllister,Angjoo Kanazawa

Hierarchical grouping in 3D by training a scale-conditioned affinity field from multi-level masks

SIGGRAPH (2023)

Project Website Github arXiv

A Modular Framework for Neural Radiance Field Development.

LERF: Language Embedded Radiance Fields

Justin Kerr*,Chung Min Kim*,Ken Goldberg,Angjoo Kanazawa,Matthew Tancik

ICCV (2023) Oral

Ayaan Haque,Matthew Tancik,Alexei Efros,Aleksander Hołyński,Angjoo Kanazawa,

Grounding CLIP vectors volumetrically inside a NeRF allows flexible natural language queries in 3D.

Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions

ICCV (2023) Oral

Ruilong Li,Hang Gao,Matthew Tancik,Angjoo Kanazawa

Instruct-NeRF2NeRF enables instruction-based editing of NeRFs via a 2D diffusion model.

NerfAcc: Efficient Sampling Accelerates NeRFs

ICCV (2023)

Frederik Warburg,Ethan Weber*,Matthew Tancik,Aleksander Hołyński,Angjoo Kanazawa

NerfAcc integrates advanced efficient sampling techniques that lead to significant speedups in training various recent NeRF papers with minimal modifications to existing codebases.

Nerfbusters: Removing Ghostly Artifacts from Casually Captured NeRFs

ICCV (2023)

Justin Kerr,Letian Fu,Huang Huang,Yahav Avigal,Matthew Tancik,Jeffrey Ichnowski,Angjoo Kanazawa,Ken Goldberg

Nerfbusters proposes an evaluation procedure for in-the-wild NeRFs, and presents a method that uses a 3D diffusion prior to clean NeRFs.

Evo-NeRF: Evolving NeRF for Sequential Robot Grasping

CoRL (2022) Oral

OpenReview Project Website

We show that by training NeRFs incrementally over a stream of images, they can be used robotics grasping tasks. They are particularly useful in tasks involving transparent objects which are traditionally hard to compute geometry for.

The One Where They Reconstructed
3D Humans and Environments in TV Shows

Georgios Pavlakos*,Ethan Weber*,Matthew Tancik,Angjoo Kanazawa

ECCV (2022)

Matthew Tancik,Vincent Casser,Xinchen Yan,Sabeek Pradhan,Ben Mildenhall,Pratul P. Srinivasan,Jonathan T. Barron,Henrik Kretzschmar

We show that is it possible to reconstruct TV show in 3D. Further, reasoning about humans and their environment in 3D enables a broad range of downstream applications: re-identification, gaze estimation, cinematography and image editing.

Block-NeRF: Scalable Large Scene Neural View Synthesis

CVPR (2022) Oral

Alex Yu*,Sara Fridovich-Keil*,Matthew Tancik,Qinhong Chen,Benjamin Recht,Angjoo Kanazawa

We present a variant of Neural Radiance Fields that can represent large-scale environments. We build a grid of Block-NeRFs from 2.8 million images to create the largest neural scene representation to date, capable of rendering an entire neighborhood of San Francisco.

Plenoxels: Radiance Fields without Neural Networks

CVPR (2022) Oral

Alex Yu,Ruilong Li,Matthew Tancik,Hao Li,Ren Ng,Angjoo Kanazawa

We propose a view-dependent sparse voxel model, Plenoxel (plenoptic volume element), that can optimize to the same fidelity as Neural Radiance Fields (NeRFs) without any neural networks. Our typical optimization time is 11 minutes on a single GPU, a speedup of two orders of magnitude compared to NeRF.

PlenOctrees for Real-time Rendering of Neural Radiance Fields

ICCV (2021) Oral

arXiv Demo / Project Website Video

We introduce a method to render Neural Radiance Fields (NeRFs) in real time without sacrificing quality. Our method preserves the ability of NeRFs to perform free-viewpoint rendering of scenes with arbitrary geometry and view-dependent effects.

Mip-NeRF: A Multiscale Representation for Anti-Aliasing Neural Radiance Fields

Jonathan T. Barron,Ben Mildenhall,Matthew Tancik,Peter Hedman,Ricardo Martin-Brualla,Pratul P. Srinivasan

ICCV (2021) Oral - Best Paper Honorable Mention

Ajay Jain,Matthew Tancik,Pieter Abbeel

The rendering procedure used by neural radiance fields (NeRF) samples a scene with a single ray per pixel and may therefore produce renderings that are excessively blurred or aliased when training or testing images observe scene content at different resolutions. We prefilter the positional encoding function and train NeRF to generate anti-aliased renderings.

Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis

ICCV (2021)

Matthew Tancik*,Ben Mildenhall*,Terrance Wang,Divi Schmidt,Pratul P. Srinivasan,Jonathan T. Barron,Ren Ng

We introduce an auxiliary semantic consistency loss that encourages realistic renderings at novel poses. Our semantic loss allows us to supervise DietNeRF from arbitrary poses. We extract these semantics using a pre-trained visual encoder such as CLIP.

Learned Initializations for Optimizing Coordinate-Based Neural Representations

CVPR (2021) Oral

Alex Yu,Vickie Ye,Matthew Tancik,Angjoo Kanazawa

We find that standard meta-learning algorithms for weight initialization can enable faster convergence during optimization and can serve as a strong prior over the signal class being modeled, resulting in better generalization when only partial observations of a given signal are available.

pixelNeRF: Neural Radiance Fields from One or Few Images

CVPR (2021)

Pratul P. Srinivasan,Boyang Deng,Xiuming Zhang,Matthew Tancik,Ben Mildenhall,Jonathan T. Barron,

We propose a learning framework that predicts a continuous neural scene representation from one or few input images by conditioning on image features encoded by a convolutional neural network.

NeRV: Neural Reflectance and Visibility Fields for Relighting and View Synthesis

CVPR (2021)

Matthew Tancik*,Pratul P. Srinivasan*,Ben Mildenhall*,Sara Fridovich-Keil,Nithin Raghavan,Utkarsh Singhal,Ravi Ramamoorthi,Jonathan T. Barron,Ren Ng

We recover relightable NeRF-like models using neural approximations of expensive visibility integrals, so we can simulate complex volumetric light transport during training.

Fourier Features Let Networks Learn
High Frequency Functions in Low Dimensional Domains

NeurIPS (2020) Spotlight

Ben Mildenhall*,Pratul P. Srinivasan*,Matthew Tancik*,Jonathan T. Barron,Ravi Ramamoorthi,Ren Ng

We show that passing input points through a simple Fourier feature mapping enables a multilayer perceptron (MLP) to learn high-frequency functions in low-dimensional problem domains. These results shed light on recent advances in computer vision and graphics that achieve state-of-the-art results by using MLPs to represent complex 3D objects and scenes.

NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis

ECCV (2020) Oral - Best Paper Honorable Mention

arXiv Project Website Code Video Follow-ups

We propose an algorithm that represents a scene using a fully-connected (non-convolutional) deep network, whose input is a single continuous 5D coordinate (spatial location (x, y, z) and viewing direction (θ, φ)) and whose output is the volume density and view-dependent emitted radiance at that spatial location. With this representation we achieve state-of-the-art results for synthesizing novel views of scenes from a sparse set of input views.

StegaStamp: Invisible Hyperlinks in Physical Photographs

Matthew Tancik*,Ben Mildenhall*,Ren Ng

CVPR (2020)

Pratul P. Srinivasan*,Ben Mildenhall*,Matthew Tancik,Jonathan T. Barron,Richard Tucker,Noah Snavely

We present a deep learning method to hide imperceptible data into printed images that can be recovered after photographing the print. The method is robust to corruptions like shadows, occlusions, noice, and shift in color .

Lighthouse: Predicting Lighting Volumes
for Spatially-Coherent Illumination

CVPR (2020)