IBRNet: Learning Multi-View Image-Based Rendering
- Qianqian Wang, Zhicheng Wang, T. Funkhouser
- Computer ScienceComputer Vision and Pattern Recognition
- 25 February 2021
A method that synthesizes novel views of complex scenes by interpolating a sparse set of nearby views using a network architecture that includes a multilayer perceptron and a ray transformer that estimates radiance and volume density at continuous 5D locations.
Unsupervised Training for 3D Morphable Model Regression
- Kyle Genova, Forrester Cole, Aaron Maschinot, Aaron Sarna, Daniel Vlasic, W. Freeman
- Computer ScienceIEEE/CVF Conference on Computer Vision and…
- 1 June 2018
This work introduces three objectives: a batch distribution loss that encourages the output distribution to match the distribution of the morphable model, a loopback loss that ensures the network can correctly reinterpret its own output, and a multi-view identity loss that compares the features of the predicted 3D face and the input photograph from multiple viewing angles.
Towards Fairness in Visual Recognition: Effective Strategies for Bias Mitigation
- Zeyu Wang, Klint Qinami, Olga Russakovsky
- Computer ScienceComputer Vision and Pattern Recognition
- 26 November 2019
A simple but surprisingly effective visual recognition benchmark for studying bias mitigation, and a simple but similarly effective alternative to the inference-time Reducing Bias Amplification method of Zhao et al., and design a domain-independent training technique that outperforms all other methods.
CvxNet: Learnable Convex Decomposition
- Boyang Deng, Kyle Genova, S. Yazdani, Sofien Bouaziz, Geoffrey E. Hinton, A. Tagliasacchi
- Computer ScienceComputer Vision and Pattern Recognition
- 12 September 2019
This work introduces a network architecture to represent a low dimensional family of convexes, automatically derived via an auto-encoding process, and investigates the applications including automatic convex decomposition, image to 3D reconstruction, and part-based shape retrieval.
Learning Shape Templates With Structured Implicit Functions
- Kyle Genova, Forrester Cole, Daniel Vlasic, Aaron Sarna, W. Freeman, T. Funkhouser
- Computer ScienceIEEE International Conference on Computer Vision
- 12 April 2019
It is shown that structured implicit functions are suitable for learning and allow a network to smoothly and simultaneously fit multiple classes of shapes in a general shape template from data.
Text-based editing of talking-head video
- Ohad Fried, Ayush Tewari, Maneesh Agrawala
- Computer ScienceACM Transactions on Graphics
- 4 June 2019
This work proposes a novel method to edit talking-head video based on its transcript to produce a realistic output video in which the dialogue of the speaker has been modified, while maintaining a seamless audio-visual flow (i.e. no jump cuts).
Local Deep Implicit Functions for 3D Shape
- Kyle Genova, Forrester Cole, Avneesh Sud, Aaron Sarna, T. Funkhouser
- Computer ScienceComputer Vision and Pattern Recognition
- 12 December 2019
Local Deep Implicit Functions (LDIF), a 3D shape representation that decomposes space into a structured set of learned implicit functions that provides higher surface reconstruction accuracy than the state-of-the-art (OccNet), while requiring fewer than 1\% of the network parameters.
NeSF: Neural Semantic Fields for Generalizable Semantic Segmentation of 3D Scenes
- Suhani Vora, Noha Radwan, Daniel Duckworth
- Computer Science
- 25 November 2021
The NeSF method is the first to learn semantics by recognizing patterns in the geometry stored within a 3D neural field representation, and generalizes to novel scenes, and requires as little as one semantic map per scene at training time.
Panoptic Neural Fields: A Semantic Object-Aware Neural Scene Representation
- Abhijit Kundu, Kyle Genova, T. Funkhouser
- Computer ScienceComputer Vision and Pattern Recognition
- 9 May 2022
Panoptic Neural Fields is presented, an object-aware neural scene representation that decomposes a scene into a set of objects (things) and background (stuff) that can be smaller and faster than previous object- aware approaches, while still leveraging category-specific priors incorporated via meta-learned initialization.
Deep Structured Implicit Functions
- Kyle Genova, Forrester Cole, Avneesh Sud, Aaron Sarna, T. Funkhouser
- Computer ScienceArXiv
- 12 December 2019
Deep Structured Implicit Functions (DSIF), a 3D shape representation that decomposes space into a structured set of local deep implicit functions that provides 10.3 points higher surface reconstruction accuracy than the state-of-the-art (OccNet), while requiring fewer than 1 percent of the network parameters.
...
...