Showing 1–50 of 50 results for author: Zwicker, M

  1. arXiv:2407.19097  [pdf, other]

    cs.GR cs.CV cs.HC cs.LG

    NARVis: Neural Accelerated Rendering for Real-Time Scientific Point Cloud Visualization

    Authors: Srinidhi Hegde, Kaur Kullman, Thomas Grubb, Leslie Lait, Stephen Guimond, Matthias Zwicker

    Abstract: Exploring scientific datasets with billions of samples in real-time visualization presents a challenge: balancing high-fidelity rendering with speed. This work introduces a novel renderer, Neural Accelerated Renderer (NAR), that uses the neural deferred rendering framework to visualize large-scale scientific point cloud data. NAR augments a real-time point cloud rendering pipeline with high-qual…

    Submitted 26 July, 2024; originally announced July 2024.

  2. arXiv:2406.10219  [pdf, other]

    cs.CV cs.GR

    PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting

    Authors: Alex Hanson, Allen Tu, Vasu Singla, Mayuka Jayawardhana, Matthias Zwicker, Tom Goldstein

    Abstract: Recent advancements in novel view synthesis have enabled real-time rendering speeds and high reconstruction accuracy. 3D Gaussian Splatting (3D-GS), a foundational point-based parametric 3D scene representation, models scenes as large sets of 3D Gaussians. Complex scenes can comprise millions of Gaussians, amounting to large storage and memory requirements that limit the viability of 3D-GS on d…

    Submitted 14 June, 2024; originally announced June 2024.

  3. arXiv:2404.12509  [pdf, other]

    cs.GR cs.AI cs.CV cs.LG

    Compositional Neural Textures

    Authors: Peihan Tu, Li-Yi Wei, Matthias Zwicker

    Abstract: Texture plays a vital role in enhancing visual richness in both real photographs and computer-generated imagery. However, the process of editing textures often involves laborious and repetitive manual adjustments of textons, which are the small, recurring local patterns that define textures. In this work, we introduce a fully unsupervised approach for representing textures using a compositional ne…

    Submitted 18 April, 2024; originally announced April 2024.

  4. arXiv:2403.15651  [pdf, other]

    cs.CV

    GaNI: Global and Near Field Illumination Aware Neural Inverse Rendering

    Authors: Jiaye Wu, Saeed Hadadan, Geng Lin, Matthias Zwicker, David Jacobs, Roni Sengupta

    Abstract: In this paper, we present GaNI, a Global and Near-field Illumination-aware neural inverse rendering technique that can reconstruct geometry, albedo, and roughness parameters from images of a scene captured with co-located light and camera. Existing inverse rendering techniques with co-located light-camera focus on single objects only, without modeling global illumination and near-field lighting mo…

    Submitted 22 March, 2024; originally announced March 2024.

  5. arXiv:2309.07749  [pdf, other]

    cs.CV

    OmnimatteRF: Robust Omnimatte with 3D Background Modeling

    Authors: Geng Lin, Chen Gao, Jia-Bin Huang, Changil Kim, Yipeng Wang, Matthias Zwicker, Ayush Saraf

    Abstract: Video matting has broad applications, from adding interesting effects to casually captured movies to assisting video production professionals. Matting with associated effects such as shadows and reflections has also attracted increasing research activity, and methods like Omnimatte have been proposed to separate dynamic foreground objects of interest into their own layers. However, prior works rep…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: ICCV 2023. Project page: https://omnimatte-rf.github.io/

  6. Inverse Global Illumination using a Neural Radiometric Prior

    Authors: Saeed Hadadan, Geng Lin, Jan Novák, Fabrice Rousselle, Matthias Zwicker

    Abstract: Inverse rendering methods that account for global illumination are becoming more popular, but current methods require evaluating and automatically differentiating millions of path integrals by tracing multiple light bounces, which remains expensive and prone to noise. Instead, this paper proposes a radiometric prior as a simple alternative to building complete path integrals in a traditional diffe…

    Submitted 17 May, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: Homepage: https://inverse-neural-radiosity.github.io

  7. arXiv:2303.14587  [pdf, other]

    cs.CV

    PAniC-3D: Stylized Single-view 3D Reconstruction from Portraits of Anime Characters

    Authors: Shuhong Chen, Kevin Zhang, Yichun Shi, Heng Wang, Yiheng Zhu, Guoxian Song, Sizhe An, Janus Kristjansson, Xiao Yang, Matthias Zwicker

    Abstract: We propose PAniC-3D, a system to reconstruct stylized 3D character heads directly from illustrated (p)ortraits of (ani)me (c)haracters. Our anime-style domain poses unique challenges to single-view reconstruction; compared to natural images of human heads, character portrait illustrations have hair and accessories with more complex and diverse geometry, and are shaded with non-photorealistic conto…

    Submitted 25 March, 2023; originally announced March 2023.

    Comments: CVPR 2023, code release: https://github.com/ShuhongChen/panic3d-anime-reconstruction

  8. arXiv:2204.11015  [pdf, other]

    cs.CV

    Surface Reconstruction from Point Clouds by Learning Predictive Context Priors

    Authors: Baorui Ma, Yu-Shen Liu, Matthias Zwicker, Zhizhong Han

    Abstract: Surface reconstruction from point clouds is vital for 3D computer vision. State-of-the-art methods leverage large datasets to first learn local context priors that are represented as neural network-based signed distance functions (SDFs) with some parameters encoding the local contexts. To reconstruct a surface at a specific query location at inference time, these methods then match the local recon…

    Submitted 23 April, 2022; originally announced April 2022.

    Comments: To appear at CVPR2022. Project page: https://mabaorui.github.io/PredictableContextPrior_page/

  9. arXiv:2201.13190  [pdf, other]

    cs.GR cs.CV

    Differentiable Neural Radiosity

    Authors: Saeed Hadadan, Matthias Zwicker

    Abstract: We introduce Differentiable Neural Radiosity, a novel method of representing the solution of the differential rendering equation using a neural network. Inspired by neural radiosity techniques, we minimize the norm of the residual of the differential rendering equation to directly optimize our network. The network is capable of outputting continuous, view-independent gradients of the radiance fiel…

    Submitted 31 January, 2022; originally announced January 2022.

  10. arXiv:2112.10203  [pdf, other]

    cs.CV

    HVTR: Hybrid Volumetric-Textural Rendering for Human Avatars

    Authors: Tao Hu, Tao Yu, Zerong Zheng, He Zhang, Yebin Liu, Matthias Zwicker

    Abstract: We propose a novel neural rendering pipeline, Hybrid Volumetric-Textural Rendering (HVTR), which synthesizes virtual human avatars from arbitrary poses efficiently and at high quality. First, we learn to encode articulated human motions on a dense UV manifold of the human body surface. To handle complicated motions (e.g., self-occlusions), we then leverage the encoded information on the UV manifol…

    Submitted 1 September, 2022; v1 submitted 19 December, 2021; originally announced December 2021.

    Comments: Accepted to 3DV 2022. See more results at https://www.cs.umd.edu/~taohu/hvtr/ Demo: https://www.youtube.com/watch?v=LE0-YpbLlkY

  11. arXiv:2111.12792  [pdf, other]

    cs.CV

    Improving the Perceptual Quality of 2D Animation Interpolation

    Authors: Shuhong Chen, Matthias Zwicker

    Abstract: Traditional 2D animation is labor-intensive, often requiring animators to manually draw twelve illustrations per second of movement. While automatic frame interpolation may ease this burden, 2D animation poses additional difficulties compared to photorealistic video. In this work, we address challenges unexplored in previous animation interpolation systems, with a focus on improving perceptual qua…

    Submitted 17 July, 2022; v1 submitted 24 November, 2021; originally announced November 2021.

    Comments: published at ECCV2022

  12. arXiv:2111.12685  [pdf, other]

    cs.CV

    EgoRenderer: Rendering Human Avatars from Egocentric Camera Images

    Authors: Tao Hu, Kripasindhu Sarkar, Lingjie Liu, Matthias Zwicker, Christian Theobalt

    Abstract: We present EgoRenderer, a system for rendering full-body neural avatars of a person captured by a wearable, egocentric fisheye camera that is mounted on a cap or a VR headset. Our system renders photorealistic novel views of the actor and her motion from arbitrary virtual camera locations. Rendering full-body avatars from such egocentric images comes with unique challenges due to the top-down view…

    Submitted 24 November, 2021; originally announced November 2021.

    Comments: ICCV 2021. https://vcai.mpi-inf.mpg.de/projects/EgoRenderer/

  13. arXiv:2111.01785  [pdf, other]

    cs.CV cs.LG

    PatchGame: Learning to Signal Mid-level Patches in Referential Games

    Authors: Kamal Gupta, Gowthami Somepalli, Anubhav Gupta, Vinoj Jayasundara, Matthias Zwicker, Abhinav Shrivastava

    Abstract: We study a referential game (a type of signaling game) where two agents communicate with each other via a discrete bottleneck to achieve a common goal. In our referential game, the goal of the speaker is to compose a message or a symbolic representation of "important" image patches, while the task for the listener is to match the speaker's message to a different view of the same image. We show tha…

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: To appear at NeurIPS 2021

  14. arXiv:2108.03746  [pdf, other]

    cs.CV

    Unsupervised Learning of Fine Structure Generation for 3D Point Clouds by 2D Projection Matching

    Authors: Chen Chao, Zhizhong Han, Yu-Shen Liu, Matthias Zwicker

    Abstract: Learning to generate 3D point clouds without 3D supervision is an important but challenging problem. Current solutions leverage various differentiable renderers to project the generated 3D point clouds onto a 2D image plane, and train deep neural networks using the per-pixel difference with 2D ground truth images. However, these solutions are still struggling to fully recover fine structures of 3D…

    Submitted 8 August, 2021; originally announced August 2021.

    Comments: To appear at ICCV 2021. Our code, data and models are available at https://github.com/chenchao15/2D_projection_matching

  15. arXiv:2108.03743  [pdf, other]

    cs.CV

    Hierarchical View Predictor: Unsupervised 3D Global Feature Learning through Hierarchical Prediction among Unordered Views

    Authors: Zhizhong Han, Xiyang Wang, Yu-Shen Liu, Matthias Zwicker

    Abstract: Unsupervised learning of global features for 3D shape analysis is an important research challenge because it avoids manual effort for supervised information collection. In this paper, we propose a view-based deep learning model called Hierarchical View Predictor (HVP) to learn 3D shape features from unordered views in an unsupervised manner. To mine highly discriminative information from unordered…

    Submitted 8 August, 2021; originally announced August 2021.

    Comments: To appear at ACMMM 2021

  16. arXiv:2108.01819  [pdf, other]

    cs.CV

    Transfer Learning for Pose Estimation of Illustrated Characters

    Authors: Shuhong Chen, Matthias Zwicker

    Abstract: Human pose information is a critical component in many downstream image processing tasks, such as activity recognition and motion tracking. Likewise, a pose estimator for the illustrated character domain would provide a valuable prior for assistive content creation tasks, such as reference pose retrieval and automatic character animation. But while modern data-driven techniques have substantially…

    Submitted 30 November, 2021; v1 submitted 3 August, 2021; originally announced August 2021.

    Comments: published at WACV2022

  17. Neural Radiosity

    Authors: Saeed Hadadan, Shuhong Chen, Matthias Zwicker

    Abstract: We introduce Neural Radiosity, an algorithm to solve the rendering equation by minimizing the norm of its residual, as in traditional radiosity techniques. Traditional basis functions used in radiosity techniques, such as piecewise polynomials or meshless basis functions, are typically limited to representing isotropic scattering from diffuse surfaces. Instead, we propose to leverage neural…

    Submitted 9 October, 2021; v1 submitted 26 May, 2021; originally announced May 2021.

  18. Continuous Curve Textures

    Authors: Peihan Tu, Li-Yi Wei, Koji Yatani, Takeo Igarashi, Matthias Zwicker

    Abstract: Repetitive patterns are ubiquitous in natural and human-made objects, and can be created with a variety of tools and methods. Manual authoring provides an unmatched degree of freedom and control, but can require significant artistic expertise and manual labor. Computational methods can automate parts of the manual creation process, but are mainly tailored for discrete pixels or elements instead of mo…

    Submitted 14 December, 2020; originally announced December 2020.

  19. arXiv:2011.13495  [pdf, other]

    cs.CV

    Neural-Pull: Learning Signed Distance Functions from Point Clouds by Learning to Pull Space onto Surfaces

    Authors: Baorui Ma, Zhizhong Han, Yu-Shen Liu, Matthias Zwicker

    Abstract: Reconstructing continuous surfaces from 3D point clouds is a fundamental operation in 3D geometry processing. Several recent state-of-the-art methods address this problem using neural networks to learn signed distance functions (SDFs). In this paper, we introduce Neural-Pull, a new approach that is simple and leads to high quality SDFs. Specifically, we train a neural network to pull quer…

    Submitted 23 May, 2021; v1 submitted 26 November, 2020; originally announced November 2020.

    Comments: To appear at ICML2021. Code and data are available at https://github.com/mabaorui/NeuralPull

  20. arXiv:2009.03298  [pdf, other]

    cs.CV cs.GR cs.LG

    Improved Modeling of 3D Shapes with Multi-view Depth Maps

    Authors: Kamal Gupta, Susmija Jabbireddy, Ketul Shah, Abhinav Shrivastava, Matthias Zwicker

    Abstract: We present a simple yet effective general-purpose framework for modeling 3D shapes by leveraging recent advances in 2D image generation using CNNs. Using just a single depth image of the object, we can output a dense multi-view depth map representation of 3D objects. Our simple encoder-decoder framework, comprised of a novel identity encoder and class-conditional viewpoint generator, generates 3D…

    Submitted 7 September, 2020; originally announced September 2020.

  21. arXiv:2007.06127  [pdf, other]

    cs.CV

    DRWR: A Differentiable Renderer without Rendering for Unsupervised 3D Structure Learning from Silhouette Images

    Authors: Zhizhong Han, Chao Chen, Yu-Shen Liu, Matthias Zwicker

    Abstract: Differentiable renderers have been used successfully for unsupervised 3D structure learning from 2D images because they can bridge the gap between 3D and 2D. To optimize 3D shape parameters, current renderers rely on pixel-wise losses between rendered images of 3D reconstructions and ground truth images from corresponding viewpoints. Hence they require interpolation of the recovered 3D structure a…

    Submitted 12 July, 2020; originally announced July 2020.

    Comments: Accepted at ICML2020

  22. Fine-Grained 3D Shape Classification with Hierarchical Part-View Attentions

    Authors: Xinhai Liu, Zhizhong Han, Yu-Shen Liu, Matthias Zwicker

    Abstract: Fine-grained 3D shape classification is important for shape understanding and analysis, and it poses a challenging research problem. However, fine-grained 3D shape classification has rarely been explored, due to the lack of fine-grained 3D shape benchmarks. To address this issue, we first introduce a new 3D shape dataset (named FG3D dataset) with fine-grained class labels, which…

    Submitted 28 December, 2020; v1 submitted 26 May, 2020; originally announced May 2020.

    Comments: Accepted by IEEE Transactions on Image Processing, 2020. The FG3D dataset is available at https://github.com/liuxinhai/FG3D-Net

  23. arXiv:2003.08240  [pdf, other]

    cs.CV

    LRC-Net: Learning Discriminative Features on Point Clouds by Encoding Local Region Contexts

    Authors: Xinhai Liu, Zhizhong Han, Fangzhou Hong, Yu-Shen Liu, Matthias Zwicker

    Abstract: Learning discriminative features directly on point clouds is still challenging in the understanding of 3D shapes. Recent methods usually partition point clouds into local region sets, and then extract the local region features with fixed-size CNN or MLP, and finally aggregate all individual local features into a global feature using simple max pooling. However, due to the irregularity and sparsity…

    Submitted 21 March, 2020; v1 submitted 18 March, 2020; originally announced March 2020.

    Comments: To be published at GMP2020

  24. arXiv:2003.05559  [pdf, other]

    cs.CV

    SeqXY2SeqZ: Structure Learning for 3D Shapes by Sequentially Predicting 1D Occupancy Segments From 2D Coordinates

    Authors: Zhizhong Han, Guanhui Qiao, Yu-Shen Liu, Matthias Zwicker

    Abstract: Structure learning for 3D shapes is vital for 3D computer vision. State-of-the-art methods show promising results by representing shapes using implicit functions in 3D that are learned using discriminative neural networks. However, learning implicit functions requires dense and irregular sampling in 3D space, which also makes the sampling methods affect the accuracy of shape reconstruction during…

    Submitted 16 March, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

  25. arXiv:2002.09925  [pdf, other]

    cs.HC cs.AI cs.DS cs.PL cs.SE

    ORCSolver: An Efficient Solver for Adaptive GUI Layout with OR-Constraints

    Authors: Yue Jiang, Wolfgang Stuerzlinger, Matthias Zwicker, Christof Lutteroth

    Abstract: OR-constrained (ORC) graphical user interface layouts unify conventional constraint-based layouts with flow layouts, which enables the definition of flexible layouts that adapt to screens with different sizes, orientations, or aspect ratios with only a single layout specification. Unfortunately, solving ORC layouts with current solvers is time-consuming and the needed time increases exponentially…

    Submitted 23 February, 2020; originally announced February 2020.

    Comments: Published at CHI2020

  26. arXiv:2001.02728  [pdf, other]

    cs.LG cs.CV stat.ML

    Learning Generative Models using Denoising Density Estimators

    Authors: Siavash A. Bigdeli, Geng Lin, Tiziano Portenier, L. Andrea Dunbar, Matthias Zwicker

    Abstract: Learning probabilistic models that can estimate the density of a given set of samples, and generate samples from that density, is one of the fundamental challenges in unsupervised machine learning. We introduce a new generative model based on denoising density estimators (DDEs), which are scalar functions parameterized by neural networks, that are efficiently trained to represent kernel density es…

    Submitted 9 June, 2020; v1 submitted 8 January, 2020; originally announced January 2020.

    Comments: Code and models available at https://drive.google.com/file/d/1EzKRxnFG1Hd8g6Ggvt-jvKkgpDDwK2bY

  27. arXiv:1912.10545  [pdf, other]

    cs.CV

    Learning to Generate Dense Point Clouds with Textures on Multiple Categories

    Authors: Tao Hu, Geng Lin, Zhizhong Han, Matthias Zwicker

    Abstract: 3D reconstruction from images is a core problem in computer vision. With recent advances in deep learning, it has become possible to recover plausible 3D shapes even from single RGB images for the first time. However, obtaining detailed geometry and texture for objects with arbitrary topology remains challenging. In this paper, we propose a novel approach for reconstructing point clouds from RGB i…

    Submitted 22 December, 2019; originally announced December 2019.

  28. arXiv:1912.07109  [pdf, other]

    cs.CV cs.GR cs.LG

    SDFDiff: Differentiable Rendering of Signed Distance Fields for 3D Shape Optimization

    Authors: Yue Jiang, Dantong Ji, Zhizhong Han, Matthias Zwicker

    Abstract: We propose SDFDiff, a novel approach for image-based shape optimization using differentiable rendering of 3D shapes represented by signed distance functions (SDFs). Compared to other representations, SDFs have the advantage that they can represent shapes with arbitrary topology, and that they guarantee watertight surfaces. We apply our approach to the problem of multi-view 3D reconstruction, where…

    Submitted 22 February, 2022; v1 submitted 15 December, 2019; originally announced December 2019.

    Comments: CVPR2020 Full Paper (Oral Top 5%)

  29. arXiv:1911.12465  [pdf, other]

    cs.CV

    3D Shape Completion with Multi-view Consistent Inference

    Authors: Tao Hu, Zhizhong Han, Matthias Zwicker

    Abstract: 3D shape completion is important to enable machines to perceive the complete geometry of objects from partial observations. To address this problem, view-based methods have been presented. These methods represent shapes as multiple depth images, which can be back-projected to yield corresponding 3D point clouds, and they perform shape completion by learning to complete each depth image using neura…

    Submitted 27 November, 2019; originally announced November 2019.

    Comments: Accepted to AAAI 2020 as oral presentation

  30. L2G Auto-encoder: Understanding Point Clouds by Local-to-Global Reconstruction with Hierarchical Self-Attention

    Authors: Xinhai Liu, Zhizhong Han, Xin Wen, Yu-Shen Liu, Matthias Zwicker

    Abstract: The auto-encoder is an important architecture for understanding point clouds through an encoding and decoding procedure of self-reconstruction. Current auto-encoders mainly focus on learning global structure by global shape reconstruction, while ignoring the learning of local structures. To resolve this issue, we propose Local-to-Global auto-encoder (L2G-AE) to simultaneously learn the local and global…

    Submitted 2 August, 2019; originally announced August 2019.

  31. arXiv:1908.00120  [pdf, other]

    cs.CV

    ShapeCaptioner: Generative Caption Network for 3D Shapes by Learning a Mapping from Parts Detected in Multiple Views to Sentences

    Authors: Zhizhong Han, Chao Chen, Yu-Shen Liu, Matthias Zwicker

    Abstract: 3D shape captioning is a challenging application in 3D shape understanding. Captions from recent multi-view based methods reveal that they cannot capture part-level characteristics of 3D shapes. This leads to a lack of detailed part-level description in captions, which humans tend to focus on. To resolve this issue, we propose ShapeCaptioner, a generative caption network, to perform 3D shape captio…

    Submitted 31 July, 2019; originally announced August 2019.

  32. arXiv:1907.12704  [pdf, other]

    cs.CV

    Multi-Angle Point Cloud-VAE: Unsupervised Feature Learning for 3D Point Clouds from Multiple Angles by Joint Self-Reconstruction and Half-to-Half Prediction

    Authors: Zhizhong Han, Xiyang Wang, Yu-Shen Liu, Matthias Zwicker

    Abstract: Unsupervised feature learning for point clouds has been vital for large-scale point cloud understanding. Recent deep learning based methods depend on learning global geometry from self-reconstruction. However, these methods are still suffering from ineffective learning of local geometry, which significantly limits the discriminability of learned features. To resolve this issue, we propose MAP-VAE…

    Submitted 29 July, 2019; originally announced July 2019.

    Comments: To appear at ICCV 2019

  33. arXiv:1905.07506  [pdf, ps, other]

    cs.CV

    Parts4Feature: Learning 3D Global Features from Generally Semantic Parts in Multiple Views

    Authors: Zhizhong Han, Xinhai Liu, Yu-Shen Liu, Matthias Zwicker

    Abstract: Deep learning has achieved remarkable results in 3D shape analysis by learning global shape features from the pixel-level over multiple views. Previous methods, however, compute low-level features for entire views without considering part-level information. In contrast, we propose a deep neural network, called Parts4Feature, to learn 3D global features from part-level information in multiple views…

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: To appear at IJCAI2019

  34. arXiv:1905.07503  [pdf, ps, other]

    cs.CV

    3DViewGraph: Learning Global Features for 3D Shapes from A Graph of Unordered Views with Attention

    Authors: Zhizhong Han, Xiyang Wang, Chi-Man Vong, Yu-Shen Liu, Matthias Zwicker, C. L. Philip Chen

    Abstract: Learning global features by aggregating information over multiple views has been shown to be effective for 3D shape analysis. For view aggregation in deep learning models, pooling has been applied extensively. However, pooling leads to a loss of the content within views, and the spatial relationship among views, which limits the discriminability of learned features. We propose 3DViewGraph to resol…

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: To appear at IJCAI2019

  35. arXiv:1904.08366  [pdf, other]

    cs.CV

    Render4Completion: Synthesizing Multi-View Depth Maps for 3D Shape Completion

    Authors: Tao Hu, Zhizhong Han, Abhinav Shrivastava, Matthias Zwicker

    Abstract: We propose a novel approach for 3D shape completion by synthesizing multi-view depth maps. While previous work for shape completion relies on volumetric representations, meshes, or point clouds, we propose to use multi-view depth maps from a set of fixed viewing angles as our shape representation. This allows us to be free of the limitations of memory for volumetric representations and point cloud…

    Submitted 21 September, 2019; v1 submitted 17 April, 2019; originally announced April 2019.

    Comments: ICCV 2019 workshop on Geometry meets Deep Learning

  36. arXiv:1903.06763  [pdf, other]

    cs.GR cs.CV

    Smart, Deep Copy-Paste

    Authors: Tiziano Portenier, Qiyang Hu, Paolo Favaro, Matthias Zwicker

    Abstract: In this work, we propose a novel system for smart copy-paste, enabling the synthesis of high-quality results given a masked source image content and a target image context as input. Our system naturally resolves both shading and geometric inconsistencies between source and target image, resulting in a merged result image that features the content from the pasted source image, seamlessly pasted int…

    Submitted 15 March, 2019; originally announced March 2019.

    Comments: 12 pages, 9 figures

  37. arXiv:1901.01499  [pdf, other]

    cs.LG cs.CV stat.ML

    Understanding the (un)interpretability of natural image distributions using generative models

    Authors: Ryen Krusinga, Sohil Shah, Matthias Zwicker, Tom Goldstein, David Jacobs

    Abstract: Probability density estimation is a classical and well studied problem, but standard density estimation methods have historically lacked the power to model complex and high-dimensional image distributions. More recent generative models leverage the power of neural networks to implicitly learn and represent probability models over complex images. We describe methods to extract explicit probability…

    Submitted 25 February, 2019; v1 submitted 5 January, 2019; originally announced January 2019.

  38. arXiv:1812.01874  [pdf, other]

    eess.IV cs.CV

    Learning to Take Directions One Step at a Time

    Authors: Qiyang Hu, Adrian Wälchli, Tiziano Portenier, Matthias Zwicker, Paolo Favaro

    Abstract: We present a method to generate a video sequence given a single image. Because items in an image can be animated in arbitrarily many different ways, we introduce a sequence of motion strokes as a control signal. Such a control signal can be automatically transferred from other videos, e.g., via bounding box tracking. Each motion stroke provides the direction to the moving object in the input image and…

    Submitted 14 August, 2020; v1 submitted 5 December, 2018; originally announced December 2018.

  39. arXiv:1811.02745  [pdf, ps, other]

    cs.CV

    Y^2Seq2Seq: Cross-Modal Representation Learning for 3D Shape and Text by Joint Reconstruction and Prediction of View and Word Sequences

    Authors: Zhizhong Han, Mingyang Shang, Xiyang Wang, Yu-Shen Liu, Matthias Zwicker

    Abstract: A recent method employs 3D voxels to represent 3D shapes, but this limits the approach to low resolutions due to the computational cost caused by the cubic complexity of 3D voxels. Hence the method suffers from a lack of detailed geometry. To resolve this issue, we propose Y^2Seq2Seq, a view-based model, to learn cross-modal representations by joint reconstruction and prediction of view and word s…

    Submitted 6 November, 2018; originally announced November 2018.

    Comments: To be published at AAAI 2019

  40. arXiv:1811.02744  [pdf, ps, other]

    cs.CV

    View Inter-Prediction GAN: Unsupervised Representation Learning for 3D Shapes by Learning Global Shape Memories to Support Local View Predictions

    Authors: Zhizhong Han, Mingyang Shang, Yu-Shen Liu, Matthias Zwicker

    Abstract: In this paper we present a novel unsupervised representation learning approach for 3D shapes, which is an important research challenge as it avoids the manual effort required for collecting supervised data. Our method trains an RNN-based neural network architecture to solve multiple view inter-prediction tasks for each shape. Given several nearby views of a shape, we define view inter-prediction a…

    Submitted 6 November, 2018; originally announced November 2018.

    Comments: To be published at AAAI 2019

  41. arXiv:1811.02565  [pdf, ps, other]

    cs.CV

    Point2Sequence: Learning the Shape Representation of 3D Point Clouds with an Attention-based Sequence to Sequence Network

    Authors: Xinhai Liu, Zhizhong Han, Yu-Shen Liu, Matthias Zwicker

    Abstract: Exploring contextual information in the local region is important for shape understanding and analysis. Existing studies often employ hand-crafted or explicit ways to encode contextual information of local regions. However, it is hard to capture fine-grained contextual information in hand-crafted or explicit manners, such as the correlation between different areas in a local region, which limits t…

    Submitted 15 November, 2018; v1 submitted 6 November, 2018; originally announced November 2018.

    Comments: To be published in AAAI 2019

  42. arXiv:1808.07840  [pdf, other]

    cs.LG cs.GR stat.ML

    Learning to Importance Sample in Primary Sample Space

    Authors: Quan Zheng, Matthias Zwicker

    Abstract: Importance sampling is one of the most widely used variance reduction strategies in Monte Carlo rendering. In this paper, we propose a novel importance sampling technique that uses a neural network to learn how to sample from a desired density represented by a set of samples. Our approach considers an existing Monte Carlo rendering algorithm as a black box. During a scene-dependent training phase,…

    Submitted 22 March, 2024; v1 submitted 23 August, 2018; originally announced August 2018.

    Comments: 11 pages, 14 figures; authors' version, the definitive version of record is available at https://onlinelibrary.wiley.com/doi/10.1111/cgf.13628

    Journal ref: Computer Graphics Forum (CGF), 2019, 38(2): 169-179. (Eurographics 2019)

  43. arXiv:1807.05439  [pdf, other]

    cs.CV

    Specular-to-Diffuse Translation for Multi-View Reconstruction

    Authors: Shihao Wu, Hui Huang, Tiziano Portenier, Matan Sela, Danny Cohen-Or, Ron Kimmel, Matthias Zwicker

    Abstract: Most multi-view 3D reconstruction algorithms, especially when shape-from-shading cues are used, assume that object appearance is predominantly diffuse. To alleviate this restriction, we introduce S2Dnet, a generative adversarial network for transferring multiple views of objects with specular reflection into diffuse ones, so that multi-view reconstruction methods can be applied more effectively. O…

    Submitted 30 July, 2018; v1 submitted 14 July, 2018; originally announced July 2018.

    Comments: Accepted to ECCV 2018

  44. arXiv:1804.08972  [pdf, other]

    cs.CV cs.GR

    FaceShop: Deep Sketch-based Face Image Editing

    Authors: Tiziano Portenier, Qiyang Hu, Attila Szabó, Siavash Arjomand Bigdeli, Paolo Favaro, Matthias Zwicker

    Abstract: We present a novel system for sketch-based face image editing, enabling users to edit images intuitively by sketching a few strokes on a region of interest. Our interface features tools to express a desired image manipulation by providing both geometry and color constraints as user-drawn strokes. As an alternative to the direct user input, our proposed system naturally supports a copy-paste mode,…

    Submitted 7 June, 2018; v1 submitted 24 April, 2018; originally announced April 2018.

    Comments: 13 pages, 20 figures

  45. arXiv:1711.07410  [pdf, other]

    cs.CV

    Disentangling Factors of Variation by Mixing Them

    Authors: Qiyang Hu, Attila Szabó, Tiziano Portenier, Matthias Zwicker, Paolo Favaro

    Abstract: We propose an approach to learn image representations that consist of disentangled factors of variation without exploiting any manual labeling or data domain knowledge. A factor of variation corresponds to an image attribute that can be discerned consistently across a set of images, such as the pose or color of objects. Our disentangled representation consists of a concatenation of feature chunks,…

    Submitted 28 March, 2018; v1 submitted 20 November, 2017; originally announced November 2017.

    Comments: CVPR 2018

  46. arXiv:1711.02245  [pdf, other]

    cs.CV

    Challenges in Disentangling Independent Factors of Variation

    Authors: Attila Szabó, Qiyang Hu, Tiziano Portenier, Matthias Zwicker, Paolo Favaro

    Abstract: We study the problem of building models that disentangle independent factors of variation. Such models could be used to encode features that can efficiently be used for classification and to transfer attributes between different images in image synthesis. As data we use a weakly labeled training set. Our weak labels indicate what single factor has changed between two data samples, although the rel…

    Submitted 6 November, 2017; originally announced November 2017.

    Comments: Submitted to ICLR 2018

  47. arXiv:1709.03749  [pdf, other]

    cs.CV cs.LG

    Deep Mean-Shift Priors for Image Restoration

    Authors: Siavash Arjomand Bigdeli, Meiguang Jin, Paolo Favaro, Matthias Zwicker

    Abstract: In this paper we introduce a natural image prior that directly represents a Gaussian-smoothed version of the natural image distribution. We include our prior in a formulation of image restoration as a Bayes estimator that also allows us to solve noise-blind image restoration problems. We show that the gradient of our prior corresponds to the mean-shift vector on the natural image distribution. In…

    Submitted 4 October, 2017; v1 submitted 12 September, 2017; originally announced September 2017.

    Comments: NIPS 2017

  48. arXiv:1703.09964  [pdf, other]

    cs.CV cs.GR

    Image Restoration using Autoencoding Priors

    Authors: Siavash Arjomand Bigdeli, Matthias Zwicker

    Abstract: We propose to leverage denoising autoencoder networks as priors to address image restoration problems. We build on the key observation that the output of an optimal denoising autoencoder is a local mean of the true data density, and the autoencoder error (the difference between the output and input of the trained autoencoder) is a mean shift vector. We use the magnitude of this mean shift vector,…

    Submitted 29 March, 2017; originally announced March 2017.

  49. arXiv:1608.04642  [pdf, other]

    cs.CV

    Temporally Consistent Motion Segmentation from RGB-D Video

    Authors: Peter Bertholet, Alexandru-Eugen Ichim, Matthias Zwicker

    Abstract: We present a method for temporally consistent motion segmentation from RGB-D videos assuming a piecewise rigid motion model. We formulate global energies over entire RGB-D sequences in terms of the segmentation of each frame into a number of objects, and the rigid motion of each object through the sequence. We develop a novel initialization procedure that clusters feature tracks obtained from the…

    Submitted 16 August, 2016; originally announced August 2016.

    MSC Class: 68T45; ACM Class: I.4.8

  50. arXiv:1605.01583  [pdf, other]

    cs.CE

    Bifurcation Analysis of Reaction Diffusion Systems on Arbitrary Surfaces

    Authors: Daljit Singh J. Dhillon, Michel C. Milinkovitch, Matthias Zwicker

    Abstract: In this paper we present computational techniques to investigate the solutions of two-component, nonlinear reaction-diffusion (RD) systems on arbitrary surfaces. We build on standard techniques for linear and nonlinear analysis of RD systems, and extend them to operate on large-scale meshes for arbitrary surfaces. In particular, we use spectral techniques for a linear stability analysis to charact…

    Submitted 5 May, 2016; originally announced May 2016.

    Comments: This paper was submitted to the Journal of Mathematical Biology, Springer, on 7 July 2015, in its current form (barring image references on the last page and cosmetic changes owing to the rebuild for arXiv). The complete body of work presented here was included and defended as part of my PhD thesis in Nov 2015 at the University of Bern