Showing 1–5 of 5 results for author: Tsagkas, N

Searching in archive cs.
  1. arXiv:2408.10123

    cs.RO cs.CV

    Learning Precise Affordances from Egocentric Videos for Robotic Manipulation

    Authors: Gen Li, Nikolaos Tsagkas, Jifei Song, Ruaridh Mon-Williams, Sethu Vijayakumar, Kun Shao, Laura Sevilla-Lara

    Abstract: Affordance, defined as the potential actions that an object offers, is crucial for robotic manipulation tasks. A deep understanding of affordance can lead to more intelligent AI systems. For example, such knowledge directs an agent to grasp a knife by the handle for cutting and by the blade when passing it to someone. In this paper, we present a streamlined affordance learning system that encompas…

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: Project page: https://reagan1311.github.io/affgrasp

  2. arXiv:2403.14526

    cs.RO cs.AI cs.CV

    Click to Grasp: Zero-Shot Precise Manipulation via Visual Diffusion Descriptors

    Authors: Nikolaos Tsagkas, Jack Rome, Subramanian Ramamoorthy, Oisin Mac Aodha, Chris Xiaoxuan Lu

    Abstract: Precise manipulation that is generalizable across scenes and objects remains a persistent challenge in robotics. Current approaches for this task heavily depend on having a significant number of training instances to handle objects with pronounced visual and/or geometric part ambiguities. Our work explores the grounding of fine-grained part descriptors for precise manipulation in a zero-shot setti…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 8 pages, 4 figures

  3. arXiv:2305.12427

    cs.CV

    VL-Fields: Towards Language-Grounded Neural Implicit Spatial Representations

    Authors: Nikolaos Tsagkas, Oisin Mac Aodha, Chris Xiaoxuan Lu

    Abstract: We present Visual-Language Fields (VL-Fields), a neural implicit spatial representation that enables open-vocabulary semantic queries. Our model encodes and fuses the geometry of a scene with vision-language trained latent features by distilling information from a language-driven segmentation model. VL-Fields is trained without requiring any prior knowledge of the scene object classes, which makes…

    Submitted 25 May, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: Project page: https://tsagkas.github.io/vl-fields/

  4. Inference and Learning for Generative Capsule Models

    Authors: Alfredo Nazabal, Nikolaos Tsagkas, Christopher K. I. Williams

    Abstract: Capsule networks (see e.g. Hinton et al., 2018) aim to encode knowledge of and reason about the relationship between an object and its parts. In this paper we specify a generative model for such data, and derive a variational algorithm for inferring the transformation of each model object in a scene, and the assignments of observed parts to the objects. We derive a learning algorithm for the objec…

    Submitted 21 October, 2022; v1 submitted 7 September, 2022; originally announced September 2022.

    Comments: 31 pages, 6 figures. This paper extends our previous work (arXiv:2103.06676) by covering the learning of the models as well as inference. Paper accepted for publication in Neural Computation.

    Journal ref: Neural Computation 35(4) (2023) 727-761

  5. arXiv:2103.06676

    cs.LG

    Inference for Generative Capsule Models

    Authors: Alfredo Nazabal, Nikolaos Tsagkas, Christopher K. I. Williams

    Abstract: Capsule networks (see e.g. Hinton et al., 2018) aim to encode knowledge and reason about the relationship between an object and its parts. In this paper we specify a generative model for such data, and derive a variational algorithm for inferring the transformation of each object and the assignments of observed parts to the objects. We apply this model to (i) data generated from multiple ge…

    Submitted 14 March, 2022; v1 submitted 11 March, 2021; originally announced March 2021.