Skip to main content

Showing 1–5 of 5 results for author: Baba, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.10498  [pdf, other

    eess.IV cs.CV

    Cervical Cancer Detection Using Multi-Branch Deep Learning Model

    Authors: Tatsuhiro Baba, Abu Saleh Musa Miah, Jungpil Shin, Md. Al Mehedi Hasan

    Abstract: Cervical cancer is a crucial global health concern for women, and the persistent infection of High-risk HPV mainly triggers this remains a global health challenge, with young women diagnosis rates soaring from 10\% to 40\% over three decades. While Pap smear screening is a prevalent diagnostic method, visual image analysis can be lengthy and often leads to mistakes. Early detection of the disease… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

  2. arXiv:2401.10005  [pdf, other

    cs.CV cs.CL

    Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation

    Authors: Kohei Uehara, Nabarun Goswami, Hanqin Wang, Toshiaki Baba, Kohtaro Tanaka, Tomohiro Hashimoto, Kai Wang, Rei Ito, Takagi Naoya, Ryo Umagami, Yingyi Wen, Tanachai Anakewat, Tatsuya Harada

    Abstract: The increasing demand for intelligent systems capable of interpreting and reasoning about visual content requires the development of large Vision-and-Language Models (VLMs) that are not only accurate but also have explicit reasoning capabilities. This paper presents a novel approach to develop a VLM with the ability to conduct explicit reasoning based on visual content and textual instructions. We… ▽ More

    Submitted 17 July, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

  3. arXiv:2304.00964  [pdf, other

    cs.CV cs.CL

    Robust Text-driven Image Editing Method that Adaptively Explores Directions in Latent Spaces of StyleGAN and CLIP

    Authors: Tsuyoshi Baba, Kosuke Nishida, Kyosuke Nishida

    Abstract: Automatic image editing has great demands because of its numerous applications, and the use of natural language instructions is essential to achieving flexible and intuitive editing as the user imagines. A pioneering work in text-driven image editing, StyleCLIP, finds an edit direction in the CLIP space and then edits the image by mapping the direction to the StyleGAN space. At the same time, it i… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

  4. arXiv:2205.04052  [pdf, other

    cs.RO eess.SY

    Robot formation control in nonlinear manifold using Koopman operator theory

    Authors: Yanran Wang, Tatsuya Baba, Takashi Hikihara

    Abstract: Formation control of multi-agent systems has been a prominent research topic, spanning both theoretical and practical domains over the past two decades. Our study delves into the leader-follower framework, addressing two critical, previously overlooked aspects. Firstly, we investigate the impact of an unknown nonlinear manifold, introducing added complexity to the formation control challenge. Seco… ▽ More

    Submitted 12 August, 2023; v1 submitted 9 May, 2022; originally announced May 2022.

  5. arXiv:2105.09034  [pdf, other

    cs.GR cs.CV

    Guided Facial Skin Color Correction

    Authors: Keiichiro Shirai, Tatsuya Baba, Shunsuke Ono, Masahiro Okuda, Yusuke Tatesumi, Paul Perrotin

    Abstract: This paper proposes an automatic image correction method for portrait photographs, which promotes consistency of facial skin color by suppressing skin color changes due to background colors. In portrait photographs, skin color is often distorted due to the lighting environment (e.g., light reflected from a colored background wall and over-exposure by a camera strobe), and if the photo is artificia… ▽ More

    Submitted 19 May, 2021; originally announced May 2021.

    Comments: 12 pages, 16 figures