OMRON SINIC X Corporation (HQ: Bunkyo-ku, Tokyo; President and CEO: Masaki Suwa, hereinafter "OSX") will present the latest research findings at the 2025 IEEE/CVF International Conference on Computer Vision (hereinafter "ICCV 2025").
ICCV 2025 is one of the the premier biennial international conferences in the field of computer縲vision. The conference will be held from October 19 to October 23, 2025, in Honolulu, Hawai'i, (local time). This year, 2,701 out of 11,239 submissions were accepted, resulting in an acceptance rate of approximately 24%.
The research paper to be presented by OSX has been selected as a Highlight1) in recognition of its exceptional quality and potential impact. The following provides an overview of the paper.
1) In 2025, out of 2,701 accepted papers, 263 (approximately 9.7%) were selected as Highlight.
Authors | Kuniaki Saito (OSX), Donghyun Kim (Korea University), Kwanyong Park (University of Seoul), Atsushi Hashimoto (OSX), Yoshitaka Ushiku (OSX) |
Research Introduction | CaptionSmiths is a controllable image captioning framework that allows smooth adjustment of caption properties such as length, descriptiveness, and word uniqueness--within a single model. Unlike existing models, which lack explicit conditioning and struggle with smooth transitions between styles, CaptionSmiths quantifies these properties as continuous scalar values and interpolates between learned endpoint representations (e.g., very short 竊 very long).This enables fine-grained control over caption styles. Experiments show that CaptionSmiths not only improves lexical alignment, but also reduces caption length control error by over 500% compared to strong baselines. |
Related Links | https://arxiv.org/abs/2507.01409 https://ksaito-ut.github.io/captionsmiths_web/ |
窶サAuthor information is current as of the date of writing or submission. Please be advised that the information may become outdated after that point.
縲
OMRON SINIC X Corporation is a strategic subsidiary seeking to realize the "near-future design" that OMRON forecasts. It is comprised of researchers with cutting-edge knowledge and experience across a wide range of technological domains, including AI, Robotics, IoT, and Sensing. With the aim of addressing social issues, OSX integrates innovative technologies with business models and strategies in technology and IP to create near-future design. Additionally, the company accelerates the creation of these designs through collaborative research with universities and external research institutions.
笳Website: https://www.omron.com/sinicx/en/
笳Activities: https://www.omron.com/sinicx/en/activity/
For any inquiries about OSX, please contact us here.