
FuLLaMa: Training-free Diffusion-based Object Removal with Context Preservation Poster Session 6 + Refreshments
Ilke Demir ⋅ Umur Ciftci
Tucson Ballroom & Prefunction Space 129
FLARES: Fast and Accurate LiDAR Multi-Range Semantic Segmentation Poster Session 3
Bin Yang ⋅ Alexandru Condurache
Tucson Ballroom & Prefunction Space 51
DocWaveDiff: A Predict-and-Refine approach for Document Image Enhancement with Wavelet U-Nets and Diffusion models Poster Session 6 + Refreshments
Matteo Marulli ⋅ Marco Bertini
Tucson Ballroom & Prefunction Space 124
Photo Dating by Facial Age Aggregation Poster Session 6 + Refreshments
Jakub Paplham ⋅ Vojtech Franc
Tucson Ballroom & Prefunction Space 86
MergeSlide: Continual Model Merging and Task-to-Class Prompt-Aligned Inference for Lifelong Learning on Whole Slide Images Poster Session 4 + Reception
Bui Cao Doanh ⋅ Ba Ngo ⋅ Pham Luan ⋅ Khang Nguyen ⋅ Mai Nguyen ⋅ Yasuhiko Nakashima
Tucson Ballroom & Prefunction Space 55
MorphXAI: An Explainable Framework for Morphological Analysis of Parasites in Blood Smear Images Poster Session 2 + Refreshments
Aqsa Yousaf ⋅ Sint Sint Win ⋅ Megan Coffee ⋅ Habeeb Olufowobi
Tucson Ballroom & Prefunction Space 68
Trajectory Tactics: When Transformers Learn Exploration to Generate Online Signature Poster Session 2 + Refreshments
Anurag Pandey ⋅ Aditya Nigam ⋅ Arnav Bhavsar ⋅ Ashutosh Sharma ⋅ Basu Verma ⋅ Divya Acharya ⋅ Mohd Amir
Tucson Ballroom & Prefunction Space 85
SCAdapter: Content-Style Disentanglement for Diffusion Style Transfer Poster Session 6 + Refreshments
Luan Thanh Trinh
Tucson Ballroom & Prefunction Space 11
Generalized Category Discovery for LiDAR Semantic Segmentation Poster Session 6 + Refreshments
Minseok Kim ⋅ Jiyong Boo ⋅ Kuk-Jin Yoon
Tucson Ballroom & Prefunction Space 115
ART: Actor-Related Tubelet for Detecting Complex-shaped Action Tubes Poster Session 1
Jiaojiao Zhao
Tucson Ballroom & Prefunction Space 30
MBTI: Metric-Based Textual Inversion for Fine-Grained Image Generation Poster Session 1
ByungKwan Chae ⋅ Youngjae Choi ⋅ Heewon Kim
Tucson Ballroom & Prefunction Space 106
VRAgent: Self-Refining Agent for Zero-Shot Multimodal Video Retrieval Poster Session 6 + Refreshments
Ketul Shah ⋅ Pankaj Nathani ⋅ Rama Chellappa ⋅ Fabian Caba Heilbron
Tucson Ballroom & Prefunction Space 91
FocalComm: Hard Instance-Aware Multi-Agent Perception Poster Session 5
Dereje Shenkut ⋅ Vijayakumar Bhagavatula
Tucson Ballroom & Prefunction Space 46
CommonForms: A Large, Diverse Dataset for Form Field Detection Poster Session 1
Joe Barrow
Tucson Ballroom & Prefunction Space 112
MuseDance: A Diffusion-based Music-Driven Image Animation System Poster Session 3
Zhikang Dong ⋅ Weituo Hao ⋅ Ju-Chiang Wang ⋅ Peng Zhang ⋅ Pawel Polak
Tucson Ballroom & Prefunction Space 86
ZebraPose: Zebra Detection and Pose Estimation using only Synthetic Data Poster Session 5
Elia Bonetto ⋅ Aamir Ahmad
Tucson Ballroom & Prefunction Space 78
SmoothDiffusion-VE: Real-time Generative Video Editing Using Adaptive Feature Cache Poster Session 6 + Refreshments
Mustafa Munir ⋅ Sophia Zalewski ⋅ Shiqiu Liu ⋅ David Tarjan ⋅ Sushmitha Belede ⋅ Anjul Patney ⋅ Radu Marculescu
Tucson Ballroom & Prefunction Space 120
An improved architecture for part-based animal re-identification through semantic segmentation distillation Poster Session 4 + Reception
Eugênio Dias Ribeiro Neto ⋅ Marc Chaumont ⋅ Gérard Subsol ⋅ Michel Garine-Wichatitsky ⋅ Hélène Guis
Tucson Ballroom & Prefunction Space 95
Towards High-Fidelity, Identity-Preserving Real-Time Makeup Transfer: Decoupling Style Generation Poster Session 3
Kin Chau Lydia Chau ⋅ Zhi Yu ⋅ Ruowei Jiang
Tucson Ballroom & Prefunction Space 64
MMCM: Multimodality-aware Metric using Clustering-based Modes for Probabilistic Human Motion Prediction Poster Session 2 + Refreshments
Kyotaro Tokoro ⋅ Hiromu Taketsugu ⋅ Norimichi Ukita
Tucson Ballroom & Prefunction Space 116
FARF-Net: Frequency-guided Adaptive Receptive Field Network for Edge-enhanced Polyp Segmentation Poster Session 2 + Refreshments
Xue Li ⋅ Aiwen Jiang ⋅ Hongqian Yu ⋅ Xiao Yang
Tucson Ballroom & Prefunction Space 88
VOCAL: Visual Odometry via ContrAstive Learning Poster Session 3
Chi-Yao Huang ⋅ Zeel Bhatt ⋅ “YZ” Yezhou Yang
Tucson Ballroom & Prefunction Space 36
Deep Image Decomposition for Medical Imaging Anonymization and Curation Poster Session 6 + Refreshments
Yael Elkin ⋅ Gal Arie ⋅ Tammy Raviv Raviv
Tucson Ballroom & Prefunction Space 3
Fast Vision Mamba: Pooling Spatial Dimensions for Accelerated Processing Poster Session 3
Saarthak Kapse ⋅ Robin Betz ⋅ Srinivasan Sivanandan
Tucson Ballroom & Prefunction Space 1
CoreCaption: Core Caption based Text-to-Video Retrieval Poster Session 5
Junkyu Jang
Tucson Ballroom & Prefunction Space 77
Subspace-Guided Knowledge Distillation for Efficient Model Transfer Poster Session 4 + Reception
Zeeshan Hayder ⋅ Ali Cheraghian ⋅ Lars Petersson ⋅ Mehrtash Harandi
Tucson Ballroom & Prefunction Space 74
AGENet: Adaptive Edge-aware Geodesic Distance Learning for Few-Shot Medical Image Segmentation Poster Session 3
ZIYUAN GAO
Tucson Ballroom & Prefunction Space 131
PerVL-Bench: Benchmarking Multimodal Personalization for Large Vision–Language Models Poster Session 5
Minsung Kim
Tucson Ballroom & Prefunction Space 85
Histopath-C: Towards Realistic Domain Shifts for Histopathology Vision-Language Adaptation Poster Session 4 + Reception
Mehrdad Noori ⋅ Gustavo Vargas Hakim ⋅ David OSOWIECHI ⋅ Fereshteh Shakeri ⋅ Ali Bahri ⋅ Moslem Yazdanpanah ⋅ Sahar Dastani ⋅ Ismail Ayed ⋅ Christian Desrosiers
Tucson Ballroom & Prefunction Space 58
Training-Free Few-Shot Segmentation via Vision-Language Guided Prompting Poster Session 5
Euihyun Yoon ⋅ Taejin Park ⋅ Jaekoo Lee
Tucson Ballroom & Prefunction Space 69
SimForce: Force and Surface Electromyography from Full Body Video with Graph Neural Nets Poster Session 3
Esha Dasgupta ⋅ Boeun Kim ⋅ Sang-Hoon Yeo ⋅ Hyung Jin Chang
Tucson Ballroom & Prefunction Space 38
Virtually Unrolling the Herculaneum Papyri by Diffeomorphic Spiral Fitting Poster Session 5
Paul Henderson
Tucson Ballroom & Prefunction Space 58
Adversarial Pseudo-replay for Exemplar-free Class-incremental Learning Poster Session 6 + Refreshments
Hiroto Honda
Tucson Ballroom & Prefunction Space 28
SAFER-AiD: Saccade-Assisted Foveal-peripheral vision Enhanced Reconstruction for Adversarial Defense Poster Session 2 + Refreshments
Jiayang Liu ⋅ Daniel Tso ⋅ Yiming Bu ⋅ Qinru Qiu
Tucson Ballroom & Prefunction Space 30
Towards Unconstrained Cross-View Pose Estimation Poster Session 6 + Refreshments
Alexander Wollam ⋅ Kyle Ashley ⋅ Maxim Shugaev ⋅ Oliver Arend ⋅ Ilya Semenov ⋅ Hadis Dashtestani ⋅ Sumved Ravi ⋅ Nathan Jacobs
Tucson Ballroom & Prefunction Space 118
PromptGAR: Flexible Promptive Group Activity Recognition Poster Session 4 + Reception
Zhangyu Jin ⋅ Andrew Feng ⋅ Ankur Chemburkar ⋅ Celso de Melo
Tucson Ballroom & Prefunction Space 17
Delta-LLaVA: Base-then-Specialize Alignment for Token-Efficient Vision-Language Models Poster Session 3
Mohamad Zamini ⋅ Diksha Shukla
Tucson Ballroom & Prefunction Space 70
Spec-Gloss Surfels and Normal–Diffuse Priors for Relightable Glossy Objects Poster Session 4 + Reception
Georgios Kouros ⋅ Minye Wu ⋅ Tinne Tuytelaars
Tucson Ballroom & Prefunction Space 13
Enhancing Reverse Distillation with Core Exemplar Learning for Unified Multi-Class Anomaly Detection Poster Session 6 + Refreshments
Heechul Lim ⋅ Min-Soo Kim ⋅ Hyun-Boo Lee ⋅ Suk-Ju Kang ⋅ Kang-Wook Chon ⋅ Haeyun Lee
Tucson Ballroom & Prefunction Space 37
Human knowledge integrated multi-modal learning for single source domain generalization Poster Session 2 + Refreshments
Ayan Banerjee ⋅ Kuntal Thakur ⋅ Sandeep Gupta
Tucson Ballroom & Prefunction Space 92
OpenCowID: Zero-Shot Visual Identification of Dairy Cows Poster Session 2 + Refreshments
Omkar Prabhune ⋅ Younghyun Kim
Tucson Ballroom & Prefunction Space 8
PaRaChute: Pathology-Radiology Cross-Modal Fusion for Missing-Modality-Robust Survival Prediction Poster Session 1
Pietro Caforio ⋅ Isabella Poles ⋅ Marco Santambrogio
Tucson Ballroom & Prefunction Space 69
3DSceneEditor: Controllable 3D Scene Editing with Gaussian Splatting Poster Session 2 + Refreshments
Ziyang Yan ⋅ Yihua Shao ⋅ Minwen Liao ⋅ Siyu Chen ⋅ Nan Wang ⋅ Muyuan Lin ⋅ Jenq-Neng Hwang ⋅ Hao Zhao ⋅ Fabio Remondino ⋅ Lei Li
Tucson Ballroom & Prefunction Space 42
Enhancing Visual Planning with Auxiliary Tasks and Multi-token Prediction Poster Session 3
Ce Zhang ⋅ Yale Song ⋅ Ruta Desai ⋅ Michael Iuzzolino ⋅ Joseph Tighe ⋅ Gedas Bertasius ⋅ Satwik Kottur
Tucson Ballroom & Prefunction Space 122
Alignment and Distillation: A Robust Framework for Multimodal Domain Generalizable Human Action Recognition Poster Session 5
Hyeonbin Ji ⋅ Juyeob Lee ⋅ Eunil Park
Tucson Ballroom & Prefunction Space 106
BAFLE-DCT: Bypassing Adversarial Filters via Frequency-Selective Embedding in the DCT Domain Poster Session 5
Balapuwaduge Mendis ⋅ Farah Kandah ⋅ Sathya Aakur
Tucson Ballroom & Prefunction Space 16
Grounding Descriptions in Images informs Zero-Shot Visual Recognition Poster Session 4 + Reception
Shaunak Halbe ⋅ Junjiao Tian ⋅ Joseph J ⋅ James Smith ⋅ Katherine Stevo ⋅ Vineeth Balasubramanian ⋅ Zsolt Kira
Tucson Ballroom & Prefunction Space 133
A Universal Self-Attention Enhancement for Bridging Low-bit Quantization and Vision Transformers Poster Session 1
Jiahe Qian ⋅ Peisong Wang ⋅ Zhengyang Zhuge ⋅ Qinghao Hu ⋅ Jian Cheng
Tucson Ballroom & Prefunction Space 35
Joint Optimization of Camera Model and Deep Neural Network for Image Recognition Poster Session 6 + Refreshments
Youta Noboru ⋅ Yuko Ozasa ⋅ Masayuki Tanaka
Tucson Ballroom & Prefunction Space 41
Low-Rank Expert Merging for Multi-Source Domain Adaptation in Person Re-Identification Poster Session 2 + Refreshments
Taha Mustapha Nehdi ⋅ Nairouz Mrabah ⋅ ATIF BELAL ⋅ Marco Pedersoli ⋅ Eric Granger
Tucson Ballroom & Prefunction Space 38
SasMamba: A Lightweight Structure-Aware Stride State Space Model for 3D Human Pose Estimation Poster Session 2 + Refreshments
Hu Cui ⋅ Wenqiang Hua ⋅ Renjing Huang ⋅ ShuRui Jia ⋅ Tessai Hayama
Tucson Ballroom & Prefunction Space 124
The Perceptual Observatory Characterizing Robustness and Grounding in MLLMs Poster Session 2 + Refreshments
Tejas Anvekar ⋅ Fenil Bardoliya ⋅ Pavan Turaga ⋅ Chitta Baral ⋅ Vivek Gupta
Tucson Ballroom & Prefunction Space 23
Procedure Learning via Regularized Gromov-Wasserstein Optimal Transport Poster Session 5
Syed Mahmood ⋅ Ali Ali ⋅ Umer Ahmed ⋅ Fawad Fateh ⋅ Zeeshan Zia ⋅ Quoc-Huy Tran
Tucson Ballroom & Prefunction Space 107
Optimal Transport for Rectified Flow Image Editing: Unifying Inversion-Based and Direct Methods Poster Session 5
Marian Lupaşcu ⋅ Mihai-Sorin Stupariu
Tucson Ballroom & Prefunction Space 92
PRISM-CAFO: Prior-conditioned Remote-sensing Infrastructure Segmentation and Mapping for CAFOs Poster Session 2 + Refreshments
Oishee Bintey Hoque ⋅ Nibir Mandal ⋅ Kyle Luong ⋅ Mandy Wilson ⋅ Samarth Swarup ⋅ Madhav Marathe ⋅ Abhijin Adiga
Tucson Ballroom & Prefunction Space 64
Diffusion Noise Optimization for Synthetic VLM Training Poster Session 5
Ren Ohkubo ⋅ Rintaro Yanagi ⋅ Hirokatsu Kataoka ⋅ Yutaka Satoh
Tucson Ballroom & Prefunction Space 59
Federated Model Synchronization for Diagnostic Redefinition through a Novel Selective Parameter Unlearning Poster Session 1
Mayank Kundalwal Kundalwal ⋅ Mamta Mamta ⋅ Deepak Mishra ⋅ Asif Ekbal
Tucson Ballroom & Prefunction Space 134
MapleGrasp: Mask-guided Feature Pooling for Language-driven Efficient Robotic Grasping Poster Session 6 + Refreshments
Vineet Bhat ⋅ Naman Patel ⋅ Prashanth Krishnamurthy ⋅ Ramesh Karri ⋅ Farshad Khorrami
Tucson Ballroom & Prefunction Space 34
Multi-view stereo with multiple projectors for oneshot entire shape scan based on Neural SDF and DSSS demultiplexing Poster Session 4 + Reception
Kota Nishihara ⋅ Ryo Furukawa ⋅ Ryusuke Sagawa ⋅ Hiroshi Kawasaki
Tucson Ballroom & Prefunction Space 115
Interaction-via-Actions: Cattle Interaction Detection with Joint Learning of Action-Interaction Latent Space Poster Session 2 + Refreshments
Ren Nakagawa ⋅ Yang Yang ⋅ Risa Shinoda ⋅ Hiroaki Santo ⋅ Kenji Oyama ⋅ Fumio Okura ⋅ Takenao Ohkawa
Tucson Ballroom & Prefunction Space 54
1LoRA: Summation Compression for Very-Low Rank Adaptation Poster Session 2 + Refreshments
Alessio Quercia ⋅ Zhuo Cao ⋅ Arya Bangun ⋅ Richard Paul ⋅ Abigail Morrison ⋅ Ira Assent ⋅ Hanno Scharr
Tucson Ballroom & Prefunction Space 80
SeqFeedNet: Sequential Feature Feedback Network for Background Subtraction Poster Session 6 + Refreshments
Yu-Shun Huang ⋅ Yi-Xiang Yang
Tucson Ballroom & Prefunction Space 95
Understanding the Visual Projection Space of Multimodal LLMs Poster Session 5
SungHeon Jeong ⋅ Yoojeong Song
Tucson Ballroom & Prefunction Space 24
Real-Time Tracking of Flexible Markers in Low-Contrast Fluoroscopy Using a Deep Neural Network Trained Solely on Synthetic Data Poster Session 2 + Refreshments
Tomoki Uchiyama ⋅ Yukinobu Sakata ⋅ Ryusuke Hirai ⋅ Hitoshi Ishikawa ⋅ Shinichiro Mori
Tucson Ballroom & Prefunction Space 119
DRWKV: Focusing on Object Edges for Low-Light Image Enhancement Poster Session 2 + Refreshments
Xuecheng Bai ⋅ Yuxiang Wang ⋅ Boyu Hu ⋅ Qinyuan Jie ⋅ Chuanzhi Xu ⋅ Kechen Li ⋅ Hongru Xiao ⋅ Yuk Chung
Tucson Ballroom & Prefunction Space 14
A Multi-Agent Diffusion Approach for MRI Anomaly Segmentation via Modality-Specific LoRA Specialization Poster Session 1
Wafa Ghallabi ⋅ Muhammad Zaigham Zaheer ⋅ Ritesh Thawkar ⋅ Omkar Thawakar ⋅ Salman Khan ⋅ Fahad Khan
Tucson Ballroom & Prefunction Space 13
Event-based Graph Representation with Spatial and Motion Vectors for Asynchronous Object Detection Poster Session 3
Aayush Verma ⋅ Arpitsinh Vaghela ⋅ Bharatesh Chakravarthi ⋅ Kaustav Chanda ⋅ “YZ” Yezhou Yang
Tucson Ballroom & Prefunction Space 83
OSEG: Improving Diffusion sampling through Orthogonal Smoothed Energy Guidance Poster Session 5
Masud Fahim ⋅ Nazmus Saqib ⋅ JOON-MIN GIL
Tucson Ballroom & Prefunction Space 19
SGPMIL: Sparse Gaussian Process Multiple Instance Learning Poster Session 1
Andreas Lolos ⋅ Stergios Christodoulidis ⋅ Aris Moustakas ⋅ Jose Dolz ⋅ Maria Vakalopoulou
Tucson Ballroom & Prefunction Space 49
CAST: Evaluating Multi-Object Trackers with Context-Aware Switch and Transfer Scores Poster Session 6 + Refreshments
Jin Bai ⋅ Gregory Hager
Tucson Ballroom & Prefunction Space 6
M-ErasureBench: A Comprehensive Multimodal Evaluation Benchmark for Concept Erasure in Diffusion Models Poster Session 1
Ju-Hsuan Weng ⋅ Jia-Wei Liao ⋅ Cheng-Fu Chou ⋅ Jun-Cheng Chen
Tucson Ballroom & Prefunction Space 51
Unified Control for Inference-Time Guidance of Denoising Diffusion Models Poster Session 4 + Reception
Maurya Goyal ⋅ Anuj Singh ⋅ Hadi Rad
Tucson Ballroom & Prefunction Space 110
EVTP-IVS: Effective Visual Token Pruning For Unifying Instruction Visual Segmentation In Multi-Modal Large Language Models Poster Session 5
Wenhui Zhu ⋅ Xiwen Chen ⋅ Zhipeng Wang ⋅ Shao Tang ⋅ Sayan Ghosh ⋅ XUANZHAO DONG ⋅ Rajat Koner ⋅ Yalin Wang
Tucson Ballroom & Prefunction Space 129
SceneEval: Evaluating Semantic Coherence in Text-Conditioned 3D Indoor Scene Synthesis Poster Session 6 + Refreshments
Hou In Ivan Tam ⋅ Hou In Derek Pun ⋅ Austin Wang ⋅ Angel Chang ⋅ Manolis Savva
Tucson Ballroom & Prefunction Space 15
InteracTalker: Prompt-Based Human-Object Interaction with Co-Speech Gesture Generation Poster Session 2 + Refreshments
Sreehari Rajan ⋅ Kunal Bhosikar ⋅ Charu Sharma
Tucson Ballroom & Prefunction Space 3
BrandFusion: Aligning Image Generation with Brand Styles Poster Session 2 + Refreshments
Parul Gupta ⋅ Varun Khurana ⋅ Yaman Singla ⋅ Balaji Krishnamurthy ⋅ Abhinav Dhall
Tucson Ballroom & Prefunction Space 86
From Cognitive Priors to Instance Semantics: A Unified Framework for Multi-task Affective Computing Poster Session 6 + Refreshments
Guanyu Hu ⋅ Dimitrios Kollias ⋅ Xinyu Yang
Tucson Ballroom & Prefunction Space 128
CalibBEV: LiDAR-Camera Calibration via BEV Alignment Poster Session 4 + Reception
Filippo D'Addeo ⋅ Lorenzo Cipelli ⋅ Adriano Cardace ⋅ Emanuele Ghelfi ⋅ Andrea Zinelli ⋅ Massimo Bertozzi
Tucson Ballroom & Prefunction Space 6
ITSELF: Attention Guided Fine-Grained Alignment for Vision–Language Retrieval Poster Session 2 + Refreshments
TIEN-HUY NGUYEN ⋅ Huu-Loc Tran ⋅ Thanh Ngo
Tucson Ballroom & Prefunction Space 4
Detecting Out-of-Distribution Objects through Class-Conditioned Inpainting Poster Session 2 + Refreshments
Quang-Huy Nguyen ⋅ Jin Peng Zhou ⋅ Zhenzhen Liu ⋅ Khanh-Huyen Bui ⋅ Kilian Weinberger ⋅ Wei-Lun Chao ⋅ Dung Le
Tucson Ballroom & Prefunction Space 50
Image-Guided Semantic Pseudo-LiDAR Point Generation for 3D Object Detection Poster Session 5
MINSEUNG LEE ⋅ Seokha Moon ⋅ Seung Lee ⋅ Reza Mahjourian ⋅ Jinkyu Kim
Tucson Ballroom & Prefunction Space 127
Structured Context Learning for Generic Event Boundary Detection Poster Session 4 + Reception
Xin Gu ⋅ Congcong Li ⋅ Xinyao Wang ⋅ Dexiang Hong ⋅ Libo Zhang ⋅ Tiejian Luo ⋅ Longyin Wen ⋅ Heng Fan
Tucson Ballroom & Prefunction Space 50
MooTrack360: A Novel Fisheye Camera Dataset for Robust Multi Dairy Cow Detection and Tracking Poster Session 1
Rasmus Christiansen ⋅ Toan Nguyen ⋅ Lasse Malskær ⋅ Leon Bodenhagen ⋅ Dirk Kraft
Tucson Ballroom & Prefunction Space 44
Saliency-Guided DETR for Moment Retrieval and Highlight Detection Poster Session 1
Aleksandr Gordeev ⋅ Vladimir Dokholyan ⋅ Irina Tolstykh ⋅ Maksim Kuprashevich
Tucson Ballroom & Prefunction Space 87
Gaussian Representations for Video Poster Session 1
Sachin Shah ⋅ Anustup Choudhury ⋅ Guan-Ming Su ⋅ Jaclyn Pytlarz ⋅ Christopher Metzler ⋅ Trisha Mittal
Tucson Ballroom & Prefunction Space 79
SVD-Det: A Lightweight Framework for Video Forgery Detection Using Semantic and Visual Defect Cues Poster Session 6 + Refreshments
Tsung-Shan Yang ⋅ Tianyu Zhang ⋅ Feng Qian ⋅ Bing Yan ⋅ Chung Chieh Kuo
Tucson Ballroom & Prefunction Space 40
Semi-supervised Domain Adaptation via Mutual Alignment through Joint Error Poster Session 4 + Reception
Dexuan Zhang ⋅ Thomas Westfechtel ⋅ Tatsuya Harada
Tucson Ballroom & Prefunction Space 109
Lose Your Self (LoYS): an adversarial entropy-based unsupervised approach for model debiasing Poster Session 4 + Reception
Vito Paolo Pastore ⋅ Massimiliano Ciranni ⋅ Vittorio Murino
Tucson Ballroom & Prefunction Space 137
Learning Mask-Aware Offsets: Two-branch Deformable Attention Networks for Inpainting with Masked Region Avoidance Poster Session 1
Hyeongseok Oh ⋅ Joonki Paik
Tucson Ballroom & Prefunction Space 98
TiCLS: Tightly Coupled Language Text Spotter Poster Session 3
Leeje Jang ⋅ Yijun Lin ⋅ Yao-Yi Chiang ⋅ Jerod Weinman
Tucson Ballroom & Prefunction Space 78
EmojiDiff: Advanced Facial Expression Control with High Identity Preservation in Portrait Generation Poster Session 1
Liangwei Jiang ⋅ Ruida Li ⋅ Zhifeng Zhang ⋅ Shuo Fang ⋅ Chenguang Ma
Tucson Ballroom & Prefunction Space 32
Towards Reliable Test-Time Adaptation: Style Invariance as a Correctness Likelihood Poster Session 3
Gilhyun Nam ⋅ Taewon Kim ⋅ Joonhyun Jeong ⋅ Eunho Yang
Tucson Ballroom & Prefunction Space 16
4D Multimodal Co-attention Fusion Network with Latent Contrastive Alignment for Alzheimer's Diagnosis Poster Session 4 + Reception
YUXIANG WEI ⋅ Yanteng Zhang ⋅ Xi Xiao ⋅ Tianyang Wang ⋅ Xiao Wang ⋅ Vince Calhoun
Tucson Ballroom & Prefunction Space 112
DNA: Dual-branch Network with Adaptation for Open-Set Online Handwriting Generation Poster Session 3
Tsai-Ling Huang ⋅ Nhat-Tuong Do-Tran ⋅ Ngoc-Hoang-Lam Le ⋅ Hong-Han Shuai ⋅ Ching-Chun Huang
Tucson Ballroom & Prefunction Space 120
Advancing Multimodal LLMs by Large-Scale 3D Visual Instruction Dataset Generation Poster Session 5
Liu He ⋅ Xiao Zeng ⋅ Yizhi Song ⋅ Albert Chen ⋅ Lu Xia ⋅ Shashwat Verma ⋅ Sankalp Dayal ⋅ Min Sun ⋅ Cheng-Hao Kuo ⋅ Daniel Aliaga
Tucson Ballroom & Prefunction Space 9
Towards Photorealistic Style Transfer with Multimodal Guidance and Robustness to Content Images in Arbitrary Styles Poster Session 4 + Reception
Ruikai Zhou ⋅ Yating Liu ⋅ Yi Xu
Tucson Ballroom & Prefunction Space 35
Optimizing against Infeasible Inclusions from Data for Semantic Segmentation through Morphology Poster Session 6 + Refreshments
Shamik Basu ⋅ Luc Van Gool ⋅ Christos Sakaridis
Tucson Ballroom & Prefunction Space 31
Flood-LDM: Generalizable Latent Diffusion Models for rapid and accurate zero-shot High-Resolution Flood Mapping Poster Session 6 + Refreshments
Sun Han Neo ⋅ Sachith Seneviratne ⋅ Herath Mudiyanselage Viraj Vidura Herath ⋅ Abhishek Saha ⋅ Sanka Rasnayaka ⋅ Lucy Marshall
Tucson Ballroom & Prefunction Space 82
UniGaze: Towards Universal Gaze Estimation via Large-scale Pre-Training Poster Session 5
Jiawei Qin ⋅ Xucong Zhang ⋅ Yusuke Sugano
Tucson Ballroom & Prefunction Space 2
ODEt(ODEl): Shortcutting the Time and the Length in Diffusion and Flow Models for Faster Sampling Poster Session 5
Denis Gudovskiy ⋅ Wenzhao Zheng ⋅ Tomoyuki Okuno ⋅ Yohei Nakata ⋅ Kurt Keutzer
Tucson Ballroom & Prefunction Space 30
JOCA: Task-Driven Joint Optimisation of Camera Hardware and Adaptive Camera Control Algorithms Poster Session 3
Chengyang Yan ⋅ Mitch Bryson ⋅ Donald Dansereau
Tucson Ballroom & Prefunction Space 97
BiPO: Bidirectional Partial Occlusion Network for Text-to-Motion Synthesis Poster Session 1
Seong-Eun Hong ⋅ SooBin Lim ⋅ JuYeong Hwang ⋅ Minwook Chang ⋅ Hyeongyeop Kang
Tucson Ballroom & Prefunction Space 4
PHYSPLAT: a Framework for Photorealistic Hybrid Simulation of Real and Synthetic Elements using 3D Gaussian Splatting Poster Session 2 + Refreshments
Mario Alfonso-Arsuaga ⋅ Henar Dominguez-Elvira ⋅ Jorge Guerrero ⋅ Andrea Castiella-Aguirrezabala ⋅ Lorenzo Domínguez ⋅ Jorge García-González ⋅ Maria Naranjo-Almeida ⋅ Marc Comino-Trinidad ⋅ Jorge Lopez-Moreno
Tucson Ballroom & Prefunction Space 20
Uplifting Table Tennis: A Robust, Real-World Application for 3D Trajectory and Spin Estimation Poster Session 6 + Refreshments
Daniel Kienzle ⋅ Katja Ludwig ⋅ Julian Lorenz ⋅ Shin'ichi Satoh ⋅ Rainer Lienhart
Tucson Ballroom & Prefunction Space 23
Autocorrelation-Based Fiducial Markers for Traceability Poster Session 1
BENCHEIKH ISMAIL ⋅ Max Dunitz ⋅ Marie d'Autume ⋅ Marc Pic ⋅ Enric Meinhardt-Llopis ⋅ Gabriele Facciolo ⋅ Pablo Musé
Tucson Ballroom & Prefunction Space 129
QC-SF: Improving Computer Vision for Airborne LiDAR Point Clouds of Boreal Forests with Quebec Simulated Forest Dataset Poster Session 4 + Reception
Olivier Stocker ⋅ Reza Mahmoudi Kouhi ⋅ Omid Reisi Gahrouei ⋅ Thierry Badard ⋅ Eric Guilbert
Tucson Ballroom & Prefunction Space 71
ControlEvents: Controllable Synthesis of Event Camera Data with Foundational Prior from Image Diffusion Models Poster Session 4 + Reception
Yixuan Hu ⋅ Yuxuan Xue ⋅ Simon Klenk ⋅ Daniel Cremers ⋅ Gerard Pons-Moll
Tucson Ballroom & Prefunction Space 117
SurfDist: Interpretable Three-Dimensional Instance Segmentation Using Curved Surface Patches Poster Session 4 + Reception
Jackson Borchardt ⋅ Saul Kato
Tucson Ballroom & Prefunction Space 120
ReBrain: Brain MRI Reconstruction from Sparse CT Slice via Retrieval-Augmented Diffusion Poster Session 3
Junming Liu ⋅ Yifei Sun ⋅ Weihua Cheng ⋅ Yujin Kang ⋅ Yirong Chen ⋅ Ding Wang ⋅ Guosun Zeng
Tucson Ballroom & Prefunction Space 104
MSRTrack: LLM-Powered Object Tracking with Motion and Semantic Reasoning Poster Session 1
Tong Shen ⋅ Di Wang ⋅ José Moura
Tucson Ballroom & Prefunction Space 80
CONCORD: Concept-Informed Diffusion for Dataset Distillation Poster Session 4 + Reception
Jianyang Gu ⋅ Haonan Wang ⋅ Ruoxi Jia ⋅ Saeed Vahidian ⋅ Vyacheslav Kungurtsev ⋅ Wei Jiang ⋅ Yiran Chen
Tucson Ballroom & Prefunction Space 93
Detection-Driven Object Count Optimization for Text-to-Image Diffusion Models Poster Session 2 + Refreshments
Oz Zafar ⋅ Yuval Cohen ⋅ Lior Wolf ⋅ Idan Schwartz
Tucson Ballroom & Prefunction Space 45
Accelerated Dose Generation in Gamma Knife Radiosurgery Using a Wavelet Diffusion Model for Sparse Representation Poster Session 1
Sangyoon Lee ⋅ Shubhendu Mishra ⋅ Yoichi Watanabe
Tucson Ballroom & Prefunction Space 88
A framework for real-time Surgical Phase Recognition with application to Robot-Assisted Partial Nephrectomy Poster Session 1
Marco Mezzina ⋅ Tom Vercauteren ⋅ Tinne Tuytelaars ⋅ Matthew Blaschko
Tucson Ballroom & Prefunction Space 24
4D-Animal: Freely Reconstructing Animatable 3D Animals from Videos Poster Session 1
Shanshan Zhong ⋅ Jiawei Peng ⋅ Zehan Zheng ⋅ Zhongzhan Huang ⋅ Wufei Ma ⋅ Guofeng Zhang ⋅ Qihao Liu ⋅ Alan Yuille ⋅ Jieneng Chen
Tucson Ballroom & Prefunction Space 58
A Novel Metric for Detecting Memorization in Generative Models for Brain MRI Synthesis Poster Session 3
Antonio Scardace ⋅ Lemuel Puglisi ⋅ Francesco Guarnera ⋅ Sebastiano Battiato ⋅ Daniele Ravi
Tucson Ballroom & Prefunction Space 91
VIZOR: Viewpoint-Invariant Zero-Shot Scene Graph Generation for 3D Scene Reasoning Poster Session 6 + Refreshments
Madhavaram Vivek Vardhan ⋅ Vartika Sengar ⋅ Arkadipta De ⋅ Charu Sharma
Tucson Ballroom & Prefunction Space 131
Fused Similarity Measure Based Alignment with Dual-Scale Adaptive Selection for Weakly Supervised Video Anomaly Detection Poster Session 3
Yuegao Lu ⋅ Hong-Jie Xing ⋅ Chun-Guo Li
Tucson Ballroom & Prefunction Space 26
Distilling Diversity and Control in Diffusion Models Poster Session 1
Rohit Gandikota ⋅ David Bau
Tucson Ballroom & Prefunction Space 125
Automated Pore Detection from In-Situ FDM 3D Printing Video: A Comparative Evaluation of Modern Segmentation Models Poster Session 4 + Reception
Abdullah Al Ahad Khan ⋅ Md Islam ⋅ Lin Li ⋅ Lai Jiang ⋅ Noushin Ghaffari
Tucson Ballroom & Prefunction Space 37
Better Safe Than Sorry? Overreaction Problem of Vision Language Models in Visual Emergency Recognition Poster Session 4 + Reception
Dasol Choi ⋅ Seunghyun Lee ⋅ Youngsook Song
Tucson Ballroom & Prefunction Space 42
FastHMR: Accelerating Human Mesh Recovery via Token and Layer Merging with Diffusion Decoding Poster Session 5
Soroush Mehraban ⋅ Andrea Iaboni ⋅ Babak Taati
Tucson Ballroom & Prefunction Space 89
Latent Uncertainty-Aware Multi-View SDF Scan Completion Poster Session 3
Faezeh Zakeri ⋅ Lukas Ruppert ⋅ Raphael Braun ⋅ Hendrik Lensch
Tucson Ballroom & Prefunction Space 61
SCALEX: Scalable Concept and Latent Exploration for Diffusion Models Poster Session 3
Emily Zhixuan Zeng ⋅ Yuhao Chen ⋅ Alexander Wong
Tucson Ballroom & Prefunction Space 67
Gen-AFFECT: Generation of Avatar Fine-grained Facial Expressions with Consistent identiTy Poster Session 3
Hao Yu ⋅ Rupayan Mallick ⋅ Margrit Betke ⋅ Sarah Bargal
Tucson Ballroom & Prefunction Space 93
UnderWater SLAM with Laser-light sectioning method using ST-GAT Poster Session 1
Heyang Gao ⋅ Kazuto Ichimaru ⋅ Takafumi Iwaguchi ⋅ Hiroshi Kawasaki
Tucson Ballroom & Prefunction Space 9
START: Spatial and Textual Learning for Chart Understanding Poster Session 6 + Refreshments
Zhuoming Liu ⋅ Xiaofeng Gao ⋅ Feiyang Niu ⋅ Qiaozi Gao ⋅ Liu Liu ⋅ Robinson Piramuthu
Tucson Ballroom & Prefunction Space 90
Co-STAR: Collaborative Curriculum Self-Training with Adaptive Regularization for Source-Free Video Domain Adaptation Poster Session 6 + Refreshments
Amirhossein Dadashzadeh ⋅ Parsa Esmati ⋅ Majid Mirmehdi
Tucson Ballroom & Prefunction Space 59
PDV: Prompt Directional Vectors for Zero-shot Composed Image Retrieval Poster Session 6 + Refreshments
Osman Tursun ⋅ Sinan Kalkan ⋅ Simon Denman ⋅ Clinton Fookes
Tucson Ballroom & Prefunction Space 51
How to Design and Train Your Implicit Neural Representation for Video Compression Poster Session 1
Matthew Gwilliam ⋅ Roy Zhang ⋅ Namitha Padmanabhan ⋅ Hongyang Du ⋅ Abhinav Shrivastava
Tucson Ballroom & Prefunction Space 70
Chain-of-Look Spatial Reasoning for Dense Surgical Instrument Counting Poster Session 6 + Refreshments
Rishikesh Bhyri ⋅ Brian Quaranto ⋅ Junsong Yuan ⋅ Peter Kim ⋅ Nan Xi
Tucson Ballroom & Prefunction Space 125
From Darkness to Detail: Frequency-Aware SSMs for Low-Light Vision Poster Session 5
Eashan Adhikarla ⋅ Kai Zhang ⋅ Gong Chen ⋅ John Nicholson ⋅ Brian Davison
Tucson Ballroom & Prefunction Space 110
Global Focal and Radial Distortion Averaging from Radial Fundamental Matrices for Robust Self-Calibration Poster Session 4 + Reception
Sergei Solonets ⋅ Daniil Sinitsyn ⋅ Daniel Cremers
Tucson Ballroom & Prefunction Space 47
Hymavi: A Hybrid Mamba-Attention Network in Multi-View Framework for Volumetric Medical Image Segmentation Poster Session 5
Sy Tran ⋅ Jin Kyu Gahm
Tucson Ballroom & Prefunction Space 20
OpenLVLM-MIA: A Controlled Benchmark Revealing the Limits of Membership Inference Attacks on Large Vision-Language Models Poster Session 2 + Refreshments
Miyamoto Ryoto ⋅ Xin Fan ⋅ Fuyuko Kido ⋅ Tsuneo Matsumoto ⋅ Hayato Yamana
Tucson Ballroom & Prefunction Space 120
Beyond Faces: A Multimodal Person Clustering for Unconstrained Environments Poster Session 4 + Reception
Sahngmin Yoo ⋅ Sangwon Lee ⋅ Seongin Jo
Tucson Ballroom & Prefunction Space 33
Fetal and Neonatal Cortical Surface Reconstruction with Anatomical Normal-guidance and Perceptual Enhancements Poster Session 6 + Refreshments
Jiyang Lee ⋅ Woori Bae ⋅ U-Geun Ji ⋅ Hanyeol Yang ⋅ Jong-Min Lee
Tucson Ballroom & Prefunction Space 53
Mitigating the Modality Gap: Few-Shot Out-of-Distribution Detection with Multi-modal Prototypes and Image Bias Estimation Poster Session 2 + Refreshments
Yimu Wang ⋅ Evelien Riddell ⋅ Adrian Chow ⋅ Sean Sedwards ⋅ Krzysztof Czarnecki
Tucson Ballroom & Prefunction Space 126
SPOC: Spatially-Progressing Object State Change Segmentation in Video Poster Session 3
Priyanka Mandikal ⋅ Tushar Nagarajan ⋅ Alex Stoken ⋅ Zihui Xue ⋅ Kristen Grauman
Tucson Ballroom & Prefunction Space 56
FAST-EQA: Efficient Embodied Question Answering with Global and Local Region Relevancy Poster Session 2 + Refreshments
Haochen Zhang ⋅ Nirav Savaliya ⋅ Faizan Siddiqui ⋅ Enna Sachdeva
Tucson Ballroom & Prefunction Space 24
Mobile-Oriented Video Diffusion: Enabling Text-to-Video Generation on Mobile Devices Without Retraining, Compression, or Pruning Poster Session 3
Bosung Kim ⋅ Kyuhwan Lee ⋅ Isu Jeong ⋅ Jungmin Cheon ⋅ Yeojin Lee ⋅ Seulki Lee
Tucson Ballroom & Prefunction Space 100
Understanding Generative AI Capabilities in Everyday Image Editing Tasks Poster Session 2 + Refreshments
Brandon Collins ⋅ Mohammad Reza Taesiri ⋅ Logan Bolton ⋅ Viet Lai ⋅ Franck Dernoncourt ⋅ Trung Bui ⋅ Anh Nguyen
Tucson Ballroom & Prefunction Space 78
TalkingPose: Efficient Face and Gesture Animation with Feedback-guided Diffusion Model Poster Session 3
Alireza Javanmardi ⋅ Pragati Jaiswal ⋅ Tewodros Habtegebrial ⋅ Christen Millerdurai ⋅ Shaoxiang Wang ⋅ Alain Pagani ⋅ Didier Stricker
Tucson Ballroom & Prefunction Space 17
Conversational Image Generation: Towards Multi-Round Personalized Generation with Multi-Modal Language Models Poster Session 6 + Refreshments
Haochen Zhang ⋅ Animesh Sinha ⋅ Felix Juefei-Xu ⋅ Haoyu Ma ⋅ Kunpeng Li ⋅ Zhipeng Fan ⋅ Xiaoliang Dai ⋅ Tingbo Hou ⋅ Peizhao Zhang ⋅ Zecheng He
Tucson Ballroom & Prefunction Space 102
UniCalib: Targetless LiDAR-camera Calibration via Probabilistic Flow on Unified Depth Representations Poster Session 2 + Refreshments
Shu Han ⋅ Xubo Zhu ⋅ Ji Wu ⋅ Ximeng Cai ⋅ Wen Yang ⋅ Huai Yu ⋅ Gui-Song Xia
Tucson Ballroom & Prefunction Space 47
DOODLE: Diffusion-based Out-of-Distribution Learning for Open-set LiDAR Semantic Segmentation Poster Session 2 + Refreshments
Changgyoon Oh ⋅ Hyeonseong Kim ⋅ Daehyun We ⋅ Jongoh Jeong ⋅ Yujeong Chae ⋅ Kuk-Jin Yoon
Tucson Ballroom & Prefunction Space 82
Logit-Adjusted Test-Time Adaptation under Partial Class Imbalance Poster Session 5
Thilina Weerasinghe ⋅ Ruwan Tennakoon ⋅ WeiQin Chuah ⋅ Alireza Bab-Hadiashar
Tucson Ballroom & Prefunction Space 17
Conditional Text-to-Image Generation with Reference Guidance Poster Session 2 + Refreshments
Taewook Kim ⋅ Ze Wang ⋅ Zhengyuan Yang ⋅ Jiang Wang ⋅ Lijuan Wang ⋅ Zicheng Liu ⋅ Qiang Qiu
Tucson Ballroom & Prefunction Space 139
From SAM to DINOv2: Towards Distilling Foundation Models to Lightweight Baselines for Generalized Polyp Segmentation Poster Session 2 + Refreshments
Shivanshu Agnihotri ⋅ Snehashis Majhi ⋅ Deepak Nayak ⋅ Debesh Jha
Tucson Ballroom & Prefunction Space 33
Leveraging Pretrained Representations for Cross-Modal Point Cloud Completion Poster Session 1
Kshitij Kale ⋅ Hrishikesh U ⋅ V Sreenidhe ⋅ Shylaja S
Tucson Ballroom & Prefunction Space 10
RPT-SR: Regional Prior attention Transformer for infrared image Super-Resolution Poster Session 4 + Reception
Youngwan Jin ⋅ Incheol Park ⋅ Yagiz Nalcakan ⋅ Hyeongjin Ju ⋅ Sang Yeo ⋅ Shiho Kim
Tucson Ballroom & Prefunction Space 86
CropAT: Leveraging Diffusion-Generated Target-Like Cropped Objects for Pseudo-Label Refinement in Domain-Adaptive Object Detection Poster Session 4 + Reception
Chen-Che Huang ⋅ Tzuhsuan Huang ⋅ Jun-Cheng Chen
Tucson Ballroom & Prefunction Space 32
ArchitectHead: Continuous Level of Detail Control for 3D Gaussian Head Avatars Poster Session 2 + Refreshments
Peizhi Yan ⋅ Rabab Ward ⋅ Qiang Tang ⋅ Shan Du
Tucson Ballroom & Prefunction Space 21
TimeRefine: Temporal Grounding with Time Refining Video LLM Poster Session 4 + Reception
Xizi Wang ⋅ Feng Cheng ⋅ Ziyang Wang ⋅ Huiyu Wang ⋅ Md Mohaiminul Islam ⋅ Lorenzo Torresani ⋅ Mohit Bansal ⋅ Gedas Bertasius ⋅ David Crandall
Tucson Ballroom & Prefunction Space 75
Reviving Unsupervised Optical Flow: Concept Reevaluation, Multi-Scale Advances and Full Open-Source Release Poster Session 2 + Refreshments
Azin Jahedi ⋅ Marc Rivinius ⋅ Noah Senn ⋅ Andres Bruhn
Tucson Ballroom & Prefunction Space 12
EllipssianNet: Image-guided Sampling of 2D Gaussians for Gaussian Splatting Poster Session 2 + Refreshments
MyoungGon Kim ⋅ JeongHyeon Ahn ⋅ Seohyeon Park ⋅ Hyemi Kim ⋅ Seunghyun Park ⋅ Jung Hwang ⋅ JungHyun Han
Tucson Ballroom & Prefunction Space 66
MaxInfo: A Training-Free Key-Frame Selection Method Using Maximum Volume for Enhanced Video Understanding Poster Session 5
Pengyi Li ⋅ Irina Abdullaeva ⋅ Alexander Gambashidze ⋅ Andrei Kuznetsov ⋅ Ivan Oseledets
Tucson Ballroom & Prefunction Space 133
Splatter Layout: Geometry-embedded 3D Reconstruction via Surface Unfolding Poster Session 6 + Refreshments
Bryan Heryanto ⋅ Tackgeun You ⋅ Chanwoo Kim ⋅ Hwasup Lim
Tucson Ballroom & Prefunction Space 49
Relevance-aware Multi-context Contrastive Decoding for Retrieval-augmented Visual Question Answering Poster Session 6 + Refreshments
Jongha Kim ⋅ Byungoh Ko ⋅ Jeehye Na ⋅ Jinsung Yoon ⋅ Hyunwoo Kim
Tucson Ballroom & Prefunction Space 132
Unsupervised Discovery of Long-Term Spatiotemporal Periodic Workflows in Human Activities Poster Session 5
Fan Yang ⋅ Quanting Xie ⋅ Atsunori Moteki ⋅ Shoichi Masui ⋅ Shan Jiang ⋅ Kanji Uchino ⋅ Yonatan Bisk ⋅ Graham Neubig
Tucson Ballroom & Prefunction Space 3
Ordinal-Aware Multimodal Engagement Recognition for Collaborative Learning Poster Session 2 + Refreshments
Nha Tran ⋅ Dat Ly ⋅ Phi Ta ⋅ Hung Nguyen ⋅ Hien Nguyen
Tucson Ballroom & Prefunction Space 96
MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities Poster Session 6 + Refreshments
Tooba Tehreem Sheikh ⋅ Jean Lahoud ⋅ Rao Anwer ⋅ Fahad Khan ⋅ Salman Khan ⋅ Hisham Cholakkal
Tucson Ballroom & Prefunction Space 135
Dragonite: Single-Step Drag-based Image Editing with Geometric-Semantic Guidance Poster Session 3
Meng-Ting Jhong ⋅ Tai-Ming Huang ⋅ Shang-Fu Chen ⋅ Wen-Huang Cheng ⋅ Kailung Hua
Tucson Ballroom & Prefunction Space 13
Action Anticipation at a Glimpse: To What Extent Can Multimodal Cues Replace Video? Poster Session 1
Manuel Benavent-Lledo ⋅ Konstantinos Bacharidis ⋅ Victoria Manousaki ⋅ Konstantinos Papoutsakis ⋅ Antonis Argyros ⋅ José García-Rodríguez
Tucson Ballroom & Prefunction Space 27
NERVE: Neighbourhood & Entropy-Guided Random-Walk for Training Free Open-Vocabulary Segmentation Poster Session 3
Kunal Mahatha ⋅ Jose Dolz ⋅ Christian Desrosiers
Tucson Ballroom & Prefunction Space 31
2S-CEDiff: A Two-Stage Diffusion Framework for Generating High-Fidelity Contrast-Enhanced CT Images from Non-Contrast Scans Poster Session 3
Yi-Bang Wu ⋅ Tzung-Dau Wang ⋅ Shang-Hong Lai
Tucson Ballroom & Prefunction Space 96
INRetouch: Context Aware Implicit Neural Representation for Photography Retouching Poster Session 4 + Reception
Omar Elezabi ⋅ Marcos Conde ⋅ Zongwei Wu ⋅ Radu Timofte
Tucson Ballroom & Prefunction Space 122
Optimization-Free Style Transfer for 3D Gaussian Splats Poster Session 6 + Refreshments
Raphael DuSablon ⋅ David Hart
Tucson Ballroom & Prefunction Space 80
Streaming Real-Time Trajectory Prediction Using Endpoint-Aware Modeling Poster Session 3
Alexander Prutsch ⋅ David Schinagl ⋅ Horst Possegger
Tucson Ballroom & Prefunction Space 134
Performance of Conformal Prediction in Capturing Aleatoric Uncertainty Poster Session 3
Misgina Tsighe Hagos ⋅ Claes Lundström
Tucson Ballroom & Prefunction Space 4
Distilling Offline Action Detection Models into Real-Time Streaming Models Poster Session 5
Deep Patel ⋅ Yasunori Babazaki ⋅ Yasuto Nagase ⋅ Iain Melvin ⋅ Martin Min
Tucson Ballroom & Prefunction Space 39
Multi-Modal Soccer Scene Analysis with Masked Pre-Training Poster Session 3
Marc Peral ⋅ Guillem Capellera ⋅ Luis Ferraz ⋅ Antonio Romano ⋅ Antonio Agudo
Tucson Ballroom & Prefunction Space 59
GroupPortrait: Multi-ID Portrait Generation with High Identity Preservation and Fine-Grained Control Poster Session 5
Meijia Huang ⋅ Ruida Li ⋅ Bing Ma ⋅ Liangwei Jiang ⋅ Shuo Fang ⋅ Chenguang Ma
Tucson Ballroom & Prefunction Space 41
From Prompt to Production: Automating Brand-Safe Marketing Imagery with Text-to-Image Models Poster Session 5
Parmida Atighehchain ⋅ Henry Wang ⋅ Andrei Kapustin ⋅ Boris Lerner ⋅ Tiancheng Jiang ⋅ Taylor Jensen ⋅ Negin Sokhandan
Tucson Ballroom & Prefunction Space 97
GateFusion: Hierarchical Gated Cross-Modal Fusion for Active Speaker Detection Poster Session 1
Yu Wang ⋅ Juhyung Ha ⋅ Frangil Ramirez ⋅ Yuchen Wang ⋅ David Crandall
Tucson Ballroom & Prefunction Space 103
MemeTAG: Keyword-Driven Meme Classification through Tag Embedding Reconstruction Poster Session 6 + Refreshments
Akshit Sharma ⋅ Prashant Patil
Tucson Ballroom & Prefunction Space 46
IDEAL-M3D: Instance Diversity-Enriched Active Learning for Monocular 3D Detection Poster Session 1
Johannes Meier ⋅ Florian Günther ⋅ Riccardo Marin ⋅ Oussema Dhaouadi ⋅ Jacques Kaiser ⋅ Daniel Cremers
Tucson Ballroom & Prefunction Space 18
Gene-DML: Dual-Pathway Multi-Level Discrimination for Gene Expression Prediction from Histopathology Images Poster Session 4 + Reception
Yaxuan Song ⋅ Jianan Fan ⋅ Hang Chang ⋅ Weidong Cai
Tucson Ballroom & Prefunction Space 77
SceneEdited: A City-Scale Benchmark for 3D HD Map Updating via Image-Guided Change Detection Poster Session 5
Chun-Jung Lin ⋅ Tat-Jun Chin ⋅ Sourav Garg ⋅ Feras Dayoub
Tucson Ballroom & Prefunction Space 51
Sketch2Stitch: GANs for Abstract Sketch-Based Dress Synthesis Poster Session 2 + Refreshments
Faizan Khan ⋅ Davide Morelli ⋅ Marcella Cornia ⋅ Rita Cucchiara ⋅ Mohamed Elhoseiny
Tucson Ballroom & Prefunction Space 76
Mixed Diffusion for 3D Indoor Scene Synthesis Poster Session 1
Siyi Hu ⋅ Diego Martín Arroyo ⋅ Stephanie Debats ⋅ Fabian Manhardt ⋅ Luca Carlone ⋅ Federico Tombari
Tucson Ballroom & Prefunction Space 121
Predicting Task fMRI Contrasts from Resting-State fMRI Using Sparse 3D Convolutions Poster Session 5
Ivan Sviridov ⋅ Maria Boyko ⋅ Maksim Sharaev
Tucson Ballroom & Prefunction Space 50
FreeCond: Free Lunch in the Input Conditions of Text-Guided Inpainting Poster Session 4 + Reception
Teng-Fang Hsiao ⋅ Bo-Kai Ruan ⋅ Sung-Lin Tsai ⋅ Yi-Lun Wu ⋅ Hong-Han Shuai
Tucson Ballroom & Prefunction Space 116
Leveraging Semantic Attribute Binding for Free-Lunch Color Control in Diffusion Models Poster Session 6 + Refreshments
Héctor Laria ⋅ Alexandra Gomez-Villa ⋅ Jiang Qin ⋅ Muhammad Atif Butt ⋅ Bogdan Raducanu ⋅ Javier Vazquez-Corral ⋅ Joost van de Weijer ⋅ Kai Wang
Tucson Ballroom & Prefunction Space 47
Unified Alignment Protocol: Making Sense of the Unlabeled Data in New Domains Poster Session 3
Sabbir Ahmed ⋅ Mamshad Nayeem Rizve ⋅ Abdullah Al Arafat ⋅ Jacqueline Liu ⋅ Rahim Hossain ⋅ Mohaiminul Nahian ⋅ Adnan Siraj Rakin
Tucson Ballroom & Prefunction Space 6
AFRAgent: An Adaptive Feature Renormalization Based High Resolution Aware GUI agent Poster Session 1
Neeraj Anand ⋅ Rishabh Jain ⋅ Sohan Patnaik ⋅ Balaji Krishnamurthy ⋅ Mausoom Sarkar
Tucson Ballroom & Prefunction Space 110
Reconstructing Realistic and Relightable Eyes Poster Session 2 + Refreshments
Wesley Khademi ⋅ Jogendra Nath Kundu ⋅ Yatong An ⋅ Alexander Fix ⋅ David Colmenares
Tucson Ballroom & Prefunction Space 79
MomentMix Augmentation with Length-Aware DETR for Temporally Robust Moment Retrieval Poster Session 1
Seojeong Park ⋅ Jiho Choi ⋅ Kyungjune Baek ⋅ Hyunjung Shim
Tucson Ballroom & Prefunction Space 108
Learning Group Actions In Disentangled Latent Image Representations Poster Session 3
Farhana Hossain Swarnali ⋅ Miaomiao Zhang ⋅ Tonmoy Hossain
Tucson Ballroom & Prefunction Space 21
DMAT: An End-to-End Framework for Joint Atmospheric Turbulence Mitigation and Object Detection Poster Session 2 + Refreshments
Paul Hill ⋅ Zhiming Liu ⋅ Alin Achim ⋅ David Bull ⋅ Nantheera Anantrasirichai
Tucson Ballroom & Prefunction Space 121
Context-Preserving Dermoscopic Editing: Mask-Guided Lesion-Aware Diffusion for Attribute Modification Poster Session 4 + Reception
Tao Sun ⋅ Yun Jiang ⋅ Yarong Jin ⋅ Huanting Guo ⋅ Zequn Zhang
Tucson Ballroom & Prefunction Space 103
SceneShine: Illumination-aware Human Scene Gaussian Re-Splatting from Mobile Device Video Poster Session 6 + Refreshments
Xuqian Ren ⋅ Wenjia Wang ⋅ Mai Nguyen ⋅ Juho Kannala ⋅ Esa Rahtu
Tucson Ballroom & Prefunction Space 104
WarpRF: Multi-View Consistency for Training-Free Uncertainty Quantification and Applications in Radiance Fields Poster Session 4 + Reception
Sadra Safadoust ⋅ Fabio Tosi ⋅ Fatma Güney ⋅ Matteo Poggi
Tucson Ballroom & Prefunction Space 90
ChameleonTuner: Automatic ISP Color Tuning in Subjective Scenarios Poster Session 1
Zijie Tan ⋅ Yuxin Yue ⋅ Bahador Rashidi
Tucson Ballroom & Prefunction Space 29
Sketch-guided Cage-based 3D Gaussian Splatting Deformation Poster Session 3
Tianhao Xie ⋅ Noam Aigerman ⋅ Eugene Belilovsky ⋅ Tiberiu Popa
Tucson Ballroom & Prefunction Space 71
AugMapNet: Improving Spatial Latent Structure via BEV Grid Augmentation for Enhanced Vectorized Online HD Map Construction Poster Session 6 + Refreshments
Thomas Monninger ⋅ Md Zafar Anwar ⋅ Stanislaw Antol ⋅ Steffen Staab ⋅ Sihao Ding
Tucson Ballroom & Prefunction Space 127
DiT-VTON: Diffusion Transformer Framework for Unified Multi-Category Virtual Try-On and Virtual Try-All with Integrated Image Editing Poster Session 1
Qi Li ⋅ Shuwen Qiu ⋅ Kee Kiat Koo ⋅ Julien Han ⋅ Karim Bouyarmane
Tucson Ballroom & Prefunction Space 20
Denoise, Divide, Distill, and Predict (D3P): Towards Forecasting Long-horizon Real-world Anomaly from Normalcy Poster Session 5
Quentin Mérilleau ⋅ Snehashis Majhi ⋅ Antitza Dantcheva ⋅ Quan Kong ⋅ Lorenzo Garattoni ⋅ Gianpiero Francesca ⋅ Francois Bremond
Tucson Ballroom & Prefunction Space 43
Efficient Vision Transformers via Token Merging with Head-wise Attention Correction Poster Session 3
Yuki Ichikawa ⋅ Masato Motomura ⋅ Thiem Chu ⋅ Daichi Fujiki
Tucson Ballroom & Prefunction Space 95
Splannequin: Freezing Monocular Mannequin-Challenge Footage with Dual-Detection Splatting Poster Session 6 + Refreshments
Hao-Jen Chien ⋅ Yi-Chuan Huang ⋅ Chung-Ho Wu ⋅ Wei-Lun Chao ⋅ Yu-Lun Liu
Tucson Ballroom & Prefunction Space 79
MixER: From Cross-Modal to Mixed-Modal Visible-Infrared Re-Identification Poster Session 3
Alehdaghi ⋅ Rajarshi Bhattacharya ⋅ Dai Yannick ⋅ Pourya Shamsolmoali ⋅ Rafael M. O. Cruz ⋅ Eric Granger
Tucson Ballroom & Prefunction Space 49
BiNAR: A Bi-Modal Framework for Non-Aligned RGB-IR 3D Reconstruction via Gaussian Splatting Poster Session 4 + Reception
Zhongwen Wang ⋅ Han Ling ⋅ Weihao Zhang ⋅ Yinghui Sun ⋅ Quansen Sun
Tucson Ballroom & Prefunction Space 12
Uncertainty-Aware Vision-Language Segmentation for Medical Imaging Poster Session 6 + Refreshments
Aryan Das ⋅ Tanishq Rachamalla ⋅ Koushik Biswas ⋅ Swalpa Roy ⋅ Vinay Verma
Tucson Ballroom & Prefunction Space 122
Cluster-based Pseudo-labeling for Semi-Supervised LiDAR Semantic Segmentation Poster Session 1
Qingju Guo ⋅ Shuang Li ⋅ Jing Geng ⋅ Binhui Xie ⋅ Jiawei Shan ⋅ Wei Li
Tucson Ballroom & Prefunction Space 60
Semantic Map Guided Bird's-Eye View Learning for Online HD Map Construction Poster Session 6 + Refreshments
Huantao Ren ⋅ Hesham Eraqi ⋅ ABM Musa ⋅ Mohamed Moustafa
Tucson Ballroom & Prefunction Space 62
SilverLining: Data-First Mitigation of Spatial and Spectral Shortcuts Without Introducing New Confounders Poster Session 1
Balagopal Unnikrishnan ⋅ Michael Brudno ⋅ Chris McIntosh
Tucson Ballroom & Prefunction Space 124
HyPCA-Net: Advancing Multimodal Fusion in Medical Image Analysis Poster Session 2 + Refreshments
Joy Dhar ⋅ Manish Pandey ⋅ Debashis Das Chakladar ⋅ Maryam Haghighat ⋅ Azadeh Alavi ⋅ Sajib Mistry ⋅ Nayyar Zaidi
Tucson Ballroom & Prefunction Space 40
Causality-Driven Audits of Model Robustness Poster Session 5
Nathan Drenkow ⋅ William Paul ⋅ Christopher Ribaudo ⋅ Mathias Unberath
Tucson Ballroom & Prefunction Space 15
KD360-VoxelBEV: LiDAR and 360-degree Camera Cross Modality Knowledge Distillation for Bird’s-Eye-View Segmentation Poster Session 3
Wenke E ⋅ Yixin Sun ⋅ Jiaxu Liu ⋅ Hubert P. H. Shum ⋅ Amir Atapour-Abarghouei ⋅ Toby Breckon
Tucson Ballroom & Prefunction Space 54
Universal Neural Architecture Space: Covering ConvNets, Transformers and Everything in Between Poster Session 3
Ondrej Tybl ⋅ Lukas Neumann
Tucson Ballroom & Prefunction Space 73
FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs Poster Session 1
Carlos Plou ⋅ Cesar Borja ⋅ Ruben Martinez-Cantin ⋅ Ana Murillo
Tucson Ballroom & Prefunction Space 128
SmokeBench: Evaluating Multimodal Large Language Models for Wildfire Smoke Detection Poster Session 1
Tianye Qi ⋅ Weihao Li ⋅ Nick Barnes
Tucson Ballroom & Prefunction Space 100
Anatomically-guided masked autoencoder pre-training for aneurysm detection Poster Session 4 + Reception
Alberto Mario Ceballos Arroyo ⋅ Jisoo Kim ⋅ Chu-Hsuan Lin ⋅ Lei Qin ⋅ Geoffrey Young ⋅ Huaizu Jiang
Tucson Ballroom & Prefunction Space 135
Disentangle and Regularize: Sign Language Production with Articulator-Based Disentanglement and Channel-Aware Regularization Poster Session 6 + Refreshments
Meryem Taşyürek ⋅ Tuğçe Kızıltepe ⋅ Hacer Keles
Tucson Ballroom & Prefunction Space 119
AuthGuard: Generalizable Deepfake Detection via Language Guidance Poster Session 5
Guangyu Shen ⋅ Zhihua Li ⋅ Xiang Xu ⋅ Tianchen Zhao ⋅ Zheng Zhang ⋅ Dongsheng An ⋅ Zhuowen Tu ⋅ Yifan Xing ⋅ Qin Zhang
Tucson Ballroom & Prefunction Space 40
Single-step Diffusion for Image Compression at Ultra-Low Bitrates Poster Session 5
Chanung Park ⋅ Joo Chan Lee ⋅ Jong Hwan Ko
Tucson Ballroom & Prefunction Space 57
Odo: Depth-Guided Diffusion for Identity-Preserving Body Reshaping Poster Session 1
Siddharth Khandelwal ⋅ Sridhar Kamath ⋅ Arjun Jain
Tucson Ballroom & Prefunction Space 3
Color Bind: Exploring Color Perception in Text-to-Image Models Poster Session 2 + Refreshments
Shay Shomer-Chai ⋅ Wenxuan Peng ⋅ Bharath Hariharan ⋅ Hadar Averbuch-Elor
Tucson Ballroom & Prefunction Space 48
DenseBEV: Transforming BEV Grid Cells into 3D Objects Poster Session 2 + Refreshments
Marius Dähling ⋅ Sebastian Krebs ⋅ J. Zöllner
Tucson Ballroom & Prefunction Space 91
MIST: Multilingual Incidental Dataset for Scene Text Detection Poster Session 6 + Refreshments
Saumya Vijay Mundra ⋅ Ajoy Mondal ⋅ Jawahar CV
Tucson Ballroom & Prefunction Space 44
NeuroBridge: Few-Shot Cross-Modal Neuron Re-identification via Dual-Channel Deep Metric Learning Poster Session 6 + Refreshments
Wenwei Li ⋅ Mingwei Liao ⋅ Lingyi Cai ⋅ Anan Li
Tucson Ballroom & Prefunction Space 139
General and Domain-Specific Zero-shot Detection of Generated Images via Conditional Likelihood Poster Session 6 + Refreshments
Roy Betser ⋅ Omer Hofman ⋅ Roman Vainshtein ⋅ Guy Gilboa
Tucson Ballroom & Prefunction Space 58
Model-free Domain Adaptation for Concealed Multimodal Large-Language Models Poster Session 1
Yu Mitsuzumi ⋅ Akisato Kimura ⋅ Hisashi Kashima
Tucson Ballroom & Prefunction Space 118
Autoregressive Styled Text Image Generation, but Make it Reliable Poster Session 3
Carmine Zaccagnino ⋅ Fabio Quattrini ⋅ Vittorio Pippi ⋅ Silvia Cascianelli ⋅ Alessio Tonioni ⋅ Rita Cucchiara
Tucson Ballroom & Prefunction Space 72
Perception-Inspired Color Space Design for Photo White Balance Editing Poster Session 3
Yang Cheng ⋅ Ziteng Cui ⋅ Lin Gu ⋅ Shenghan Su ⋅ Zenghui Zhang
Tucson Ballroom & Prefunction Space 79
Beyond Realism: Learning the Art of Expressive Composition with StickerNet Poster Session 1
Haoming Lu ⋅ David Kocharian ⋅ Humphrey Shi
Tucson Ballroom & Prefunction Space 83
RobustGait: Robustness Analysis for Appearance Based Gait Recognition Poster Session 2 + Refreshments
Reeshoon Sayera ⋅ Akash Kumar ⋅ Sirshapan Mitra ⋅ Prudvi Kamtam ⋅ Yogesh Rawat
Tucson Ballroom & Prefunction Space 107
SHaSaM: Submodular Hard Sample Mining for Fair Facial Attribute Recognition Poster Session 6 + Refreshments
Anay Majee ⋅ Rishabh Iyer
Tucson Ballroom & Prefunction Space 25
MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes Poster Session 4 + Reception
Ruiyuan Gao ⋅ Kai Chen ⋅ Zhihao Li ⋅ Lanqing Hong ⋅ Zhenguo Li ⋅ Qiang Xu
Tucson Ballroom & Prefunction Space 138
Cosine Similarity is Almost All You Need (for Prototypical-Part Models) Poster Session 2 + Refreshments
Luke Moffett ⋅ Frank Willard ⋅ Maximillian Machado ⋅ Emmanuel Mokel ⋅ Jon Donnelly ⋅ Zhicheng Guo ⋅ Adam Costarino ⋅ Julia Yang ⋅ Giyoung Kim ⋅ Alina Barnett ⋅ Cynthia Rudin
Tucson Ballroom & Prefunction Space 17
M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models Poster Session 1
Hongyu Wang ⋅ Jiayu Xu ⋅ Senwei Xie ⋅ Ruiping Wang ⋅ Jialin Li ⋅ Zhaojie Xie ⋅ Bin Zhang ⋅ Chuyan Xiong ⋅ Xilin Chen
Tucson Ballroom & Prefunction Space 37
Data-Driven Lipschitz Continuity: A Cost-Effective Approach to Improve Adversarial Robustness Poster Session 1
Erh-Chung Chen ⋅ Pin-Yu Chen ⋅ I-Hsin Chung ⋅ Che-Rung Lee
Tucson Ballroom & Prefunction Space 67
Q-Former Autoencoder: A Modern Framework for Medical Anomaly Detection Poster Session 6 + Refreshments
Francesco Dalmonte ⋅ Emirhan Bayar ⋅ Emre Akbas ⋅ Iuliana Georgescu
Tucson Ballroom & Prefunction Space 75
CAAC: Confidence-Aware Attention Calibration to Reduce Hallucinations in Large Vision-Language Models Poster Session 1
Mehrdad Fazli ⋅ Bowen Wei ⋅ Ahmet Sari ⋅ Ziwei Zhu
Tucson Ballroom & Prefunction Space 119
Test Time Adaptation Using Adaptive Quantile Recalibration Poster Session 5
Paria Mehrbod ⋅ Pedro Vianna ⋅ Geraldin Nanfack ⋅ Guy Wolf ⋅ Eugene Belilovsky
Tucson Ballroom & Prefunction Space 18
V2XScene: Multi-View Consistent 3D Scene Simulation for Collaborative Perception Poster Session 5
Yanfei Li ⋅ Yi Gong ⋅ Yuan Zeng
Tucson Ballroom & Prefunction Space 74
Point2Pose: A Generative Framework for 3D Human Pose Estimation with Multi-View Point Cloud Dataset Poster Session 5
Hyunsoo Lee ⋅ Daeum Jeon ⋅ Hyeokjae Oh
Tucson Ballroom & Prefunction Space 90
GeoHSAF: Geometric Hippocampus Shape Analysis Framework for Longitudinal Alzheimer's Disease Classification Poster Session 2 + Refreshments
Mubarak Olaoluwa ⋅ Heni Loukil ⋅ Arafet Sbei ⋅ Hassen Drira
Tucson Ballroom & Prefunction Space 71
Seeing is Believing (and Predicting): Context-Aware Multi-Human Behavior Prediction with Vision Language Models Poster Session 2 + Refreshments
Utsav Panchal ⋅ Yuchen Liu ⋅ Luigi Palmieri ⋅ Ilche Georgievski ⋅ Marco Aiello
Tucson Ballroom & Prefunction Space 52
Segmentation-Aware Latent Diffusion for Satellite Image Super-Resolution: Enabling Smallholder Farm Boundary Delineation Poster Session 2 + Refreshments
Aditi Agarwal ⋅ Anjali Jain ⋅ Nikita Saxena ⋅ Ishan Deshpande ⋅ Michal Kazmierski ⋅ Abigail Annkah ⋅ Nadav Sherman ⋅ Karthikeyan Shanmugam ⋅ Alok Talekar ⋅ Vaibhav Rajan
Tucson Ballroom & Prefunction Space 43
Learning from Unknown for Open-Set Test-Time Adaptation Poster Session 3
Taki Hasan Rafi ⋅ Amit Agarwal ⋅ Hitesh Patel ⋅ Dong-Kyu Chae
Tucson Ballroom & Prefunction Space 8
3D Gaussian Point Encoders Poster Session 2 + Refreshments
Jim James ⋅ Benjamin Wilson ⋅ Simon Lucey ⋅ James Hays
Tucson Ballroom & Prefunction Space 36
A Unified Diffusion-Based Framework for Multi-Agent Trajectory Prediction Integrating Structured Multi-Modal Representations Poster Session 5
Chenxi Yang ⋅ Suyang Xi ⋅ Hong Ding ⋅ Yiqing Shen ⋅ Yunhao Liu
Tucson Ballroom & Prefunction Space 62
PointSt3R: Point Tracking through 3D Ground Correspondence Poster Session 6 + Refreshments
Rhodri Guerrier ⋅ Adam Harley ⋅ Dima Damen
Tucson Ballroom & Prefunction Space 22
False Alarm Rectification for Early Smoke Segmentation Poster Session 2 + Refreshments
Hongjin Zhao ⋅ Weihao Li ⋅ Ge-Peng Ji ⋅ Nick Barnes
Tucson Ballroom & Prefunction Space 53
Grounding Degradations in Natural Language for All-In-One Video Restoration Poster Session 4 + Reception
Muhammad Kamran Janjua ⋅ Amirhosein Ghasemabadi ⋅ Kunlin Zhang ⋅ Mohammad Salameh ⋅ Chao Gao ⋅ Di Niu
Tucson Ballroom & Prefunction Space 139
OracleGS: Grounding Generative Priors for Sparse-View Gaussian Splatting Poster Session 1
Atakan Topaloğlu ⋅ Kunyi Li ⋅ Michael Niemeyer ⋅ Nassir Navab ⋅ Ahmet Tekalp ⋅ Federico Tombari
Tucson Ballroom & Prefunction Space 8
CanKD: Cross-Attention-based Non-local operation for Feature-based Knowledge Distillation Poster Session 6 + Refreshments
Shizhe Sun ⋅ Wataru Ohyama
Tucson Ballroom & Prefunction Space 133
CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs Poster Session 5
Qizhen Lan ⋅ Qing Tian
Tucson Ballroom & Prefunction Space 132
Spacewalk-18: A Benchmark for Multimodal and Long-form Procedural Video Understanding in Novel Domains Poster Session 4 + Reception
Zitian Tang ⋅ Rohan Krishnan ⋅ Zhiqiu Yu ⋅ Chen Sun
Tucson Ballroom & Prefunction Space 18
Countering Multi-modal Representation Collapse through Rank-targeted Fusion Poster Session 4 + Reception
Seulgi Kim ⋅ Kiran Kokilepersaud ⋅ Mohit Prabhushankar ⋅ Ghassan AlRegib
Tucson Ballroom & Prefunction Space 44
Fine-grained Defocus Blur Control for Generative Image Models Poster Session 4 + Reception
Ayush Shrivastava ⋅ Connelly Barnes ⋅ Cecilia Zhang ⋅ Lingzhi Zhang ⋅ Andrew Owens ⋅ Sohrab Amirghodsi ⋅ Eli Shechtman
Tucson Ballroom & Prefunction Space 5
Lorentz Entailment Cone for Semantic Segmentation Poster Session 4 + Reception
Zahid Hasan ⋅ Masud Ahmed ⋅ Nirmalya Roy
Tucson Ballroom & Prefunction Space 89
FNOPT: Resolution-Agnostic, Self-Supervised Cloth Simulation using Meta-Optimization with Fourier Neural Operators Poster Session 5
Ruochen Chen ⋅ Thuy Tran ⋅ Shaifali Parashar
Tucson Ballroom & Prefunction Space 125
Gaussian Splatting Map Registration with Orthographic Bird's-Eye-View Renderings Poster Session 5
Hugo Leblond ⋅ Gilles Simon ⋅ Renato Martins ⋅ Cedric Demonceaux ⋅ Marie-Odile Berger
Tucson Ballroom & Prefunction Space 27
Boosting Medical Vision-Language Pretraining via Momentum Self-Distillation under Limited Computing Resources Poster Session 1
Phuc Pham ⋅ Nhu Pham ⋅ Ngoc Ly
Tucson Ballroom & Prefunction Space 82
WiSAR3D - Aerial LiDAR dataset for 3D object detection Poster Session 5
Oren Shrout ⋅ Ori Nizan ⋅ Yizhak Ben-Shabat ⋅ Ayellet Tal
Tucson Ballroom & Prefunction Space 75
LighthouseGS: Indoor Structure-aware 3D Gaussian Splatting for Panorama-Style Mobile Captures Poster Session 3
Seungoh Han ⋅ Jaehoon Jang ⋅ Hyunsu Kim ⋅ Jaeheung Surh ⋅ Junhyung Kwak ⋅ Hyowon Ha ⋅ Kyungdon Joo
Tucson Ballroom & Prefunction Space 50
Distilling What and Why: Enhancing Driver Intention Prediction with MLLMs Poster Session 6 + Refreshments
Sainithin Artham ⋅ Avijit Dasgupta ⋅ Shankar Gangisetty ⋅ Jawahar CV
Tucson Ballroom & Prefunction Space 8
Modeling and Learning Multiple Hypotheses for Monocular 3D Object Detection Poster Session 5
Hyeonjeong Park ⋅ Peixi Xiong ⋅ Pei Yu ⋅ Wei Tang
Tucson Ballroom & Prefunction Space 118
Learning to Animate Images from A Few Videos to Portray Delicate Human Actions Poster Session 1
Haoxin Li ⋅ Yingchen Yu ⋅ Qilong Wu ⋅ Hanwang Zhang ⋅ Song Bai ⋅ Boyang Li
Tucson Ballroom & Prefunction Space 53
Towards Streaming LiDAR Object Detection with Point Clouds as Egocentric Sequences Poster Session 3
Mellon Zhang ⋅ Glen Chou ⋅ Saibal Mukhopadhyay
Tucson Ballroom & Prefunction Space 34
DUDA: Distilled Unsupervised Domain Adaptation for Lightweight Semantic Segmentation Poster Session 6 + Refreshments
Beomseok Kang ⋅ Niluthpol Mithun ⋅ Abhinav Rajvanshi ⋅ Han-pang Chiu ⋅ Supun Samarasekera
Tucson Ballroom & Prefunction Space 88
VADER: Towards Causal Video Anomaly Understanding with Relation-Aware Large Language Models Poster Session 6 + Refreshments
Ying Cheng ⋅ Yu-Ho Lin ⋅ Min-Hung Chen ⋅ Fu-En Yang ⋅ Shang-Hong Lai
Tucson Ballroom & Prefunction Space 10
QuEENet: Quantum-Enhanced Expressive Network for Image Classification Poster Session 6 + Refreshments
Shashank Bayal ⋅ Dawane Govind ⋅ Komal Komal ⋅ Santosh Vipparthi ⋅ Subrahmanyam Murala
Tucson Ballroom & Prefunction Space 65
ObjectCore - Efficient Few-shot Logical Anomaly Detection using Object Representations Poster Session 3
Matic Fučka ⋅ Vitjan Zavrtanik ⋅ Danijel Skocaj
Tucson Ballroom & Prefunction Space 90
HodgeFormer: Transformers for Learnable Operators on Triangular Meshes through Data-Driven Hodge Matrices Poster Session 5
Akis Nousias ⋅ Stavros Nousias
Tucson Ballroom & Prefunction Space 95
OW-Rep: Open World Object Detection with Instance Representation Learning Poster Session 1
Sunoh Lee ⋅ Minsik Jeon ⋅ Jihong Min ⋅ Junwon Seo
Tucson Ballroom & Prefunction Space 33
Marshaled Learning: Bridging Large Neural Networks with Memory-Constrained Trusted Execution Environments in Federated Learning Poster Session 1
Shiwei Ding ⋅ Xiaoyong Yuan ⋅ Zhenlin Wang ⋅ Lan Zhang ⋅ Giuseppe Ateniese
Tucson Ballroom & Prefunction Space 62
DTMIR-Pro: Domain Translation with Prompt-based Latent-Space Generalization for Multi-Weather Image Restoration Poster Session 3
Ashutosh Kulkarni ⋅ Prashant Patil ⋅ Santosh Vipparthi ⋅ Subrahmanyam Murala ⋅ Balasubramanian Raman
Tucson Ballroom & Prefunction Space 89
SPAR-Det: Segmentation-guided and Prior-Aided Routing for Small Object Detection Poster Session 2 + Refreshments
Seungchan Kwon ⋅ Gyuil Lim ⋅ Youngjoon Han
Tucson Ballroom & Prefunction Space 70
TA-Prompting: Enhancing Video Large Language Models for Dense Video Captioning via Temporal Anchors Poster Session 1
Wei-Yuan Cheng ⋅ Kai-Po Chang ⋅ Chi-Pin Huang ⋅ Fu-En Yang ⋅ Frank Wang
Tucson Ballroom & Prefunction Space 22
Large Sign Language Models: Toward 3D American Sign Language Translation Poster Session 3
Sen Zhang ⋅ Di Liu ⋅ Zhaoyang Xia ⋅ Mingyu Zhao ⋅ Chaowei Tan ⋅ Vivian Li ⋅ Bo Liu ⋅ Dimitri Metaxas ⋅ Mubbasir Kapadia
Tucson Ballroom & Prefunction Space 18
CADE: Continual Weakly-supervised Video Anomaly Detection with Ensembles Poster Session 1
Satoshi Hashimoto ⋅ Tatsuya Konishi ⋅ Tomoya Kaichi ⋅ Kazunori Matsumoto ⋅ Mori Kurokawa
Tucson Ballroom & Prefunction Space 68
UNO: Unifying One-stage Video Scene Graph Generation via Object-Centric Visual Representation Learning Poster Session 2 + Refreshments
Huy Le ⋅ Nhat Chung ⋅ Tung Kieu ⋅ Jingkang Yang ⋅ Ngan Le
Tucson Ballroom & Prefunction Space 131
CSGaussian: Progressive Rate-Distortion Compression and Segmentation for 3D Gaussian Splatting Poster Session 5
Yu-Jen Tseng ⋅ Chia-Hao Kao ⋅ Jing-Zhong Chen ⋅ Alessandro Gnutti ⋅ Shao-Yuan Lo ⋅ Yen-Yu Lin ⋅ Wen-Hsiao Peng
Tucson Ballroom & Prefunction Space 103
RealDroneVision: Dataset and Architecture Advancements for Small-Object Drone Detection Poster Session 5
Arun Kumar Sivapuram ⋅ Pranav Peddinti ⋅ Harish Puppala ⋅ Komuravelli Prashanth ⋅ Jaladi Sri Harsha ⋅ Gorthi Subrahmanyam
Tucson Ballroom & Prefunction Space 84
AutoSew: A Geometric Approach to Stitching Prediction with Graph Neural Networks Poster Session 1
Pablo Ríos ⋅ Elena Garces ⋅ Jorge Lopez-Moreno
Tucson Ballroom & Prefunction Space 132
SDT-6D: Fully Sparse Depth-Transformer for Staged End-to-End 6D Pose Estimation in Industrial Multi-View Bin Picking Poster Session 6 + Refreshments
Nico Leuze ⋅ Maximilian Hoh ⋅ Samed Doğan ⋅ Nicolas Rodriguez Pena ⋅ Alfred Schöttl
Tucson Ballroom & Prefunction Space 114
Decomposition Sampling for Efficient Region Annotations in Active Learning Poster Session 3
Jingna Qiu ⋅ Frauke Wilm ⋅ Mathias Oettl ⋅ Jonas Utz ⋅ Maja Schlereth ⋅ Moritz Schillinger ⋅ Marc Aubreville ⋅ Katharina Breininger
Tucson Ballroom & Prefunction Space 119
Test-Time Adaptation through Semantically-guided Feature Decomposition for Few-shot Chest X-ray Diagnosis Poster Session 2 + Refreshments
Jayant Mahawar ⋅ Angshuman Paul
Tucson Ballroom & Prefunction Space 98
Hestia: Voxel-Face-Aware Hierarchical Next-Best-View Acquisition for Efficient 3D Reconstruction Poster Session 4 + Reception
Cheng-You Lu ⋅ Zhuoli Zhuang ⋅ Nguyen Le ⋅ Da Xiao ⋅ Yu-Cheng Chang ⋅ Thomas Do ⋅ Srinath Sridhar ⋅ Chin-Teng Lin
Tucson Ballroom & Prefunction Space 97
Synthesizing Compositional Videos from Text Description Poster Session 5
Prajwal Singh ⋅ Kuldeep Kulkarni ⋅ Shanmuganathan Raman ⋅ Harsh Rangwani
Tucson Ballroom & Prefunction Space 93
SpikeRain: Towards Energy-Efficient Single Image Deraining with Spiking Neural Networks Poster Session 1
Md Tanvir Islam ⋅ Inzamamul Alam ⋅ Sambit Bakshi ⋅ Khan Muhammad ⋅ Javier Del Ser ⋅ Sangtae Ahn
Tucson Ballroom & Prefunction Space 105
Robust Multimodal Emotion Recognition from Incomplete Modalities via Query-Based Unimodal and Cross-Modal Learning Poster Session 4 + Reception
Ryo Miyoshi ⋅ Mayu Otani ⋅ Yuki Okafuji
Tucson Ballroom & Prefunction Space 59
ICONIC-444: A 3.1-Million-Image Dataset for OOD Detection Research Poster Session 6 + Refreshments
Gerhard Krumpl ⋅ Henning Avenhaus ⋅ Horst Possegger
Tucson Ballroom & Prefunction Space 116
Polymorph: Energy-Efficient Multi-Label Classification for Video Streams on Embedded Devices Poster Session 5
Saeid Ghafouri ⋅ Mohsen Fayyaz ⋅ Xiangchen Li ⋅ Deepu John ⋅ Bo Ji ⋅ Dimitrios Nikolopoulos ⋅ Hans Vandierendonck
Tucson Ballroom & Prefunction Space 61
mmWeaver: Environment-Specific mmWave Signal Synthesis from a Photo and Activity Description Poster Session 2 + Refreshments
Mahathir Monjur ⋅ Shahriar Nirjon
Tucson Ballroom & Prefunction Space 44
BlendCLIP: Bridging Synthetic and Real Domains for Zero-Shot 3D Object Classification with Multimodal Pretraining Poster Session 4 + Reception
Ajinkya Khoche ⋅ Gergő Nagy ⋅ Maciej Wozniak ⋅ Thomas Gustafsson ⋅ Patric Jensfelt
Tucson Ballroom & Prefunction Space 142
Sketch3R: Rapid and Realistic 3D VR Sketch Creation to Shape Retrieval Poster Session 6 + Refreshments
Mritunjoy Halder ⋅ Shivam Shukla ⋅ Lokender Tiwari ⋅ Raghav Mittal ⋅ Brojeshwar Bhowmick
Tucson Ballroom & Prefunction Space 140
Training-free Conditional Image Embedding Framework Leveraging Large Vision Language Models Poster Session 6 + Refreshments
Masayuki Kawarada ⋅ Kosuke Yamada ⋅ Antonio Tejero-de-Pablos ⋅ Naoto Inoue
Tucson Ballroom & Prefunction Space 42
DF-Mamba: Deformable State Space Modeling for 3D Hand Pose Estimation in Interactions Poster Session 4 + Reception
Yifan Zhou ⋅ Takehiko Ohkawa ⋅ Guwenxiao Zhou ⋅ Kanoko Goto ⋅ Takumi Hirose ⋅ Yusuke Sekikawa ⋅ Nakamasa Inoue
Tucson Ballroom & Prefunction Space 102
Semi-supervised Key-Point Estimation for Echocardiography Video Poster Session 4 + Reception
Seok-Hwan Oh ⋅ Hyeonjik Lee ⋅ Guil Jung ⋅ Myeong-Gee Kim ⋅ Young-Min Kim ⋅ Hyuksool Kwon ⋅ Hyeon-min Bae
Tucson Ballroom & Prefunction Space 134
CLUE: Bringing Machine Unlearning to Mobile Devices Poster Session 3
A. Q. M. Sazzad Sayyed ⋅ Nathaniel Bastian ⋅ Michael Lucia ⋅ Ananthram Swami ⋅ Francesco Restuccia
Tucson Ballroom & Prefunction Space 80
Snapmoji: Instant Generation of Animatable Dual-Stylized Avatars Poster Session 2 + Refreshments
Eric Chen ⋅ Di Liu ⋅ Sizhuo Ma ⋅ Michael Vasilkovsky ⋅ Bing Zhou ⋅ Qiang Gao ⋅ Wenzhou Wang ⋅ Jiahao Luo ⋅ Dimitri Metaxas ⋅ Vincent Sitzmann ⋅ Jian Wang
Tucson Ballroom & Prefunction Space 51
From Bands to Depth: Understanding Bathymetry Decisions on Sentinel-2 Poster Session 2 + Refreshments
Satyaki Roy Chowdhury ⋅ Aswathnarayan Radhakrishnan ⋅ Hari Subramoni
Tucson Ballroom & Prefunction Space 62
From Street to Orbit: Training-Free Cross-View Retrieval via Location Semantics and LLM Guidance Poster Session 1
Jeongho Min ⋅ Dongyoung Kim ⋅ Jaehyup Lee
Tucson Ballroom & Prefunction Space 55
Codebook Knowledge with Mamba-Transformer For Low-Light Image Enhancement Poster Session 3
Runhua Deng ⋅ Aiwen Jiang ⋅ Long Peng ⋅ Qiuhai Yan
Tucson Ballroom & Prefunction Space 77
Morphing Through Time: Diffusion-Based Bridging of Temporal Gaps for Robust Alignment in Change Detection Poster Session 1
Seyedehanita Madani ⋅ Vishal Patel
Tucson Ballroom & Prefunction Space 42
FujiView: Multimodal Late-Fusion for Predicting Scenic Visibility Poster Session 4 + Reception
Bryce Bible ⋅ Shah Hasnaeen ⋅ Hairong Qi
Tucson Ballroom & Prefunction Space 131
Anatomy-VLM: A Fine-grained Vision-Language Model for Medical Interpretation Poster Session 2 + Refreshments
Difei Gu ⋅ Yunhe Gao ⋅ Mu Zhou ⋅ Dimitri Metaxas
Tucson Ballroom & Prefunction Space 135
Unsupervised Modular Adaptive Region Growing and RegionMix Classification for Wind Turbine Segmentation Poster Session 3
Raül Pérez-Gonzalo ⋅ Riccardo Magro ⋅ Andreas Espersen ⋅ Antonio Agudo
Tucson Ballroom & Prefunction Space 92
Learning Action Hierarchies via Hybrid Geometric Diffusion Poster Session 3
Arjun Kaushik Kaushik ⋅ Nalini Ratha ⋅ Venu Govindaraju
Tucson Ballroom & Prefunction Space 20
Self-Supervised Compression and Artifact Correction for Streaming Underwater Imaging Sonar Poster Session 3
Rongsheng Qian ⋅ Chi Xu ⋅ Xiaoqiang Ma ⋅ Hao Fang ⋅ Yili Jin ⋅ William Atlas ⋅ Jiangchuan Liu
Tucson Ballroom & Prefunction Space 123
BOP-Distrib: Revisiting 6D Pose Estimation Benchmarks for Better Evaluation under Visual Ambiguities Poster Session 2 + Refreshments
Boris Meden ⋅ Asma Brazi ⋅ Fabrice Mayran de Chamisso ⋅ Steve Bourgeois ⋅ Vincent Lepetit
Tucson Ballroom & Prefunction Space 16
VividAnimator: An End-to-End Audio and Pose-driven Half-Body Human Animation Framework Poster Session 4 + Reception
Donglin Huang ⋅ Yongyuan Li ⋅ Tianhang Liu ⋅ Junming Huang ⋅ Xiaoda Yang ⋅ Chi Wang ⋅ Weiwei Xu
Tucson Ballroom & Prefunction Space 4
Edge-Aware Image Manipulation via Diffusion Models with a Novel Structure-Preservation Loss Poster Session 4 + Reception
Minsu Gong ⋅ Nuri Ryu ⋅ Jungseul Ok ⋅ Sunghyun Cho
Tucson Ballroom & Prefunction Space 82
3D Superquadric Splatting Poster Session 4 + Reception
Daniel MacSwayne ⋅ Ales Leonardis ⋅ Jianbo Jiao
Tucson Ballroom & Prefunction Space 83
Learnable Query-Enhanced Pose Transformation Poster Session 2 + Refreshments
Yi-Zhen Wang ⋅ Hong-Han Shuai
Tucson Ballroom & Prefunction Space 59
VLMDiff: Leveraging Vision-Language Models for Multi-Class Anomaly Detection with Diffusion Poster Session 5
Samet Hicsonmez ⋅ Abd El Rahman Shabayek ⋅ Djamila Aouada
Tucson Ballroom & Prefunction Space 49
Bi-ICE: An Inner Interpretable Framework for Image Classification via Bi-directional Interactions between Concept and Input Embeddings Poster Session 3
Jinyung Hong ⋅ Yearim Kim ⋅ Keun Hee Park ⋅ Sangyu Han ⋅ Nojun Kwak ⋅ Theodore Pavlic
Tucson Ballroom & Prefunction Space 88
Bridging the Domain Gap in Small Multimodal Models: A Dual-level Alignment Perspective Poster Session 6 + Refreshments
Aveen Dayal ⋅ Peketi Divya ⋅ Nidhi Tiwari ⋅ Linga Reddy Cenkeramaddi ⋅ C Mohan ⋅ Abhinav Kumar
Tucson Ballroom & Prefunction Space 100
UniTabBank: A Large Scale Multi-Lingual, Multi-Layout, Multi-Type, Multi-Format Dataset for Table Detection Poster Session 5
Ajoy Mondal ⋅ Saumya Vijay Mundra ⋅ Avijit Dasgupta ⋅ Jawahar CV
Tucson Ballroom & Prefunction Space 66
Text Slider: Efficient and Plug-and-Play Continuous Concept Control for Image/Video Synthesis via LoRA Adapters Poster Session 1
Pin-Yen Chiu ⋅ I-Sheng Fang ⋅ Jun-Cheng Chen
Tucson Ballroom & Prefunction Space 59
VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models Poster Session 6 + Refreshments
Kailai Feng ⋅ Yabo Zhang ⋅ Haodong Yu ⋅ Zhilong Ji ⋅ Jinfeng Bai ⋅ Hongzhi Zhang ⋅ Wangmeng Zuo
Tucson Ballroom & Prefunction Space 97
PoseGaussian: Pose-Driven Novel View Synthesis for Robust 3D Human Reconstruction Poster Session 4 + Reception
Ju Shen ⋅ Chen Chen ⋅ Tam Nguyen ⋅ Vijayan Asari
Tucson Ballroom & Prefunction Space 69
STEG-AIW: Spatio-Temporal Gating and Adaptive-Timestep Inference for Efficient Spiking Neural Networks Poster Session 3
Gulfam A Saju ⋅ Anton Spirkin ⋅ Felipe Marcelino ⋅ Yuchou Chang
Tucson Ballroom & Prefunction Space 121
Workzone3D: A Multimodal Dataset for 3D Work Zone Perception in Autonomous Driving Poster Session 3
Shounak Sural ⋅ Nishad Sahu ⋅ Ragunathan Rajkumar
Tucson Ballroom & Prefunction Space 101
CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading Poster Session 3
Mishan Aliev ⋅ Dmitry Baranchuk ⋅ Kirill Struminsky
Tucson Ballroom & Prefunction Space 47
Prompt-OT: An Optimal Transport Regularization Paradigm for Knowledge Preservation in Vision-Language Model Adaptation Poster Session 1
Xiwen Chen ⋅ Wenhui Zhu ⋅ Peijie Qiu ⋅ Hao Wang ⋅ Huayu Li ⋅ Haiyu Wu ⋅ XUANZHAO DONG ⋅ Aris Sotiras ⋅ Yalin Wang ⋅ Abolfazl Razi
Tucson Ballroom & Prefunction Space 64
HumanGuideNet: Adapter-Based Alignment of Deep Neural Networks with Human Similarity Judgments Poster Session 2 + Refreshments
Xufu Liu ⋅ Yifan Yang ⋅ Zhengxin Zhang
Tucson Ballroom & Prefunction Space 37
PrevMatch: Revisiting and Maximizing Temporal Knowledge in Semi-Supervised Semantic Segmentation Poster Session 4 + Reception
Wooseok Shin ⋅ Hyun Joon Park ⋅ Jin Sob Kim ⋅ Juan Yun ⋅ Se Park ⋅ Sung Han
Tucson Ballroom & Prefunction Space 64
Zero-Shot Table Extraction in Business Documents: A Unified Benchmark with Error Taxonomy and Ecological Analysis Poster Session 4 + Reception
Eliott THOMAS ⋅ Mickael Coustaty ⋅ Aurélie JOSEPH ⋅ Tri-Cong Pham ⋅ Gaspar DELOIN ⋅ Elodie CAREL ⋅ Vincent d'Andecy ⋅ Jean-marc Ogier
Tucson Ballroom & Prefunction Space 66
MAESTRO: Masked AutoEncoders for Multimodal, Multitemporal, and Multispectral Earth Observation Data Poster Session 1
Antoine Labatie ⋅ Michael Vaccaro ⋅ Nina Lardiere ⋅ Anatol Garioud ⋅ Nicolas Gonthier
Tucson Ballroom & Prefunction Space 21
IMPACT: Interpretable Most Important Person Analysis and Classification using Transformer-based Models Poster Session 6 + Refreshments
Akshat Rampuria ⋅ Kamakshya Nayak ⋅ Kamalakar Thakare ⋅ Tushar Joshi ⋅ Aditya Singh ⋅ Haesol Park ⋅ Heeseung Choi ⋅ Debi Dogra ⋅ Ig-Jae Kim
Tucson Ballroom & Prefunction Space 93
MapVerse: A Benchmark for Geospatial Question Answering on Diverse Real-World Maps Poster Session 6 + Refreshments
Sharat Bhat ⋅ Harshita Khandelwal ⋅ Tushar Kataria ⋅ Vivek Gupta
Tucson Ballroom & Prefunction Space 92
HistoMILKD: A Multiple Instance Learning based Multi-Teacher Knowledge Distillation Framework for Whole Slide Image Classification Poster Session 3
Mayur Mallya ⋅ Ali Khajegili Mirabadi ⋅ Hossein Farahani ⋅ Ali Bashashati
Tucson Ballroom & Prefunction Space 45
SymNet: A Multi-Task Network for Joint Radio Map Reconstruction and Transmitter Localization Poster Session 1
Lyuzhou Ye ⋅ Thanh Le ⋅ Yan Huang
Tucson Ballroom & Prefunction Space 15
Perceptually Guided 3DGS Streaming and Rendering for Mixed Reality Poster Session 3
Yunxiang Zhang ⋅ Sai Mupparaju ⋅ Kenneth Chen ⋅ Jenna Kang ⋅ Xinyu Zhang ⋅ Maito Omori ⋅ Kazuyuki Arimatsu ⋅ Qi Sun
Tucson Ballroom & Prefunction Space 124
Cycle-consistent Multi-graph Matching for Self-supervised Annotation of C. Elegans Poster Session 6 + Refreshments
Sebastian Stricker ⋅ Christoph Karg ⋅ Lisa Hutschenreiter ⋅ Bogdan Savchynskyy ⋅ Dagmar Kainmueller
Tucson Ballroom & Prefunction Space 1
R3: Reconstruction, Raw, and Rain: Deraining Directly in the Bayer Domain Poster Session 4 + Reception
Nate Rothschild ⋅ Moshe Kimhi ⋅ Avi Mendelson ⋅ Chaim Baskin
Tucson Ballroom & Prefunction Space 98
Sun-E: Dataset and Benchmark for Event-Based Sun Sensing Poster Session 4 + Reception
Sydney Dolan ⋅ Alessandro Golkar
Tucson Ballroom & Prefunction Space 51
Dressing the Imagination: A Dataset for AI-Powered Translation of Text into Fashion Outfits and A Novel NeRA Adapter for Enhanced Feature Adaptation Poster Session 2 + Refreshments
Gayatri Deshmukh ⋅ Somsubhra De ⋅ Chirag Sehgal ⋅ Jishu Gupta ⋅ Sparsh Mittal
Tucson Ballroom & Prefunction Space 65
Mitigating Object and Action Hallucinations in Multimodal LLMs via Self-Augmented Contrastive Alignment Poster Session 3
Kai-Po Chang ⋅ Wei-Yuan Cheng ⋅ Chi-Pin Huang ⋅ Fu-En Yang ⋅ Frank Wang
Tucson Ballroom & Prefunction Space 24
NerVast: Compression-Efficient Scaling of Implicit Neural Video Representations via Scene-based Parameter-sharing Poster Session 2 + Refreshments
Yunheon Lee ⋅ Juncheol Ye ⋅ Jaehong Kim ⋅ Dongsu Han
Tucson Ballroom & Prefunction Space 114
End-to-End Fine-Tuning of 3D Texture Generation using Differentiable Rewards Poster Session 1
Amirhossein Zamani ⋅ Tianhao Xie ⋅ Amir Aghdam ⋅ Tiberiu Popa ⋅ Eugene Belilovsky
Tucson Ballroom & Prefunction Space 17
Reverse Personalization Poster Session 1
Han-Wei Kung ⋅ Tuomas Varanka ⋅ Nicu Sebe
Tucson Ballroom & Prefunction Space 95
DiffRegCD: Integrated Registration and Change Detection with Diffusion Features Poster Session 6 + Refreshments
Seyedehanita Madani ⋅ Rama Chellappa ⋅ Vishal Patel
Tucson Ballroom & Prefunction Space 29
FSP-DETR: Few-Shot Prototypical Parasitic Ova Detection Poster Session 4 + Reception
Shubham Trehan ⋅ Udhav Ramachandran ⋅ Akash Rao ⋅ Ruth Scimeca ⋅ Sathya Aakur
Tucson Ballroom & Prefunction Space 101
MIX-based Foreground and Background Patch Augmentation Guided by Physics and Material Properties for X-ray Detection Poster Session 1
Xintong Liu ⋅ Dongliang Chang ⋅ Yujun Tong ⋅ Zhanyu Ma
Tucson Ballroom & Prefunction Space 94
Controllable Long-term Motion Generation with Extended Joint Targets Poster Session 4 + Reception
Eunjong Lee ⋅ Eunhee Kim ⋅ Sanghoon Hong ⋅ Eunho Jung ⋅ Jihoon Kim
Tucson Ballroom & Prefunction Space 84
MuSACo: Multimodal Subject-Specific Selection and Adaptation for Expression Recognition with Co-Training Poster Session 3
Muhammad Osama Zeeshan ⋅ Natacha Gillet ⋅ Alessandro Lameiras Koerich ⋅ Marco Pedersoli ⋅ Francois Bremond ⋅ Eric Granger
Tucson Ballroom & Prefunction Space 66
A Deep Network for Object Detection on Inland Waters Poster Session 5
Dennis Griesser ⋅ Bastian Goldluecke ⋅ Matthias Franz ⋅ Georg Umlauf
Tucson Ballroom & Prefunction Space 76
Unsupervised Memorability Modeling from Tip-of-the-Tongue Retrieval Queries Poster Session 3
Sree Bhattacharyya ⋅ Yaman Singla ⋅ Sudhir Yarram ⋅ Somesh Singh ⋅ Harini S I ⋅ James Wang
Tucson Ballroom & Prefunction Space 126
VAST-ReID: A Low-Light Benchmark Dataset for Person Re-Identification with Visual and Attribute-Rich Semantic Tracking Poster Session 5
Hammad Khan ⋅ Rakesh Giri ⋅ Kamalakar Thakare ⋅ Heeseung Choi ⋅ Hyungjoo Jung ⋅ Debi Dogra ⋅ Ig-Jae Kim
Tucson Ballroom & Prefunction Space 4
CONSTANT: Towards High-Quality One-Shot Handwriting Generation with Patch Contrastive Enhancement and Style-Aware Quantization Poster Session 4 + Reception
Anh-Duy Le ⋅ Van Pham ⋅ Thanh Vo ⋅ Mai Toan ⋅ Tuan-Anh Tran
Tucson Ballroom & Prefunction Space 1
One-Shot Fine-Grained Re-Identification of Paint Marked Honey Bees using Vision Foundation Models Poster Session 1
Luke Meyers ⋅ Josué A. Rodríguez-Cordero ⋅ Remi Megret
Tucson Ballroom & Prefunction Space 54
Automated Suturing Skill Assessment in Robot-assisted Surgery from Endoscopic Videos using Clinically-guided Evaluation Criteria Poster Session 6 + Refreshments
Atharva Deo ⋅ Ujjwal Pasupulety ⋅ Nicholas Matsumoto ⋅ Jay Moran ⋅ Cherine Yang ⋅ Jeanine Kim ⋅ Rafal Kocielnik ⋅ Aurash Naser-Tavakolian ⋅ Andrew Hung
Tucson Ballroom & Prefunction Space 2
Enhancing Vision Language Corruption Robustness using Cross Distribution & Prompted Denoisers Poster Session 4 + Reception
Sameer Shafayet Latif ⋅ Sadab Shiper ⋅ K. Kiran ⋅ Md Ishmam ⋅ MD HOSSAIN ⋅ Abu Kamal ⋅ Md. Ashmafee
Tucson Ballroom & Prefunction Space 141
FCC: Fully Connected Correlation for One-Shot Segmentation Poster Session 4 + Reception
Seonghyeon Moon ⋅ Haein Kong ⋅ Muhammad Haris Khan ⋅ Mubbasir Kapadia ⋅ Yuewei Lin
Tucson Ballroom & Prefunction Space 52
UI-Styler: Ultrasound Image Style Transfer with Class-Aware Prompts for Cross-Device Diagnosis Using a Frozen Black-Box Inference Network Poster Session 2 + Refreshments
Nhat-Tuong Do-Tran ⋅ Ngoc-Hoang-Lam Le ⋅ Ching-Chun Huang
Tucson Ballroom & Prefunction Space 128
ISALux: Illumination and Semantics-Aware Transformer Employing Mixture of Experts for Low Light Image Enhancement Poster Session 6 + Refreshments
Raul Balmez ⋅ Alexandru Brateanu ⋅ Ciprian Orhei ⋅ Codruta Ancuti ⋅ Cosmin Ancuti
Tucson Ballroom & Prefunction Space 63
KMOPS: Keypoint-Driven Method for Multi-Object Pose and Metric Size Estimation from Stereo Images Poster Session 3
Ying-Kun Wu ⋅ Yi Shen ⋅ Tzuhsuan Huang ⋅ I-Sheng Fang ⋅ Jun-Cheng Chen
Tucson Ballroom & Prefunction Space 132
Learning Unified Spatio-temporal Representations for Efficient Compressed Video Understanding Poster Session 4 + Reception
Shristi Biswas Biswas ⋅ Efstathia Soufleri ⋅ Arani Roy ⋅ Kaushik Roy
Tucson Ballroom & Prefunction Space 45
HiGlassRM: Learning to Remove High-prescription Glasses via Synthetic Dataset Generation Poster Session 4 + Reception
Sebin Lee ⋅ Heewon Kim
Tucson Ballroom & Prefunction Space 28
Enhancing Object Detection Training via Joint Image-Annotation Generation Poster Session 2 + Refreshments
Roy Uziel ⋅ Oded Bialer
Tucson Ballroom & Prefunction Space 31
R-MMA: Enhancing Vision-Language Models with Recurrent Adapters for Few-Shot and Cross-Domain Generalization Poster Session 5
Md Fahim ⋅ Md Ishmam ⋅ Mir Sazzat Hossain ⋅ M Ashraful Amin ⋅ Amin Ali ⋅ A K M Mahbubur Rahman
Tucson Ballroom & Prefunction Space 67
OPFormer: Object Pose Estimation leveraging foundation model with geometric encoding Poster Session 3
Artem Moroz ⋅ Vít Zeman ⋅ Martin Mikšík ⋅ Elizaveta Isianova ⋅ Miroslav David ⋅ Pavel Burget ⋅ Varun Burde
Tucson Ballroom & Prefunction Space 133
Robust Scene Coordinate Regression via Geometrically-Consistent Global Descriptors Poster Session 6 + Refreshments
Son Tung Nguyen ⋅ Alejandro Fontan ⋅ Michael Milford ⋅ Tobias Fischer
Tucson Ballroom & Prefunction Space 96
SphereEdit: Spherical Semantic Editing in Diffusion Models Poster Session 6 + Refreshments
Salamata Konate ⋅ Hassan Hamidi ⋅ Elham Dolatabadi ⋅ Frank Rudzicz ⋅ Laleh Seyyed-Kalantari
Tucson Ballroom & Prefunction Space 84
NoHumansRequired: Autonomous High-Quality Image Editing Triplet Mining Poster Session 5
Maksim Kuprashevich ⋅ Grigorii Alekseenko ⋅ Irina Tolstykh ⋅ Georgii Fedorov ⋅ Bulat Suleimanov ⋅ Vladimir Dokholyan ⋅ Aleksandr Gordeev
Tucson Ballroom & Prefunction Space 25
ProtoGMVAE: A Variational Auto-Encoder with True Gaussian Mixture Prior for Prototypical-based Self-Explainability Poster Session 4 + Reception
Martin Blanchard ⋅ Christophe Ducottet ⋅ Damien Muselet ⋅ Olivier Delézay
Tucson Ballroom & Prefunction Space 106
Stabilizing Direct Training of Spiking Neural Networks: Membrane Potential Initialization and Threshold-robust Surrogate Gradient Poster Session 6 + Refreshments
Hyunho Kook ⋅ Byeongho Yu ⋅ Jeong Oh ⋅ Eunhyeok Park
Tucson Ballroom & Prefunction Space 123
MR-Pruner: Training-free Multi-resolution Visual Token Pruning for Multi-modal Large Language Models Poster Session 1
Seunghoon Han ⋅ Hyewon Lee ⋅ Soyoung Park ⋅ Jong-Ryul Lee ⋅ Sungsu Lim
Tucson Ballroom & Prefunction Space 104
Uncertainty-Aware Subset Selection for Robust Visual Explainability under Distribution Shifts Poster Session 2 + Refreshments
Madhav Gupta ⋅ Vishak Prasad C ⋅ Ganesh Ramakrishnan
Tucson Ballroom & Prefunction Space 22
SSMT-Net: A Semi-Supervised Multitask Transformer-Based Network for Thyroid Nodule Segmentation in Ultrasound Images Poster Session 5
Muhammad Umar Farooq ⋅ Abd Ur Rehman ⋅ Azka Rehman ⋅ Muhammad Usman ⋅ Dong-Kyu Chae
Tucson Ballroom & Prefunction Space 26
LooC: Effective Low-Dimensional Codebook for Compositional Vector Quantization Poster Session 1
Jie Li ⋅ Kwan-Yee K. Wong ⋅ Kai Han
Tucson Ballroom & Prefunction Space 16
Quantifying the Limits of Segmentation Foundation Models: Modeling Challenges in Segmenting Tree-Like and Low-Contrast Objects Poster Session 4 + Reception
Yixin Zhang ⋅ Nicholas Konz ⋅ Kevin Kramer ⋅ Maciej Mazurowski
Tucson Ballroom & Prefunction Space 88
Fully Unsupervised Self-debiasing of Text-to-Image Diffusion Models Poster Session 1
Korada Sri Vardhana ⋅ Shrikrishna Lolla ⋅ Soma Biswas
Tucson Ballroom & Prefunction Space 117
Beyond the Highlights: Video Retrieval with Salient and Surrounding Contexts Poster Session 2 + Refreshments
Jaehun Bang ⋅ Moon Ye-Bin ⋅ Tae-Hyun Oh ⋅ Kyungdon Joo
Tucson Ballroom & Prefunction Space 74
Histogram Assisted Quality Aware Generative Model for Resolution Invariant NIR Image Colorization Poster Session 5
Abhinav Abhinav ⋅ Rajeev Ranjan Dwivedi ⋅ Samiran Das ⋅ Vinod Kurmi
Tucson Ballroom & Prefunction Space 60
Revisiting an Old Perspective Projection for Monocular 3D Morphable Models Regression Poster Session 6 + Refreshments
Toby Chong ⋅ Ryota Nakajima
Tucson Ballroom & Prefunction Space 57
CAPE: A CLIP-Aware Pointing Ensemble of Complementary Heatmap Cues for Embodied Reference Understanding Poster Session 3
Fevziye Irem Eyiokur ⋅ Dogucan Yaman ⋅ Hazım Ekenel ⋅ Alexander Waibel
Tucson Ballroom & Prefunction Space 98
Augmenting with NeRFs: Fast Relocalization on Densified Datasets Poster Session 3
Michael Tomadakis ⋅ Rebecca Borissova ⋅ Yuxuan Zhang ⋅ Sanjeev Koppal
Tucson Ballroom & Prefunction Space 14
iMotion-LLM: Instruction-Conditioned Trajectory Generation Poster Session 2 + Refreshments
Abdulwahab Felemban ⋅ Nussair Hroub ⋅ Jian Ding ⋅ Faizan Khan ⋅ Xiaoqian Shen ⋅ Abduallah Mohamed ⋅ Mohamed Elhoseiny
Tucson Ballroom & Prefunction Space 123
DreamMakeup: Face Makeup Customization using Latent Diffusion Models Poster Session 1
Geon Yeong Park ⋅ Inhwa Han ⋅ Serin Yang ⋅ Yeobin Hong ⋅ Seongmin Jeong ⋅ Heechan Jeon ⋅ Myeongjin Goh ⋅ Sung Yi ⋅ Jin Nam ⋅ Jong Ye
Tucson Ballroom & Prefunction Space 41
An Efficient Multi-Rater Setup Towards Personalized and Diversified Medical Image Segmentation Poster Session 4 + Reception
Sajed Almorsy ⋅ Ayman Khalafallah ⋅ Marwan Torki
Tucson Ballroom & Prefunction Space 99
Salience-SGG: Enhancing Unbiased Scene Graph Generation with Iterative Salience Estimation Poster Session 1
Runfeng Qu ⋅ Ole Hall ⋅ Pia Bideau ⋅ Julie Ouerfelli-Ethier ⋅ Martin Rolfs ⋅ Klaus Obermayer ⋅ Olaf Hellwich
Tucson Ballroom & Prefunction Space 99
CURIO: Curvature-Aligned and Efficient OCR for Low-Resource Historical Manuscripts Poster Session 2 + Refreshments
Sai Madhusudan Gunda ⋅ Tathagata Ghosh ⋅ Simran Sandral ⋅ Ravi Kiran Sarvadevabhatla
Tucson Ballroom & Prefunction Space 57
SkelSplat: Robust Multi-view 3D Human Pose Estimation with Differentiable Gaussian Rendering Poster Session 3
Laura Bragagnolo ⋅ Leonardo Barcellona ⋅ Stefano Ghidoni
Tucson Ballroom & Prefunction Space 11
Learning spatio-temporal feature representations for video-based gaze estimation Poster Session 4 + Reception
Alexandre Personnic ⋅ Mihai Bace
Tucson Ballroom & Prefunction Space 80
VLMs Guided Interpretable Decision Making in Autonomous Driving Poster Session 4 + Reception
Xin Hu ⋅ TAOTAO JING ⋅ Renran Tian ⋅ Zhengming Ding
Tucson Ballroom & Prefunction Space 20
Enhancing Monocular 3D Hand Reconstruction with Learned Texture Priors Poster Session 5
Giorgos Karvounas ⋅ Nikolaos Kyriazis ⋅ Iason Oikonomidis ⋅ Georgios Pavlakos ⋅ Antonis Argyros
Tucson Ballroom & Prefunction Space 121
Systematic Analysis of the Unintentional CSAM-Generation-Potential of Text-to-Image Models Poster Session 1
Nicolas Göller ⋅ Martin Steinebach
Tucson Ballroom & Prefunction Space 48
Enhanced Back-Projection of Vision Features for 3D Symmetry Detection Poster Session 1
Isaac Aguirre ⋅ Ivan Sipiran
Tucson Ballroom & Prefunction Space 7
Descrip3D: Enhancing Large Language Model-based 3D Scene Understanding with Object-Level Text Descriptions Poster Session 2 + Refreshments
Jintang Xue ⋅ Ganning Zhao ⋅ Jie-En Yao ⋅ Hong-En Chen ⋅ Yue Hu ⋅ Meida Chen ⋅ Suya You ⋅ Chung Chieh Kuo
Tucson Ballroom & Prefunction Space 32
MARS: a Multimodal Alignment and Ranking System for Few-Shot Segmentation Poster Session 1
Nico Catalano ⋅ Stefano Samele ⋅ Paolo Pertino ⋅ Matteo Matteucci
Tucson Ballroom & Prefunction Space 123
Occlusion Boundary and Depth: Mutual Enhancement via Multi-Task Learning Poster Session 4 + Reception
Lintao XU ⋅ Yinghao WANG ⋅ Chaohui Wang
Tucson Ballroom & Prefunction Space 14
MoRe: Monocular Geometry Refinement via Graph Optimization for Cross-View Consistency Poster Session 4 + Reception
Dongki Jung ⋅ Jaehoon Choi ⋅ Yonghan Lee ⋅ Sungmin Eum ⋅ Heesung Kwon ⋅ Dinesh Manocha
Tucson Ballroom & Prefunction Space 53
Vision-informed Semantic Text Alignment for Open-set Recognition in Remote Sensing Poster Session 2 + Refreshments
Siddhant Gole ⋅ Akash Pal ⋅ Ankit Jha ⋅ Subhasis Chaudhuri ⋅ Biplab Banerjee
Tucson Ballroom & Prefunction Space 134
GrounDiff: Diffusion-Based Ground Surface Generation from Digital Surface Models Poster Session 1
Oussema Dhaouadi ⋅ Johannes Meier ⋅ Jacques Kaiser ⋅ Daniel Cremers
Tucson Ballroom & Prefunction Space 130
RampWatch: An In-the-Wild Dataset and Text-Guided Detection Framework for Recreational Vessels Poster Session 6 + Refreshments
Malik Muhammad Asim ⋅ Claire Smallwood ⋅ Abdullah Tariq ⋅ Johnny Lo ⋅ Syed Zulqarnain Gilani
Tucson Ballroom & Prefunction Space 36
AnyAnomaly: Zero-Shot Customizable Video Anomaly Detection with LVLM Poster Session 3
Sunghyun Ahn ⋅ Youngwan Jo ⋅ Kijung Lee ⋅ Sein Kwon ⋅ Inpyo Hong ⋅ Sanghyun Park
Tucson Ballroom & Prefunction Space 10
STARS: Self-supervised Tuning for 3D Action Recognition in Skeleton Sequences Poster Session 2 + Refreshments
Soroush Mehraban ⋅ Javad Rajabi ⋅ Andrea Iaboni ⋅ Babak Taati
Tucson Ballroom & Prefunction Space 137
ObjectMeshDeform: Towards recovering precise 3D geometry of real objects via image-guided mesh deformation of 3D generative priors Poster Session 2 + Refreshments
Siddharth Katageri ⋅ SANJANA SINHA ⋅ Sourav Ghosh ⋅ Soumyadip Maity ⋅ Brojeshwar Bhowmick
Tucson Ballroom & Prefunction Space 111
PADM: A Physics-aware Diffusion Model for Attenuation Correction Poster Session 2 + Refreshments
Trung Pham ⋅ Hoang Vu ⋅ Anh Chu ⋅ Dac Thai Nguyen ⋅ Trung Thanh Nguyen ⋅ THAO TRUONG TRUONG ⋅ Mai Son ⋅ Thanh Nguyen ⋅ Phi Le Nguyen
Tucson Ballroom & Prefunction Space 113
Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression Poster Session 3
Roy Jennings ⋅ Genady Paikin ⋅ Roy Shaul ⋅ Evgeny Soloveichik
Tucson Ballroom & Prefunction Space 52
D2Mamba: Dual Domain Guided Informed Search in State Space Model for Underwater Image Enhancement Poster Session 5
Alik Pramanick ⋅ Soumajit Roy ⋅ ARIJIT SUR
Tucson Ballroom & Prefunction Space 126
TopoRec: Point Cloud Recognition Using Topological Data Analysis Poster Session 6 + Refreshments
Anirban Ghosh ⋅ Iliya Kulbaka ⋅ Ian Dahlin ⋅ Ayan Dutta
Tucson Ballroom & Prefunction Space 33
AdaptViG: Adaptive Vision GNN with Exponential Decay Gating Poster Session 1
Mustafa Munir ⋅ Mostafijur Rahman ⋅ Radu Marculescu
Tucson Ballroom & Prefunction Space 43
DynaGSLAM: Real-Time Gaussian-Splatting SLAM for Online Rendering, Tracking, Motion Predictions of Moving Objects in Dynamic Scenes Poster Session 2 + Refreshments
Runfa Li ⋅ Mahdi Shaghaghi ⋅ Keito Suzuki ⋅ Xinshuang Liu ⋅ Varun Moparthi ⋅ Bang Du ⋅ Walker Curtis ⋅ Martin Renschler ⋅ Ki Myung Brian Lee ⋅ Nikolay Atanasov ⋅ Truong Nguyen
Tucson Ballroom & Prefunction Space 97
SD-CSFL: A Synthetic Data-Driven Conformity Scoring Framework for Robust Federated Learning Poster Session 5
Ebtisaam Alharbi ⋅ Abdulrahman Kerim ⋅ Leandro Soriano Marcolino ⋅ Qiang Ni
Tucson Ballroom & Prefunction Space 105
AirLock+: Scaling UAV-to-Satellite Image Registration for Target Geolocalization and Geospatial Augmented Reality Poster Session 3
Zhiyun Deng ⋅ Austin Case ⋅ Luis Sentis
Tucson Ballroom & Prefunction Space 40
Gaussian Swaying: Surface-Based Framework for Aerodynamic Simulation with 3D Gaussians Poster Session 4 + Reception
Hongru Yan ⋅ Xiang Zhang ⋅ Zeyuan Chen ⋅ Fangyin Wei ⋅ Zhuowen Tu
Tucson Ballroom & Prefunction Space 62
Overcoming Fine-Grained Visual Challenges in Animal Re-Identification via Semantic Feature Alignment Poster Session 1
Yihao Wu ⋅ Di Zhao ⋅ Yuzhuo Li ⋅ Matthew Alajas ⋅ Alistair Glen ⋅ Jingfeng Zhang ⋅ Gillian Dobbie ⋅ Daniel Wilson ⋅ Yun Sing Koh
Tucson Ballroom & Prefunction Space 36
UniDiff: Parameter-Efficient Adaptation of Diffusion Models for Land Cover Classification with Multi-Modal Remotely Sensed Imagery and Sparse Annotations Poster Session 4 + Reception
Yuzhen Hu ⋅ Saurabh Prasad
Tucson Ballroom & Prefunction Space 31
Zero-LEAD: Source-Free Universal Domain Adaptation for Abdominal Multi-Organ Segmentation Poster Session 5
Ahmed El-Sayed ⋅ Marwan Torki
Tucson Ballroom & Prefunction Space 87
Overcoming Small Data Limitations in Video-Based Infant Respiration Estimation Poster Session 5
Liyang Song ⋅ Hardik Bishnoi ⋅ Sai Manne ⋅ Sarah Ostadabbas ⋅ Briana Taylor ⋅ Michael Wan
Tucson Ballroom & Prefunction Space 52
SUGAR: A Sweeter Spot for Generative Unlearning of Many Identities Poster Session 2 + Refreshments
Dung Nguyen ⋅ Quang Nguyen ⋅ Preston Robinette ⋅ Eli Jiang ⋅ Taylor Johnson ⋅ Kevin Leach
Tucson Ballroom & Prefunction Space 125
One-shot Portrait Stylization via Geometric Alignment Poster Session 4 + Reception
Xinrui Wang ⋅ Zilin Guo ⋅ Zhuoru Li ⋅ Jinze Yu ⋅ Heng Zhang ⋅ Yusuke Iwasawa ⋅ Yutaka Matsuo ⋅ Jiaxian Guo
Tucson Ballroom & Prefunction Space 65
RobuMTL: Enhancing Multi-Task Learning Robustness Against Weather Conditions Poster Session 4 + Reception
Tasneem Shaffee ⋅ Sherief Reda
Tucson Ballroom & Prefunction Space 125
Graph-Based Spectral Attention with Multi-Spectral Images for Illuminant Estimation Poster Session 2 + Refreshments
Dong-Hoon Kang ⋅ Seung-Yeop Baek ⋅ Jong-Ok Kim
Tucson Ballroom & Prefunction Space 142
BoxSplitGen: A Generative Model for 3D Part Bounding Boxes in Varying Granularity Poster Session 2 + Refreshments
Juil Koo ⋅ Wei-Tung Lin ⋅ Chanho Park ⋅ Chanhyeok Park ⋅ Minhyuk Sung
Tucson Ballroom & Prefunction Space 35
AD2: Analysis and Detection of Adversarial Threats in Visual Perception for End-to-End Autonomous Driving Systems Poster Session 2 + Refreshments
Ishan Sahu ⋅ Somnath Hazra ⋅ Somak Aditya ⋅ Soumyajit Dey
Tucson Ballroom & Prefunction Space 27
LASOR: Towards Clinically Transparent and Explainable Ophthalmic Report Generation via Lesion-Aware Segmentation Poster Session 4 + Reception
Jian Park ⋅ Hyunseon Won ⋅ JeeEun Kim ⋅ JOON HWANG ⋅ Jeong Han ⋅ Ji Park ⋅ Daniel Hwang ⋅ Jinyoung Han
Tucson Ballroom & Prefunction Space 87
Can We Challenge Open-Vocabulary Object Detectors with Generated Content in Street Scenes? Poster Session 1
Annika Mütze ⋅ Sadia Ilyas ⋅ Christian Dörpelkus ⋅ Matthias Rottmann
Tucson Ballroom & Prefunction Space 71
SOAF: Scene Occlusion-aware Neural Acoustic Field Poster Session 3
Huiyu Gao ⋅ Jiahao Ma ⋅ David Ahmedt-Aristizabal ⋅ Chuong Nguyen ⋅ Miaomiao Liu
Tucson Ballroom & Prefunction Space 113
SOPHY: Generating Simulation-Ready Objects with Physical Materials Poster Session 4 + Reception
Junyi Cao ⋅ Evangelos Kalogerakis
Tucson Ballroom & Prefunction Space 39
Diversity Preserving Coresets for Image Quality Assessment Poster Session 6 + Refreshments
Arpita Nema ⋅ Hanwei Zhu ⋅ Xi Zhang ⋅ Weisi Lin
Tucson Ballroom & Prefunction Space 69
SeaClips: A Video Dataset for Maritime Object Detection Poster Session 4 + Reception
Franziska Denk ⋅ Christian Rankl ⋅ Shaban ALMOUAHED ⋅ David Moser ⋅ Robert Sablatnig
Tucson Ballroom & Prefunction Space 30
Tables Decoded: DELTA for Structure, TARQA for Understanding Poster Session 2 + Refreshments
Jahanvi Rajput ⋅ Dhruv Kudale ⋅ Saikiran Kasturi ⋅ Utkarsh Verma ⋅ Ganesh Ramakrishnan
Tucson Ballroom & Prefunction Space 129
DREAM: Dynamic Prompts and GuidedMix for Efficient Continual Adaptation of Visual-Language Models Poster Session 5
Evelyn Chee ⋅ Mong-Li Lee ⋅ Wynne Hsu
Tucson Ballroom & Prefunction Space 6
Blur2Sharp: Human Novel Pose and View Synthesis with Generative Prior Refinement Poster Session 3
Chia Lai ⋅ I-Hsuan Lo ⋅ Yen-Ku Yeh ⋅ Thanh-Nguyen Truong ⋅ Ching-Chun Huang
Tucson Ballroom & Prefunction Space 41
GorillaWatch: An Automated System for In-the-Wild Gorilla Re-Identification and Population Monitoring Poster Session 6 + Refreshments
Maximilian Schall ⋅ Felix Knöfel ⋅ Noah König ⋅ Jan Kubeler ⋅ Maximilian von Klinski ⋅ Joan Linnemann ⋅ Xiaoshi Liu ⋅ Iven Schlegelmilch ⋅ Ole Woyciniuk ⋅ Alexandra Schild ⋅ Dante Wasmuht ⋅ Magdalena Bermejo Espinet ⋅ Germán Illera Basas ⋅ Gerard de Melo
Tucson Ballroom & Prefunction Space 110
DATTA: Domain-Adversarial Test-Time Adaptation for Cross-Domain WiFi-Based Human Activity Recognition Poster Session 3
Julian Strohmayer ⋅ Rafael Sterzinger ⋅ Matthias Wödlinger ⋅ Martin Kampel
Tucson Ballroom & Prefunction Space 48
CLIP-IT: CLIP-based Pairing of Histology Images with Privileged Textual Information Poster Session 3
Banafsheh Karimian ⋅ Giulia Avanzato ⋅ Soufiane Belharbi ⋅ Alexis Guichemerre ⋅ Luke McCaffrey ⋅ Mohammadhadi Shateri ⋅ Eric Granger
Tucson Ballroom & Prefunction Space 75
Exploiting Label-Independent Regularization from Spatial Patterns for Whole Slide Image Analysis Poster Session 6 + Refreshments
Weiyi Wu ⋅ Xinwen Xu ⋅ Chongyang Gao ⋅ Xingjian Diao ⋅ Siting Li ⋅ Jiang Gui
Tucson Ballroom & Prefunction Space 136
Crafting Descriptive Information for a Zero-shot Method to Improve Knowledge-Based Visual Question Answering Performance Poster Session 3
Mohammad Moradi ⋅ Sudhir Mudur
Tucson Ballroom & Prefunction Space 19
From Few-Shot to Zero-Shot Pallet Load Recognition: A Deployed Embedding-Based Vision System for Industrial Logistics Poster Session 2 + Refreshments
Juan Jesús Losada del Olmo ⋅ Emilio Ballesteros ⋅ Pedro Lopez-de-Teruel ⋅ Alberto Ruiz
Tucson Ballroom & Prefunction Space 141
SaccadeX: Directed Acyclic Graph-based Semi-Supervised Learning of Continuous Ocular Dynamics from Sparse Neuromorphic Streams Poster Session 1
Nuwan Bandara ⋅ Thivya Kandappu ⋅ Archan Misra
Tucson Ballroom & Prefunction Space 133
See, Think, Learn: A Self-Taught Multimodal Reasoner Poster Session 6 + Refreshments
Sourabh Sharma ⋅ Sonam Gupta ⋅ Sadbhawna Thakur
Tucson Ballroom & Prefunction Space 105
PVeRA: Probabilistic Vector-Based Random Matrix Adaptation Poster Session 2 + Refreshments
Leo Fillioux ⋅ Enzo Ferrante ⋅ Paul-Henry Cournède ⋅ Maria Vakalopoulou ⋅ Stergios Christodoulidis
Tucson Ballroom & Prefunction Space 100
Non-Aligned Reference Image Quality Assessment for Novel View Synthesis Poster Session 5
Abhijay Ghildyal ⋅ Rajesh Sureddi ⋅ Nabajeet Barman ⋅ Saman Zadtootaghaj ⋅ Alan Bovik
Tucson Ballroom & Prefunction Space 53
View-aware Cross-modal Distillation for Multi-view Action Recognition Poster Session 6 + Refreshments
Trung Thanh Nguyen ⋅ Yasutomo Kawanishi ⋅ Vijay John ⋅ Takahiro Komamizu ⋅ Ichiro Ide
Tucson Ballroom & Prefunction Space 54
Beyond Real Weights: Hypercomplex Representations for Stable Quantization Poster Session 1
Jawad Ibn Ahad ⋅ Maisha Rahman ⋅ Amrijit Biswas ⋅ Muhammad Kabir ⋅ Robin Krambroeckers ⋅ Sifat Momen ⋅ Nabeel Mohammed ⋅ Shafin Rahman
Tucson Ballroom & Prefunction Space 113
Power of Boundary and Reflection: Semantic Transparent Object Segmentation using Pyramid Vision Transformer with Transparent Cues Poster Session 3
Tuan-Anh Vu ⋅ Nguyen Hai ⋅ Ziqiang Zheng ⋅ Binh-Son Hua ⋅ Qing Guo ⋅ Ivor Tsang ⋅ Sai-Kit Yeung
Tucson Ballroom & Prefunction Space 42
QAL: A Loss for Recall–Precision Balance in 3D Reconstruction Poster Session 6 + Refreshments
Pranay Meshram ⋅ Yash Turkar ⋅ Kartikeya Singh ⋅ Praveen Raj Masilamani ⋅ Charuvahan Adhivarahan ⋅ Karthik Dantu
Tucson Ballroom & Prefunction Space 73
Efficient Text-Guided Convolutional Adapter for the Diffusion Model Poster Session 3
Aryan Das ⋅ Koushik Biswas ⋅ Swalpa Roy ⋅ Badri Patro ⋅ Vinay Verma
Tucson Ballroom & Prefunction Space 105
ClusterMine: Robust Label-Free Visual Out-Of-Distribution Detection via Concept Mining from Text Corpora Poster Session 2 + Refreshments
Nikolaos Adaloglou ⋅ Diana Petrusheva ⋅ Mohamed Asker ⋅ Felix Michels ⋅ Markus Kollmann
Tucson Ballroom & Prefunction Space 56
Digital Forensic AI You Can Explain: A Case Study on Video Source Camera Identification Poster Session 5
Maryna Veksler ⋅ Kemal Akkaya ⋅ Selcuk Uluagac
Tucson Ballroom & Prefunction Space 117
Confidence Through Parallel Attention for Depth and Uncertainty Estimation in Dynamic Environments Poster Session 4 + Reception
Onkar Susladkar ⋅ Rohit Pawar ⋅ Chirag Sehgal ⋅ Samaksh Ujjawal ⋅ Sparsh Mittal
Tucson Ballroom & Prefunction Space 11
TED-4DGS: Temporally Activated and Embedding-based Deformation for 4DGS Compression Poster Session 5
Cheng-Yuan Ho ⋅ He-Bi Yang ⋅ Jui-Chiu Chiang ⋅ Yu-Lun Liu ⋅ Wen-Hsiao Peng
Tucson Ballroom & Prefunction Space 55
Improvise, Adapt, Overcome — Telescopic Adapters for Efficient fine-tuning of Vision Language Models in Medical Imaging Poster Session 6 + Refreshments
Ujjwal Mishra ⋅ Vinita Shukla ⋅ Praful Hambarde ⋅ Amit Shukla
Tucson Ballroom & Prefunction Space 39
FedEFC: Federated Learning Using Enhanced Forward Correction Against Noisy Labels Poster Session 6 + Refreshments
Seunghun Yu ⋅ Jin-Hyun Ahn ⋅ Joonhyuk Kang
Tucson Ballroom & Prefunction Space 85
Analysis of Text Accuracy and Visual Alignment in Vision-Language Models for Artistic Text Generation Poster Session 1
Fatima Alderazi ⋅ Motaz Alfarraj
Tucson Ballroom & Prefunction Space 84
MoSCo: Real-time and Efficient Text-to-Motion Synthesis via Delta Training Poster Session 5
Zhiyuan Zhang ⋅ Lingqiao Liu
Tucson Ballroom & Prefunction Space 48
GDoFS: Gaussian DoF Separation for Plausible 3D Geometry in Sparse-View 3DGS Poster Session 5
Yongsung Kim ⋅ Jooyoung Choi ⋅ Sungroh Yoon
Tucson Ballroom & Prefunction Space 80
DexAvatar: 3D Sign Language Reconstruction with Hand and Body Pose Priors Poster Session 5
Kaustubh Kundu ⋅ Hrishav Barua ⋅ Lucy Robertson-Bell ⋅ Zhixi Cai ⋅ Kalin Stefanov
Tucson Ballroom & Prefunction Space 5
Feature-Disentangling RGB-NIR Fusion Network for Remote Driver Physiological Measurement Poster Session 1
Tayssir Bouraffa ⋅ Ziyuan Wang ⋅ Daniel Strüber
Tucson Ballroom & Prefunction Space 63
WiSE-OD: Benchmarking Robustness in Infrared Object Detection Poster Session 4 + Reception
Heitor Medeiros ⋅ Atif Belal ⋅ Masih Aminbeidokhti ⋅ Eric Granger ⋅ Marco Pedersoli
Tucson Ballroom & Prefunction Space 60
Gated Temporal Fusion Transformers for Robust Multi-Object Tracking Poster Session 4 + Reception
Jinho Kim ⋅ Kuk-Jin Yoon
Tucson Ballroom & Prefunction Space 23
WALDO: Where Unseen Model-based 6D Pose Estimation Meets Occlusion Poster Session 3
Sajjad Pakdamansavoji ⋅ Yintao Ma ⋅ Amir Rasouli ⋅ Tongtong Cao
Tucson Ballroom & Prefunction Space 110
Feedback Alignment Meets Low-Rank Manifolds: A Structured Recipe for Local Learning Poster Session 3
Arani Roy ⋅ Marco P. E. Apolinario ⋅ Shristi Biswas ⋅ Kaushik Roy
Tucson Ballroom & Prefunction Space 7
Learning Beyond Labels: Self-Supervised Handwritten Text Recognition Poster Session 5
Shree Mitra ⋅ Ajoy Mondal ⋅ Jawahar CV
Tucson Ballroom & Prefunction Space 81
FLoMo-Net: A Novel Task-Adaptive Mixture of Experts Routing Framework with Frequency and Uncertainty Correction for Medical Image Segmentation Poster Session 3
Md Rayhan Ahmed ⋅ Patricia Lasserre
Tucson Ballroom & Prefunction Space 106
VISTA: A Vision and Intent-Aware Social Attention Framework for Multi-Agent Trajectory Prediction Poster Session 1
Stephane Da Silva Martins ⋅ Emanuel Aldea ⋅ Sylvie Le Hégarat-Mascle
Tucson Ballroom & Prefunction Space 28
Orca: Object Recognition and Comprehension for Archiving Marine Species Poster Session 2 + Refreshments
Yuk Kwan Wong ⋅ Liang Haixin ⋅ Zeyu Ma ⋅ Yiwei Chen ⋅ Ziqiang Zheng ⋅ Rinaldi Gotama ⋅ Pascal Sebastian ⋅ Lauren Sparks ⋅ Sai-Kit Yeung
Tucson Ballroom & Prefunction Space 18
GaussianHeadTalk: Wobble-Free 3D Talking Heads with Audio Driven Gaussian Splatting Poster Session 6 + Refreshments
Madhav Agarwal ⋅ Mingtian Zhang ⋅ Laura Sevilla-Lara ⋅ Steven McDonagh
Tucson Ballroom & Prefunction Space 78
Pretraining Helps When Capacity Allows: Evidence from Ultra-Small ConvNets Poster Session 6 + Refreshments
Srikanth Muralidharan ⋅ Heitor Medeiros ⋅ Masih Aminbeidokhti ⋅ Eric Granger ⋅ Marco Pedersoli
Tucson Ballroom & Prefunction Space 107
Intra-Class Probabilistic Embeddings for Uncertainty Estimation in Vision-Language Models Poster Session 2 + Refreshments
Zhenxiang Lin ⋅ Maryam Haghighat ⋅ Will Browne ⋅ Dimity Miller
Tucson Ballroom & Prefunction Space 87
Do generative video models understand physical principles? Poster Session 1
Saman Motamed ⋅ Laura Culp ⋅ Kevin Swersky ⋅ Priyank Jaini ⋅ Robert Geirhos
Tucson Ballroom & Prefunction Space 91
RAT4D: Rig and Animate Objects without Surface Templates in 4D Poster Session 1
Mosam Dabhi ⋅ Simon Lucey ⋅ Laszlo Jeni
Tucson Ballroom & Prefunction Space 38
Mitigating Backdoor Attacks via Trigger Reconstruction and Model Hardening Poster Session 1
Guanhong Tao ⋅ Siyuan Cheng ⋅ Guangyu Shen ⋅ Yingqi Liu ⋅ Shengwei An ⋅ Zhuo Zhang ⋅ Zhenting Wang ⋅ Hanxi Guo ⋅ Xiangyu Zhang
Tucson Ballroom & Prefunction Space 56
Divide and Refine: Enhancing Multimodal Representation and Explainability for Emotion Recognition in Conversation Poster Session 2 + Refreshments
Tuan Mai ⋅ Cam-Van Thi Nguyen ⋅ Duc-Trong Le
Tucson Ballroom & Prefunction Space 122
SSplain: Sparse and Smooth Explainer for Retinopathy of Prematurity Classification Poster Session 2 + Refreshments
Elifnur Sunger ⋅ Tales Imbiriba ⋅ J. Campbell ⋅ Deniz Erdogmus ⋅ Stratis Ioannidis ⋅ Jennifer Dy
Tucson Ballroom & Prefunction Space 28
Broadcast2Pitch: Game State Reconstruction from Unconstrained Soccer Videos Poster Session 4 + Reception
Yin May Oo ⋅ Yewon Hwang ⋅ Muhammad Robbani ⋅ Vanyi Chao ⋅ Ankhzaya Jamsrandorj ⋅ Hoang Nguyen ⋅ Kyung-Ryoul Mun ⋅ Jinwook Kim
Tucson Ballroom & Prefunction Space 19
Dronaquatics: Real-time Swimming Analytics Using Drone Captured Imagery Poster Session 4 + Reception
Thu Tran ⋅ Harold Abraham Joseph ⋅ Kichang Lee ⋅ Kenny Choo ⋅ Dong Ma ⋅ Shaohui Foong ⋅ Thivya Kandappu ⋅ Jeonggil Ko ⋅ Rajesh Balan
Tucson Ballroom & Prefunction Space 57
Clear Sights on Site: A Spatial-Adaptive Channel Network for Deblurring Construction Site Images Poster Session 5
Bonyani ⋅ Maryam Soleymani ⋅ Chao Wang
Tucson Ballroom & Prefunction Space 108
SynPlay: Large-Scale Synthetic Human Data with Real-World Diversity for Aerial-View Perception Poster Session 1
Jinsub Yim ⋅ Hyungtae Lee ⋅ Sungmin Eum ⋅ Yi-Ting Shen ⋅ Yan Zhang ⋅ Heesung Kwon ⋅ Shuvra Bhattacharyya
Tucson Ballroom & Prefunction Space 90
Beyond Paired Data: Self-Supervised UAV Geo-Localization from Reference Imagery Alone Poster Session 6 + Refreshments
Tristan Amadei ⋅ Enric Meinhardt-Llopis ⋅ Benedicte Bascle ⋅ Corentin Abgrall ⋅ Gabriele Facciolo
Tucson Ballroom & Prefunction Space 20
Illuminating Darkness: Learning to Enhance Low-light Images In-the-Wild Poster Session 2 + Refreshments
S Sharif ⋅ Abdur Rehman ⋅ Zain Abidin ⋅ Fayaz Ali ⋅ Radu Timofte ⋅ Rizwan Naqvi
Tucson Ballroom & Prefunction Space 81
VideoSketcher: A Training-Free Approach for Coherent Video Sketch Transfer Poster Session 6 + Refreshments
Huining Li ⋅ Bangzhen Liu ⋅ Rui Yang ⋅ Yang Zhou ⋅ Chenshu Xu ⋅ Xufang Pang ⋅ Shengfeng He
Tucson Ballroom & Prefunction Space 13
Crash2DocAI: Automated Integration of Post-Crash Car Part Images into Technical Reports Poster Session 6 + Refreshments
Václav Diviš ⋅ Jessica Giovagnola ⋅ Khalil Ben Chikha ⋅ Marek Hrúz
Tucson Ballroom & Prefunction Space 101
TacticalCalib: End-to-End 6-DoF Camera Pose Regression for Tactical Camera Calibration Poster Session 5
Liang Fan ⋅ Xiaoqian Liu ⋅ Zhi Chen ⋅ Lingkai Yang
Tucson Ballroom & Prefunction Space 72
Joint Modeling of Corruption-Driven and Information-Limited Uncertainty for Robust 3D Gaussian Splatting Poster Session 1
Zeji Hui ⋅ Amirali Khodadadian Gostar ⋅ WeiQin Chuah ⋅ Alireza Bab-Hadiashar ⋅ Ruwan Tennakoon
Tucson Ballroom & Prefunction Space 66
No MoCap Needed: Post-Training Motion Diffusion Models with Reinforcement Learning using Only Textual Prompts Poster Session 1
Girolamo Macaluso ⋅ Lorenzo Mandelli ⋅ Mirko Bicchierai ⋅ Stefano Berretti ⋅ Andrew Bagdanov
Tucson Ballroom & Prefunction Space 93
Revisiting Layer Normalization for Point Cloud Test Time Adaptation Poster Session 1
Moslem Yazdanpanah ⋅ Ali Bahri ⋅ Mehrdad Noori ⋅ Sahar Dastani ⋅ Samuel Barbeau ⋅ David Osowiechi ⋅ Gustavo Vargas Hakim ⋅ Ismail Ayed ⋅ Christian Desrosiers
Tucson Ballroom & Prefunction Space 52
T2LF: LLM-Guided Multimodal Diffusion for Text-to-Light Field Synthesis Poster Session 6 + Refreshments
Soyoung Yoon ⋅ Namhyuk Ahn ⋅ In Kyu Park
Tucson Ballroom & Prefunction Space 12
SENCA-st: Integrating Spatial Transcriptomics and Histopathology with Cross Attention Shared Encoder for Region Identification in Cancer Pathology Poster Session 3
Shanaka Liyanaarachchi ⋅ Chathurya Wijethunga ⋅ Shihab Ahamed ⋅ Akthas Absar ⋅ Ranga Rodrigo
Tucson Ballroom & Prefunction Space 63
LogicCBMs: Logic-Enhanced Concept-Based Learning Poster Session 5
Deepika Vemuri ⋅ Gautham Bellamkonda ⋅ Aditya Pola ⋅ Vineeth Balasubramanian
Tucson Ballroom & Prefunction Space 23
SurgXBench: Explainable Vision-Language Model Benchmark for Surgery Poster Session 6 + Refreshments
Jiajun Cheng ⋅ Xianwu Zhao ⋅ Sainan Liu ⋅ Xiaofan Yu ⋅ Ravi Prakash ⋅ Patrick Codd ⋅ Jonathan Katz ⋅ Shan Lin
Tucson Ballroom & Prefunction Space 94
CountingDINO: A Training-free Pipeline for Class-Agnostic Counting using Unsupervised Backbones Poster Session 1
Giacomo Pacini ⋅ Lorenzo Bianchi ⋅ Luca Ciampi ⋅ Nicola Messina ⋅ Giuseppe Amato ⋅ Fabrizio Falchi
Tucson Ballroom & Prefunction Space 77
Personalized Image Privacy Advisors via Federated Daisy-Chaining Poster Session 2 + Refreshments
Sourasekhar Banerjee ⋅ Vengateswaran Subramaniam ⋅ Debaditya Roy ⋅ Vigneshwaran Subbaraju ⋅ Monowar Bhuyan
Tucson Ballroom & Prefunction Space 132
Reciprocal Teaching: Dynamic Multi-Model Teacher-Student Learning for Multiple Noisy Annotations Poster Session 6 + Refreshments
Wenjie Ai ⋅ Cuong Nguyen ⋅ Adrian Hilton ⋅ Gustavo Carneiro
Tucson Ballroom & Prefunction Space 111
WWE-UIE: A Wavelet & White Balance Efficient Network for Underwater Image Enhancement Poster Session 2 + Refreshments
Ching-Heng Cheng ⋅ Jen-Wei Lee ⋅ Chia-Ming Lee ⋅ Chih-Chung Hsu
Tucson Ballroom & Prefunction Space 69
CLIP’s Visual Embedding Projector is a Few-shot Cornucopia Poster Session 3
Mohammad Fahes ⋅ Tuan-Hung Vu ⋅ Andrei Bursuc ⋅ Patrick Perez ⋅ Raoul de Charette
Tucson Ballroom & Prefunction Space 32
SFMNet: Sparse Focal Modulation for 3D Object Detection Poster Session 5
Oren Shrout ⋅ Ayellet Tal
Tucson Ballroom & Prefunction Space 47
UltraClean: A Simple Framework to Train Robust Neural Networks against Backdoor Attacks Poster Session 6 + Refreshments
Bingyin Zhao ⋅ Yingjie Lao
Tucson Ballroom & Prefunction Space 109
LangPose: Language-Aligned Motion for Robust 3D Human Pose Estimation Poster Session 6 + Refreshments
Longyun Liao ⋅ Rong Zheng
Tucson Ballroom & Prefunction Space 83
Restora-Flow: Mask-Guided Image Restoration with Flow Matching Poster Session 4 + Reception
Arnela Hadzic ⋅ Franz Thaler ⋅ Lea Bogensperger ⋅ Simon Johannes Joham ⋅ Martin Urschler
Tucson Ballroom & Prefunction Space 63
RegionAligner: Bridging Ego-Exo Views for Object Correspondence via Unified Text-Visual Learning Poster Session 3
Yuhao Su ⋅ Ehsan Elhamifar
Tucson Ballroom & Prefunction Space 33
Scalable Video Action Anticipation with Cross Linear Attentive Memory Poster Session 6 + Refreshments
Zeyun Zhong ⋅ Manuel Martin ⋅ David Schneider ⋅ David Lerch ⋅ Chengzhi Wu ⋅ Frederik Diederichs ⋅ Juergen Gall ⋅ Jürgen Beyerer
Tucson Ballroom & Prefunction Space 87
Learning Compact Video Representations for Efficient Long-form Video Understanding in Large Multimodal Models Poster Session 3
Yuxiao Chen ⋅ Jue Wang ⋅ Zhikang Zhang ⋅ Jingru Yi ⋅ Xu Zhang ⋅ Yang Zou ⋅ Zhaowei Cai ⋅ Jianbo Yuan ⋅ Xinyu Li ⋅ Hao Yang ⋅ Davide Modolo
Tucson Ballroom & Prefunction Space 127
CSF-Net: Context-Semantic Fusion Network for Large Mask Inpainting Poster Session 6 + Refreshments
Chae-Yeon Heo ⋅ Yeong-Jun Cho
Tucson Ballroom & Prefunction Space 103
ChartQA-X: Generating Explanations for Visual Chart Reasoning Poster Session 5
Shamanthak Hegde ⋅ Pooyan Fazli ⋅ Hasti Seifi
Tucson Ballroom & Prefunction Space 63
AnyBald: Toward Realistic Diffusion-Based Hair Removal In-The-Wild Poster Session 2 + Refreshments
Yongjun Choi ⋅ Seungoh Han ⋅ Soomin Kim ⋅ Sumin Son ⋅ Mohsen Rohani ⋅ Edgar Maucourant ⋅ Dongbo Min ⋅ Kyungdon Joo
Tucson Ballroom & Prefunction Space 77
FAE-Net: Fashion Attribute Editing via Disentangled Latent Conditioning in Diffusion Models Poster Session 1
Parvatam Rajith Bhargav ⋅ Gaurab Bhattacharya ⋅ Vivek B S ⋅ Jayavardhana Gubbi
Tucson Ballroom & Prefunction Space 19
NRGMark: Localized Watermarking for Energy Transparency in Images Poster Session 6 + Refreshments
Shruti Agarwal ⋅ Élie Michel ⋅ Vishal Asnani ⋅ Tania Mathern ⋅ John Collomosse
Tucson Ballroom & Prefunction Space 55
ACuRE: Accurate Continuity-Regularized SpO2 Estimation Using Liquid Time-Constant Networks Poster Session 6 + Refreshments
Shahzad Ahmad ⋅ DR. MISHRA ⋅ Sania Bano ⋅ Sukalpa Chanda ⋅ Yogesh Rawat
Tucson Ballroom & Prefunction Space 5
F-ViTA: Foundation Model Guided Visible to Infrared Translation Poster Session 4 + Reception
Jay Paranjape ⋅ Celso de Melo ⋅ Vishal Patel
Tucson Ballroom & Prefunction Space 129
Graph Query Networks for Object Detection with Automotive Radar Poster Session 5
Loveneet Saini ⋅ Hasan Tercan ⋅ Tobias Meisen
Tucson Ballroom & Prefunction Space 113
Multi-Grained Text-Guided Image Fusion for Multi-Exposure and Multi-Focus Scenarios Poster Session 6 + Refreshments
Mingwei Tang ⋅ Jiahao Nie ⋅ Guang Yang ⋅ Ziqing Cui ⋅ Jie Li
Tucson Ballroom & Prefunction Space 45
FB-4D: Spatial-Temporal Coherent Dynamic 3D Content Generation with Feature Banks Poster Session 4 + Reception
Jinwei Li ⋅ Huan-ang Gao ⋅ Wenyi Li ⋅ Haohan Chi ⋅ Chenyu Liu ⋅ Chenxi Du ⋅ Yiqian Liu ⋅ Mingju Gao ⋅ Guiyu Zhang ⋅ Zongzheng Zhang ⋅ Li Yi ⋅ Yao Yao ⋅ Jingwei Zhao ⋅ Hongyang Li ⋅ Yikai Wang ⋅ Hao Zhao
Tucson Ballroom & Prefunction Space 96
Neural Geometry Image-Based Representations with Optimal Transport (OT) Poster Session 5
Xiang Gao ⋅ Yuanpeng Liu ⋅ Jiazhi Li ⋅ Xinmu Wang ⋅ Minghao Guo ⋅ Yu Guo ⋅ Xiyun Song ⋅ Heather Yu ⋅ Zhiqiang Lao ⋅ David Gu
Tucson Ballroom & Prefunction Space 83
LENVIZ: A High-Resolution Low-Exposure Night Vision Benchmark Dataset Poster Session 2 + Refreshments
Manjushree Aithal ⋅ Rosaura VidalMata ⋅ Manikandtan Kartha ⋅ Gong Chen ⋅ Eashan Adhikarla ⋅ Lucas Kirsten ⋅ Zhicheng Fu ⋅ Nikhil Madhusudhana ⋅ Joseph Nasti
Tucson Ballroom & Prefunction Space 106
DICE: Discrete Inversion Enabling Controllable Editing for Masked Generative Models Poster Session 1
Sen Zhang ⋅ Quan Dao ⋅ Ligong Han ⋅ Song Wen ⋅ Minhao Bai ⋅ Di Liu ⋅ Han Zhang ⋅ Felix Juefei-Xu ⋅ Chaowei Tan ⋅ Bo Liu ⋅ Martin Min ⋅ Kang Li ⋅ Faez Ahmed ⋅ Akash Srivastava ⋅ Hongdong Li ⋅ Junzhou Huang ⋅ Dimitri Metaxas
Tucson Ballroom & Prefunction Space 73
High-Level Semantics and Low-Level Features Fusion for Multi-Scale Object Detection in Dynamic Construction Environments Poster Session 5
Bonyani ⋅ Maryam Soleymani ⋅ Chao Wang
Tucson Ballroom & Prefunction Space 70
F-INR: Functional Tensor Decomposition for Implicit Neural Representations Poster Session 5
Sai Karthikeya Vemuri ⋅ Tim Büchner ⋅ Joachim Denzler
Tucson Ballroom & Prefunction Space 73
Meta-YOLO: Metadata-Guided Real-Time Object Detector in Aerial Imagery Poster Session 6 + Refreshments
Deukryeol Yoon ⋅ Seonghak Kim ⋅ Young Hwa Sung ⋅ Jinho Jung
Tucson Ballroom & Prefunction Space 74
Understanding Human-Like Biases in VLMs via Subjective Face Analytics Poster Session 1
Chaitanya Roygaga ⋅ Aparna Bharati
Tucson Ballroom & Prefunction Space 50
Integrating Multi-scale and Multi-filtration Topological Features for Medical Image Classification Poster Session 6 + Refreshments
Pengfei Gu ⋅ Huimin Li ⋅ Haoteng Tang ⋅ Dongkuan Xu ⋅ Erik Enriquez ⋅ Dongchul Kim ⋅ Bin Fu ⋅ Danny Chen
Tucson Ballroom & Prefunction Space 138
Decoupling Shape and Texture in SAM-2 via Controlled Texture Replacement Poster Session 3
Inbal Cohen ⋅ Boaz Meivar ⋅ Peihan Tu ⋅ Shai Avidan ⋅ Gal Oren
Tucson Ballroom & Prefunction Space 111
PEaRL: Pathway-Enhanced Representation Learning for Gene and Pathway Expression Prediction from Histology Poster Session 6 + Refreshments
Sejuti Majumder ⋅ Saarthak Kapse ⋅ Moinak Bhattacharya ⋅ Xuan Xu ⋅ Alisa Yurovsky ⋅ Prateek Prasanna
Tucson Ballroom & Prefunction Space 81
VectorSynth: Fine-Grained Satellite Image Synthesis with Structured Semantics Poster Session 5
Daniel Cher ⋅ Brian Wei ⋅ Srikumar Sastry ⋅ Nathan Jacobs
Tucson Ballroom & Prefunction Space 116
Feature Inversion as a Lens on Vision Encoders Poster Session 3
Eduard Allakhverdov ⋅ Dmitrii Tarasov ⋅ Elizaveta Goncharova ⋅ Andrei Kuznetsov
Tucson Ballroom & Prefunction Space 65
SAIL: Self-supervised Learning of Lighting-Invariant Representations from Real Images with Latent Diffusion Poster Session 3
Hala Djeghim ⋅ Céline Loscos ⋅ Désiré Sidibé
Tucson Ballroom & Prefunction Space 29
Stroke Modeling Enables Vectorized Character Generation with Large Vectorized Glyph Model Poster Session 3
Xinyue Zhang ⋅ Haolong Li ⋅ Jiawei Ma ⋅ Chen Ye
Tucson Ballroom & Prefunction Space 46
CaRS: A Causal Intervention Segmentation Framework and Benchmark Dataset for Autonomous Driving under Transitional Weather Conditions Poster Session 3
Madhavi Kondapally ⋅ Naveen Kumar K ⋅ C Mohan ⋅ Sobhan Babu
Tucson Ballroom & Prefunction Space 108
DirectDrag: High-Fidelity, Mask-Free, Prompt-Free Drag-based Image Editing via Readout-Guided Feature Alignment Poster Session 6 + Refreshments
Sheng-Hao Liao ⋅ Shang-Fu Chen ⋅ Tai-Ming Huang ⋅ Wen-Huang Cheng ⋅ Kailung Hua
Tucson Ballroom & Prefunction Space 99
DMS2F-HAD: A Dual-branch Mamba-based Spatial–Spectral Fusion Network for Hyperspectral Anomaly Detection Poster Session 4 + Reception
Aayushma Pant ⋅ Lakpa Tamang ⋅ Tsz-Kwan Lee ⋅ Sunil Aryal
Tucson Ballroom & Prefunction Space 128
MANTA: Physics-Informed Generalized Underwater Object Tracking Poster Session 3
Suhas Srinath ⋅ Hemang Jamadagni ⋅ Aditya Chandrasekar ⋅ Prathosh AP
Tucson Ballroom & Prefunction Space 53
A Fast, Simple, and Flexible Scale Informative Feature Transform Module for Arbitrary Scale Image Super-Resolution Poster Session 1
Aupendu Kar ⋅ Prabir Biswas
Tucson Ballroom & Prefunction Space 135
DCText: Scheduled Attention Masking for Visual Text Generation via Divide-and-Conquer Strategy Poster Session 4 + Reception
Jaewoo Song ⋅ Jooyoung Choi ⋅ Kanghyun Baek ⋅ Sangyub Lee ⋅ Daemin Park ⋅ Sungroh Yoon
Tucson Ballroom & Prefunction Space 2
Visual Detector Compression via Location-Aware Discriminant Analysis Poster Session 3
Qizhen Lan ⋅ Jung Choi ⋅ Qing Tian
Tucson Ballroom & Prefunction Space 60
ImageNet-sES: A First Systematic Study of Sensor–Environment Simulation Anchored by Real Recaptures Poster Session 1
Ji-yoon Kim ⋅ Eunsu Baek ⋅ Hyung-Sin Kim
Tucson Ballroom & Prefunction Space 107
Cross-Modal Event Encoder: Bridging Image–Text Knowledge to Event Streams Poster Session 3
SungHeon Jeong ⋅ Hanning Chen ⋅ Sanggeon Yun ⋅ Suhyeon Cho ⋅ Wenjun Huang ⋅ Xiangjian Liu ⋅ Mohsen Imani
Tucson Ballroom & Prefunction Space 28
Exploring Automated Recognition of Instructional Activity and Discourse from Multimodal Classroom Data Poster Session 5
Ivo Bueno ⋅ Ruikun Hou ⋅ Babette Bühler ⋅ Tim Fütterer ⋅ James Drimalla ⋅ Jonathan Foster ⋅ Peter Youngs ⋅ Peter Gerjets ⋅ Ulrich Trautwein ⋅ Enkelejda Kasneci
Tucson Ballroom & Prefunction Space 96
WSSSP-Net: Weakly Supervised Semantic Segmentation Plugin Network for Face Anti-Spoofing Poster Session 4 + Reception
Krzysztof Galus ⋅ Piotr Syga ⋅ Piotr Kawa
Tucson Ballroom & Prefunction Space 92
NAPP: Noise-Adaptive Prototype Perturbation for Few-Shot Learning Poster Session 6 + Refreshments
Il Kim ⋅ Sang Yun ⋅ Dongheon Lee ⋅ Seong Kim ⋅ Joonki Paik
Tucson Ballroom & Prefunction Space 77
Being Positive about Negative Queries: Exclusion Aware Multimodal Retrieval using Disentangled Representations Poster Session 6 + Refreshments
Prachi Jha ⋅ Sumit Bhatia ⋅ Srikanta Bedathur
Tucson Ballroom & Prefunction Space 60
PredMapNet: Future and Historical Reasoning for Consistent Online HD Vectorized Map Construction Poster Session 4 + Reception
Bo Lang ⋅ Nirav Savaliya ⋅ Zhihao Zheng ⋅ Jinglun Feng ⋅ Zheng-Hang Yeh ⋅ Mooi Choo Chuah
Tucson Ballroom & Prefunction Space 114
Inpainting of Sparse Depth Maps from Monocular Depth-from-Focus on Pixel Processor Arrays Poster Session 4 + Reception
Maciej Lewandowski ⋅ Piotr Dudek
Tucson Ballroom & Prefunction Space 127
Shift-Equivariant Complex-Valued Convolutional Neural Networks Poster Session 2 + Refreshments
Quentin Gabot ⋅ Teck-Yian Lim ⋅ Jeremy Fix ⋅ Joana Frontera-Pons ⋅ Chengfang Ren ⋅ Jean-Philippe Ovarlez
Tucson Ballroom & Prefunction Space 110
Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis Poster Session 5
Thang-Anh-Quan Nguyen ⋅ Laurent Caraffa ⋅ Jean-Philippe Tarel ⋅ Roland Brémond
Tucson Ballroom & Prefunction Space 54
ExDDV: A New Dataset for Explainable Deepfake Detection in Video Poster Session 3
Vlad Hondru ⋅ Eduard Hogea ⋅ Darian Onchis ⋅ Radu Ionescu
Tucson Ballroom & Prefunction Space 130
SCORE: Soft Label Compression-Centric Dataset Condensation via Coding Rate Optimization Poster Session 2 + Refreshments
Bowen Yuan ⋅ Yuxia Fu ⋅ Zijian Wang ⋅ Yadan Luo ⋅ Zi Huang
Tucson Ballroom & Prefunction Space 75
Direct Visual Grounding by Directing Attention of Visual Tokens Poster Session 4 + Reception
Parsa Esmaeilkhani ⋅ Longin Jan Latecki
Tucson Ballroom & Prefunction Space 144
MDUNet: Multimodal Decoding UNet for Passive Occluder-Aided Non-line-of-sight 3D Imaging Poster Session 1
Fadlullah Raji ⋅ John Murray-Bruce
Tucson Ballroom & Prefunction Space 45
One Model, Many Behaviors: Training-Induced Effects on Out-of-Distribution Detection Poster Session 3
Gerhard Krumpl ⋅ Henning Avenhaus ⋅ Horst Possegger
Tucson Ballroom & Prefunction Space 116
Imitating the Functionality of Image-to-Image Models Using a Single Example Poster Session 2 + Refreshments
Nurit Spingarn ⋅ Tomer Michaeli
Tucson Ballroom & Prefunction Space 73
NavMapFusion: Diffusion-based Fusion of Navigation Maps for Online Vectorized HD Map Construction Poster Session 6 + Refreshments
Thomas Monninger ⋅ Zihan Zhang ⋅ Steffen Staab ⋅ Sihao Ding
Tucson Ballroom & Prefunction Space 71
RobustFormer: Noise-Robust Pre-training for Images and Videos Poster Session 2 + Refreshments
Ashish Bastola ⋅ Nishant Luitel ⋅ Hao Wang ⋅ Danda Pani Paudel ⋅ Roshni Poudel ⋅ Abolfazl Razi
Tucson Ballroom & Prefunction Space 83
Rethinking Real Image Editing: Unleashing Diverse Editing Operators via Multi-Objective Optimization Poster Session 3
Siyuan Wang ⋅ Xi Yang ⋅ Zihao Zhou ⋅ Huiru Shao ⋅ Rui Zhang ⋅ Qiufeng Wang ⋅ Guangliang Cheng ⋅ Kaizhu Huang
Tucson Ballroom & Prefunction Space 118
SpecGen: Neural Spectral BRDF Generation via Spectral-Spatial Tri-plane Aggregation Poster Session 6 + Refreshments
Jin Zhenyu ⋅ Wenjie Li ⋅ Zhanyu Ma ⋅ Heng Guo
Tucson Ballroom & Prefunction Space 106
Surgical Gaussian Surfels: Highly Accurate Real-time Surgical Scene Rendering using Gaussian Surfels Poster Session 4 + Reception
Idris Sunmola ⋅ Zhenjun Zhao ⋅ Samuel Schmidgall ⋅ Yumeng Wang ⋅ Paul Maria Scheikl ⋅ Viet Pham ⋅ Axel Krieger
Tucson Ballroom & Prefunction Space 22
SCATR: Mitigating New Instance Suppression in LiDAR-based Tracking-by-Attention via Second Chance Assignment and Track Query Dropout Poster Session 3
Brian Cheong ⋅ Letian Wang ⋅ Sandro Papais ⋅ Steven Waslander
Tucson Ballroom & Prefunction Space 39
VFace: A Training-Free Approach for Diffusion-Based Video Face Swapping Poster Session 4 + Reception
Sanoojan Baliah ⋅ Yohan Abeysinghe ⋅ Rusiru Thushara ⋅ Khan Muhammad ⋅ Abhinav Dhall ⋅ Karthik Nandakumar ⋅ Muhammad Haris Khan
Tucson Ballroom & Prefunction Space 3
SegMango: Early Deep Mango Yield Prediction based on Flower Segmentation and Weather Data Poster Session 4 + Reception
Janaksinh Ven ⋅ Charu Sharma ⋅ Azeemuddin Syed
Tucson Ballroom & Prefunction Space 67
Diagnose Like A REAL Pathologist: An Uncertainty-Focused Approach for Trustworthy Multi-Resolution Multiple Instance Learning Poster Session 5
Sungrae Hong ⋅ Sol Lee ⋅ Jisu Shin ⋅ Jiwon Jeong ⋅ Mun Yi
Tucson Ballroom & Prefunction Space 32
Isolating the Role of Temporal Information in Video Saliency: A Controlled Experimental Analysis Poster Session 5
Peter El-Jiz ⋅ Matthias Kuemmerer ⋅ Matthias Tangemann ⋅ Matthias Bethge ⋅ Andreas Bartels ⋅ Michael Bannert
Tucson Ballroom & Prefunction Space 11
Safe Vision-Language Models via Unsafe Weights Manipulation Poster Session 4 + Reception
Moreno D'Incà ⋅ Elia Peruzzo ⋅ Xingqian Xu ⋅ Humphrey Shi ⋅ Nicu Sebe ⋅ Massimiliano Mancini
Tucson Ballroom & Prefunction Space 38
Structure-Aware Feature Rectification with Region Adjacency Graphs for Training-free Open-Vocabulary Semantic Segmentation Poster Session 3
Qiming Huang ⋅ Hao Ai ⋅ Jianbo Jiao
Tucson Ballroom & Prefunction Space 115
DCSHARP: 3D Gaussian Splatting with Direction Cosine Spherical Harmonics and Shape-Aware Pruning Poster Session 3
Ahmed Hasssan ⋅ Jian Meng ⋅ Yuanbo Xiangli ⋅ Jae-sun Seo
Tucson Ballroom & Prefunction Space 68
PSA-MIL: A Probabilistic Spatial Attention-Based Multiple Instance Learning for Whole Slide Image Classification Poster Session 1
Sharon Peled ⋅ Yosef Maruvka ⋅ Moti Freiman
Tucson Ballroom & Prefunction Space 116
Unsupervised Segmentation by Diffusing, Walking and Cutting Poster Session 4 + Reception
Daniela Ivanova ⋅ Marco Aversa ⋅ Paul Henderson ⋅ John Williamson
Tucson Ballroom & Prefunction Space 79
GAITGen: Disentangled Motion-Pathology Impaired Gait Generative Model -- Bringing Motion Generation to the Clinical Domain Poster Session 3
Vida Adeli ⋅ Soroush Mehraban ⋅ Majid Mirmehdi ⋅ Alan Whone ⋅ Benjamin Filtjens ⋅ Amirhossein Dadashzadeh ⋅ Alfonso Fasano ⋅ Andrea Iaboni ⋅ Babak Taati
Tucson Ballroom & Prefunction Space 22
milliMamba: Specular-Aware Human Pose Estimation via Dual mmWave Radar with Multi-Frame Mamba Fusion Poster Session 2 + Refreshments
Niraj Prakash Kini ⋅ Shiau-Rung Tsai ⋅ Guan-Hsun Lin ⋅ Wen-Hsiao Peng ⋅ Ching-Wen Ma ⋅ Jenq-Neng Hwang
Tucson Ballroom & Prefunction Space 7
Improving Animal Pose Estimation through Species Similarity Measures and Rigorous Label Definition Poster Session 4 + Reception
Medhashree Parhy ⋅ Shaan Chanchani ⋅ Claire Kim ⋅ Joshua Mansky ⋅ Parth Thakre ⋅ Zian Pan ⋅ Haoyu Chen ⋅ Amy Reibman
Tucson Ballroom & Prefunction Space 132
Comp4D: Compositional 4D Scene Generation Poster Session 3
Hanwen Liang ⋅ Dejia Xu ⋅ Neel Bhatt ⋅ Hezhen Hu ⋅ Hanxue Liang ⋅ Konstantinos Plataniotis
Tucson Ballroom & Prefunction Space 62
Food Image Generation on Multi-Noun Categories Poster Session 4 + Reception
Xinyue Pan ⋅ Yuhao Chen ⋅ Jiangpeng He ⋅ Fengqing Zhu
Tucson Ballroom & Prefunction Space 124
GraspDiffusion: Synthesizing Realistic Whole-body Hand-Object Interaction Poster Session 2 + Refreshments
Patrick Kwon ⋅ Chen Chen ⋅ Hanbyul Joo
Tucson Ballroom & Prefunction Space 93
Mem-MLP: Real-Time 3D Human Motion Generation from Sparse Inputs Poster Session 6 + Refreshments
Sinan Mutlu ⋅ Georgios Fotios Angelis ⋅ Savas Ozkan ⋅ Paul Wisbey ⋅ Anastasios Drosou ⋅ Mete Ozay
Tucson Ballroom & Prefunction Space 108
X-JEPA: A Novel Joint Learning Cross-Modal Predictive Alignment Framework for Remote Sensing Image Retrieval Poster Session 4 + Reception
Shabnam Choudhury ⋅ Yash Salunkhe ⋅ Vaibhav Rajan ⋅ Subhasis Chaudhuri ⋅ Biplab Banerjee
Tucson Ballroom & Prefunction Space 7
SOLAR: Switchable Output Layer for Accuracy and Robustness in Once-for-All Training Poster Session 6 + Refreshments
Shaharyar Ahmed Khan Tareen ⋅ Lei Fan ⋅ Xiaojing Yuan ⋅ Qin Lin ⋅ Bin Hu
Tucson Ballroom & Prefunction Space 66
Advancing Player Identification and Tracking with Global ID Fusion (GIF) Poster Session 6 + Refreshments
Karol Wojtulewicz ⋅ Minxing Liu ⋅ Niklas Carlsson
Tucson Ballroom & Prefunction Space 7
Line Art Colorization with Offset Prior-based Diffusion Model Poster Session 4 + Reception
Xuan Zhu ⋅ Miao Cao ⋅ Fang-Lue Zhang ⋅ Yu-Kun Lai ⋅ Paul Rosin
Tucson Ballroom & Prefunction Space 123
STRinGS: Selective Text Refinement in Gaussian Splatting Poster Session 6 + Refreshments
Abhinav Raundhal ⋅ Gaurav Behera ⋅ P Narayanan ⋅ Ravi Kiran Sarvadevabhatla ⋅ Makarand Tapaswi
Tucson Ballroom & Prefunction Space 130
Remote Sensing Forestry Similarity Convolution Poster Session 6 + Refreshments
Shikuan Wang ⋅ Yuangong Chen ⋅ Jianzhou Gong ⋅ Lingyi Meng ⋅ Mengquan Wu ⋅ Longxing Liu ⋅ Haiwei Yuan ⋅ Guo Mingbin
Tucson Ballroom & Prefunction Space 35
Unlocking Vision-Language Models for Video Anomaly Detection via Fine-Grained Prompting Poster Session 3
Shu Zou ⋅ Xinyu Tian ⋅ Lukas Wesemann ⋅ Fabian Waschkowski ⋅ Zhaoyuan Yang ⋅ Jing Zhang
Tucson Ballroom & Prefunction Space 125
RemEdit: Efficient Diffusion Editing with Riemannian Geometry Poster Session 4 + Reception
Eashan Adhikarla ⋅ Brian Davison
Tucson Ballroom & Prefunction Space 72
AusSmoke meets MultiNatSmoke: a fully-labelled diverse smoke segmentation dataset Poster Session 6 + Refreshments
Weihao Li ⋅ Hongjin Zhao ⋅ Gao Zhu ⋅ Ge-Peng Ji ⋅ Nicholas Wilson ⋅ Marta Yebra ⋅ Nick Barnes
Tucson Ballroom & Prefunction Space 76
Equivariant Sampling for Improving Diffusion Model-based Image Restoration Poster Session 5
Chenxu Wu ⋅ Qingpeng Kong ⋅ Peiang Zhao ⋅ Wendi Yang ⋅ Wenxin Ma ⋅ Fenghe Tang ⋅ Zihang Jiang ⋅ S Kevin Zhou
Tucson Ballroom & Prefunction Space 98
FlowEO: Generative Unsupervised Domain Adaptation for Earth Observation Poster Session 3
Georges Le Bellier ⋅ Nicolas Audebert
Tucson Ballroom & Prefunction Space 94
Deepfake Detection that Generalizes Across Benchmarks Poster Session 1
Andrii Yermakov ⋅ Jan Čech ⋅ Jiri Matas ⋅ Mario Fritz
Tucson Ballroom & Prefunction Space 74
HDR Reconstruction Boosting with Training-Free and Exposure-Consistent Diffusion Poster Session 6 + Refreshments
Yo-Tin Lin ⋅ Sykai Chen ⋅ Hou-Ning Hu ⋅ Yen-Yu Lin ⋅ Yu-Lun Liu
Tucson Ballroom & Prefunction Space 30
HiMix: Hierarchical Visual-Textual Mixing Network for Lesion Segmentation Poster Session 4 + Reception
Soojin Hwang ⋅ Jaeyoon Sim ⋅ Won Hwa Kim
Tucson Ballroom & Prefunction Space 100
Visibility guided Self-Supervised Occlusion Resilient Human Pose Estimation Poster Session 1
Arindam Dutta ⋅ Sarosij Bose ⋅ Rohit Kundu ⋅ Calvin-Khang Ta ⋅ Saketh Bachu ⋅ Konstantinos Karydis ⋅ Amit Roy-Chowdhury
Tucson Ballroom & Prefunction Space 101
Exploring the Boundaries of Diffusion Models for Offline Writer Identification with Sparse and Intra-Variable Data Poster Session 5
Aritra Dey ⋅ Chandranath Adak ⋅ Kumari Priya ⋅ Soumi Chattopadhyay ⋅ Sukalpa Chanda
Tucson Ballroom & Prefunction Space 131
A Woman with a Knife or A Knife with a Woman? Measuring Directional Bias Amplification in Image Captions Poster Session 1
Rahul Nair ⋅ Bhanu Tokas ⋅ Hannah Kerner
Tucson Ballroom & Prefunction Space 25
Non-Contact Blood Pressure Estimation from Face Videos via Physiology-Aware Contrastive Learning Poster Session 2 + Refreshments
JaeHyuk Son ⋅ Young-Seok Choi
Tucson Ballroom & Prefunction Space 95
DuPLUS: Dual-Prompt Vision-Language Framework for Universal Medical Image Segmentation and Prognosis Poster Session 6 + Refreshments
Numan Saeed ⋅ Tausifa Jan Saleem ⋅ Fadillah Maani ⋅ Muhammad Ridzuan ⋅ Hu Wang ⋅ Mohammad Yaqub
Tucson Ballroom & Prefunction Space 112
UniCoRN: Latent Diffusion-based Unified Controllable Image Restoration Network across Multiple Degradations Poster Session 2 + Refreshments
Debabrata Mandal ⋅ Soumitri Chattopadhyay ⋅ Guansen Tong ⋅ Praneeth Chakravarthula
Tucson Ballroom & Prefunction Space 13
PatchEAD: Unifying Industrial Visual Prompting Frameworks for Patch-Exclusive Anomaly Detection Poster Session 4 + Reception
Po-Han Huang ⋅ Jeng-Lin Li ⋅ Po-Hsuan Huang ⋅ Ming-Ching Chang ⋅ Wei-Chao Chen
Tucson Ballroom & Prefunction Space 119
EndoPBR: Photorealistic Synthetic Data for Surgical 3D Vision via Physically-based Rendering Poster Session 4 + Reception
John Han ⋅ Jie Ying Wu
Tucson Ballroom & Prefunction Space 126
Beyond the Encoder: Joint Encoder-Decoder Contrastive Pre-Training Improves Dense Prediction Poster Session 1
Sébastien Quetin ⋅ Tapotosh Ghosh ⋅ Farhad Maleki
Tucson Ballroom & Prefunction Space 96
Tables Guide Vision: Learning to See the Heart through Tabular Data Poster Session 2 + Refreshments
Marta Hasny ⋅ Maxime Di Folco ⋅ Keno Bressem ⋅ Julia Schnabel
Tucson Ballroom & Prefunction Space 29
Pose-Diverse Multi-View Virtual Try-on from a Single Frontal Image via Diffusion Transformer Poster Session 3
Seonghee Han ⋅ Minchang Chung ⋅ Gyeongsu Cho ⋅ Kyungdon Joo ⋅ Taehwan Kim
Tucson Ballroom & Prefunction Space 37
Dual-Domain Multimodal Hyperbolic Fusion for Cardiopulmonary Disease Diagnosis in Emergency Care Poster Session 6 + Refreshments
Ke Nan ⋅ Maggie Samaan ⋅ Benjamin Burns ⋅ Xia Ning ⋅ Yuchi Han ⋅ Yuan Xue
Tucson Ballroom & Prefunction Space 142
Enabling High-Quality In-the-Wild Imaging from Severely Aberrated Metalens Bursts Poster Session 1
Debabrata Mandal ⋅ Zhihan Peng ⋅ Yujie Wang ⋅ Praneeth Chakravarthula
Tucson Ballroom & Prefunction Space 81
FG-TRACER: Tracing Information Flow in Multimodal Large Language Models in Free-Form Generation Poster Session 6 + Refreshments
Alessia Saporita ⋅ Vittorio Pipoli ⋅ Federico Bolelli ⋅ Lorenzo Baraldi ⋅ Andrea Acquaviva ⋅ Elisa Ficarra
Tucson Ballroom & Prefunction Space 67
ReFineVQA: Iterative Refinement of Video Description via Feedback Generation for Video Question Answering Poster Session 6 + Refreshments
Jeongwan Shin ⋅ Chan Hur ⋅ Seongmin Cho ⋅ Jae-Ho Choi ⋅ Hyeyoung Park
Tucson Ballroom & Prefunction Space 43
From Lightweight CNNs to SpikeNets: Benchmarking Accuracy–Energy Tradeoffs with Pruned Spiking SqueezeNet Poster Session 1
Radib Kabir ⋅ Tawsif Tashwar Dipto ⋅ Mehedi Ahamed ⋅ Sabbir Ahmed ⋅ Md Hasanul Kabir
Tucson Ballroom & Prefunction Space 109
MAFM³: Modular Adaptation of Foundation Models for Multi-Modal Medical AI Poster Session 3
Mohammad Qazi ⋅ Munachiso Nwadike ⋅ Ibrahim Almakky ⋅ Mohammad Yaqub ⋅ Numan Saeed
Tucson Ballroom & Prefunction Space 55
Align Video Diffusion Model with Online Video-Centric Preference Optimization Poster Session 5
Jiacheng Zhang ⋅ Jie Wu ⋅ Weifeng Chen ⋅ Yatai Ji ⋅ Xuefeng Xiao ⋅ Weilin Huang ⋅ Kai Han
Tucson Ballroom & Prefunction Space 33
HABIT: Human Action Benchmark for Interactive Traffic in CARLA Poster Session 5
Mohan Ramesh ⋅ Mark Azer ⋅ Fabian Flohr
Tucson Ballroom & Prefunction Space 128
Explaining the Unseen: Multimodal Vision-Language Reasoning for Situational Awareness in Underground Mining Disasters Poster Session 1
Mizanur Rahman Jewel ⋅ Mohamed Elmahallawy ⋅ Sanjay Madria ⋅ Samuel Frimpong
Tucson Ballroom & Prefunction Space 127
Color Preserving CMOS-SPAD Fusion for Multi-Frame HDR Poster Session 4 + Reception
Aleksi Suonsivu ⋅ Lauri Salmela ⋅ Lassi Helin ⋅ Leevi Uosukainen ⋅ Giacomo Boracchi
Tucson Ballroom & Prefunction Space 78
Sea-CLIP: Mining Semantic-Aware Representations for Few-Shot Anomaly Detection with CLIP Poster Session 3
Xiao Guo ⋅ Zhimin Chen ⋅ Carlos Castillo ⋅ Hongcheng Wang ⋅ Xiaoming Liu
Tucson Ballroom & Prefunction Space 74
Unified Video Anomaly Detection Model for Detecting Different Anomaly Types Poster Session 1
Kijung Lee ⋅ Youngwan Jo ⋅ Sunghyun Ahn ⋅ Sanghyun Park
Tucson Ballroom & Prefunction Space 75
MageBench: Bridging Large Multimodal Models to Agents Poster Session 2 + Refreshments
Miaosen Zhang ⋅ Qi Dai ⋅ Yifan Yang ⋅ Jianmin Bao ⋅ Dongdong Chen ⋅ Kai Qiu ⋅ Chong Luo ⋅ Xin Geng ⋅ Baining Guo
Tucson Ballroom & Prefunction Space 1
DermEVAL: A Dermatologist-Reviewed Benchmark for Multimodal Large Language Models Poster Session 1
Hongjin Zhao ⋅ Weihao Li ⋅ Zhenyue Qin ⋅ Ge-Peng Ji ⋅ Yang Liu ⋅ Tom Gedeon ⋅ Nick Barnes
Tucson Ballroom & Prefunction Space 89
CAMP-VQA: Caption-Embedded Multimodal Perception for No-Reference Quality Assessment of Compressed Video Poster Session 2 + Refreshments
Xinyi Wang ⋅ Angeliki Katsenou ⋅ Junxiao Shen ⋅ David Bull
Tucson Ballroom & Prefunction Space 60
TaxonRL: Reinforcement Learning with Intermediate Rewards for Interpretable Fine-Grained Visual Reasoning Poster Session 2 + Refreshments
Maximilian von Klinski ⋅ Maximilian Schall
Tucson Ballroom & Prefunction Space 102
Patch Your Matcher: Correspondence-Aware Image-to-Image Translation Unlocks Cross-Modal Matching via Single-Modality Priors Poster Session 6 + Refreshments
Anton Frolov ⋅ Volker Rodehorst
Tucson Ballroom & Prefunction Space 68
MarineEval: Assessing the Marine Intelligence of Vision-Language Models Poster Session 2 + Refreshments
Yuk Kwan Wong ⋅ Tuan-An To ⋅ Jipeng Zhang ⋅ Ziqiang Zheng ⋅ Sai-Kit Yeung
Tucson Ballroom & Prefunction Space 5
CLIP-UP: CLIP-Based Unanswerable Problem Detection for Visual Question Answering Poster Session 5
Ben Vardi ⋅ Oron Nir ⋅ Ariel Shamir
Tucson Ballroom & Prefunction Space 10
CaFlow: Enhancing Long-Term Action Quality Assessment with Causal Counterfactual Flow Poster Session 4 + Reception
Ruisheng Han ⋅ Kanglei Zhou ⋅ Shuang Chen ⋅ Amir Atapour-Abarghouei ⋅ Hubert P. H. Shum
Tucson Ballroom & Prefunction Space 145
Layout Anything: One Transformer for Universal Room Layout Estimation Poster Session 2 + Refreshments
Md Sohag Mia ⋅ Muhammad Abdullah Adnan
Tucson Ballroom & Prefunction Space 15
Not Like Transformers: Drop the Beat Representation for Dance Generation with Mamba-Based Diffusion Model Poster Session 2 + Refreshments
Sangjune Park ⋅ Inhyeok Choi ⋅ Donghyeon Soon ⋅ Youngwoo Jeon ⋅ Kyungdon Joo
Tucson Ballroom & Prefunction Space 34
Distribution Highlighted Reference-based Label Distribution Learning for Facial Age Estimation Poster Session 5
Satoshi Suzuki ⋅ Shin'ya Yamaguchi ⋅ Shoichiro Takeda ⋅ Takuhiro Kaneko ⋅ Shota Orihashi ⋅ Ryo Masumura
Tucson Ballroom & Prefunction Space 64
Can Image Splicing and Copy-Move Forgery Be Detected by the Same Model? Forensim: An Attention-Based State-Space Approach Poster Session 5
Soumyaroop Nandi ⋅ Prem Natarajan
Tucson Ballroom & Prefunction Space 38
Rank-based Geographical Regularization: Revisiting Contrastive Self-Supervised Learning for Multispectral Remote Sensing Imagery Poster Session 4 + Reception
Tom Burgert ⋅ Leonard Hackel ⋅ Paolo Rota ⋅ Begüm Demir
Tucson Ballroom & Prefunction Space 9
AortaDiff: A Unified Multitask Diffusion Framework for Contrast-Free AAA Imaging Poster Session 6 + Refreshments
Yuxuan Ou ⋅ Ning Bi ⋅ Jiazhen Pan ⋅ Jiancheng Yang ⋅ Boliang Yu ⋅ Usama Zidan ⋅ Regent Lee ⋅ Vicente Grau
Tucson Ballroom & Prefunction Space 98
DARB-Splatting: Generalizing Splatting with Decaying Anisotropic Radial Basis Functions Poster Session 4 + Reception
Hashiru Pramuditha ⋅ Vinasirajan Viruthshaan ⋅ Vishagar Arunan ⋅ Saeedha Nazar ⋅ Sameera Ramasinghe ⋅ Simon Lucey ⋅ Ranga Rodrigo
Tucson Ballroom & Prefunction Space 21
Hierarchical Adaptive networks with Task vectors for Test-Time Adaptation Poster Session 4 + Reception
Sameer Ambekar ⋅ Marta Hasny ⋅ Laura Daza ⋅ Daniel Lang ⋅ Julia Schnabel
Tucson Ballroom & Prefunction Space 36
GFT: Graph Feature Tuning for Efficient Point Cloud Analysis Poster Session 6 + Refreshments
Manish Dhakal ⋅ Venkat Dasari ⋅ Rajshekhar Sunderraman ⋅ Yi Ding
Tucson Ballroom & Prefunction Space 72
IPCD: Intrinsic Point-Cloud Decomposition Poster Session 5
Shogo Sato ⋅ Takuhiro Kaneko ⋅ Shoichiro Takeda ⋅ Tomoyasu Shimada ⋅ Kazuhiko Murasaki ⋅ Taiga Yoshida ⋅ Ryuichi Tanida ⋅ Akisato Kimura
Tucson Ballroom & Prefunction Space 123
See, Record, Do: Automated Generation of UI Workflows from Tutorial Videos Poster Session 5
Adam Beauchaine ⋅ Craig Shue
Tucson Ballroom & Prefunction Space 44
Empowering Source-Free Domain Adaptation via MLLM-Guided Reliability-Based Curriculum Learning Poster Session 3
Dongjie Chen ⋅ Kartik Patwari ⋅ Zhengfeng Lai ⋅ Xiaoguang Zhu ⋅ Sen-ching Cheung ⋅ Chen-Nee Chuah
Tucson Ballroom & Prefunction Space 129
QUOTA: Quantifying Objects with Text-to-Image Models for Any Domain Poster Session 5
Wenfang Sun ⋅ Yingjun Du ⋅ Gaowen Liu ⋅ Yefeng Zheng ⋅ Cees Snoek
Tucson Ballroom & Prefunction Space 56
Extreme Amodal Face Detection Poster Session 3
Changlin Song ⋅ Yunzhong Hou ⋅ Michael Barnes ⋅ Rahul Shome ⋅ Dylan Campbell
Tucson Ballroom & Prefunction Space 2
Contrastive Integrated Gradients: A Feature Attribution-Based Method for Explaining Whole Slide Image Classification Poster Session 1
Anh Vu ⋅ Tuan Vo ⋅ Ngoc Bui ⋅ Nam Le ⋅ Akash Awasthi ⋅ Huy Vo ⋅ Thanh-Huy Nguyen ⋅ Zhu Han ⋅ Chandra Mohan ⋅ Hien Nguyen
Tucson Ballroom & Prefunction Space 115
MEGA-PCC: A Mamba-based Efficient Approach for Joint Geometry and Attribute Point Cloud Compression Poster Session 2 + Refreshments
Kai-Hsiang Hsieh ⋅ Monyneath Yim ⋅ Wen-Hsiao Peng ⋅ Jui-Chiu Chiang
Tucson Ballroom & Prefunction Space 39
CORA: Consistency-Guided Semi-Supervised Framework for Reasoning Segmentation Poster Session 5
Prantik Howlader ⋅ Hoang Nguyen-Canh ⋅ Srijan Das ⋅ Jingyi Xu ⋅ Hieu Le ⋅ Dimitris Samaras
Tucson Ballroom & Prefunction Space 13
DODA: Adapting Object Detectors to Dynamic Agricultural Environments in Real-Time with Diffusion Poster Session 4 + Reception
Shuai Xiang ⋅ Pieter Blok ⋅ James Burridge ⋅ Haozhou Wang ⋅ Wei Guo
Tucson Ballroom & Prefunction Space 49
Training-free Detection of Text-to-video Generations via Over-coherence Poster Session 3
Jonathan Brokman ⋅ Oren Rachmil ⋅ Omer Hofman ⋅ Roy Betser ⋅ Amit Giloni ⋅ Roman Vainshtein ⋅ Hisashi Kojima
Tucson Ballroom & Prefunction Space 103
MM-TS: Multi-Modal Temperature and Margin Schedules for Contrastive Learning with Long-Tail Data Poster Session 6 + Refreshments
Siarhei Sheludzko ⋅ Dhimitrios Duka ⋅ Bernt Schiele ⋅ Hilde Kühne ⋅ Anna Kukleva
Tucson Ballroom & Prefunction Space 17
AFL-PRF: Adaptive Federated Learning for Low-Quality Data: Enhancing Performance, Robustness, and Fairness Poster Session 1
Pinrui Yu ⋅ Yiming Xie ⋅ Longtian Ye ⋅ Geng Yuan ⋅ Ningfang Mi ⋅ Xue Lin
Tucson Ballroom & Prefunction Space 39
Harnessing Object Grounding for Time-Sensitive Video Understanding Poster Session 2 + Refreshments
Tz-Ying Wu ⋅ Sharath Nittur Sridhar ⋅ Subarna Tripathi
Tucson Ballroom & Prefunction Space 101
Are All Marine Species Created Equal? Performance Disparities in Underwater Object Detection Poster Session 4 + Reception
Melanie Wille ⋅ Tobias Fischer ⋅ Scarlett Raine
Tucson Ballroom & Prefunction Space 26
ViGG: Robust RGB-D Point Cloud Registration using Visual-Geometric Mutual Guidance Poster Session 1
Congjia Chen ⋅ Shen Yan ⋅ Yufu Qu
Tucson Ballroom & Prefunction Space 78
SCORP: Scene-Consistent Object Refinement via Proxy Generation and Tuning Poster Session 1
Ziwei Chen ⋅ Ziling Liu ⋅ Zitong Huang ⋅ Mingqi Gao ⋅ Feng Zheng
Tucson Ballroom & Prefunction Space 76
How I Met Your Bias: Investigating Bias Amplification in Diffusion Models Poster Session 4 + Reception
Nathan Roos ⋅ Ekaterina Iakovleva ⋅ Ani Gjergji ⋅ Vito Paolo Pastore ⋅ Enzo Tartaglione
Tucson Ballroom & Prefunction Space 104
PhysEduVideo: A Benchmark for Evaluating Text-to-Video Models for Physics Education Poster Session 6 + Refreshments
Megha Mariam K M ⋅ Aditya Arun ⋅ Zakaria Laskar ⋅ Jawahar CV
Tucson Ballroom & Prefunction Space 141
DreamCatcher: Efficient Multi-Concept Customization via Representation Finetuning Poster Session 5
Jungwon Lee ⋅ Changhun Lee ⋅ Eunhyeok Park
Tucson Ballroom & Prefunction Space 120
Self-Supervised Visual Prompting for Cross-Domain Road Damage Detection Poster Session 3
Xi Xiao ⋅ Zhuxuanzi Wang ⋅ Mingqiao Mo ⋅ Chen Liu ⋅ Chenrui Ma ⋅ Yanshu Li ⋅ Smita Krishnaswamy ⋅ Xiao Wang ⋅ Tianyang Wang
Tucson Ballroom & Prefunction Space 57
HumanBench: Two Heads, No Legs, But Mostly Human, the State of Generative Capabilities in T2I Models Poster Session 3
Anubhooti Jain ⋅ Mayank Vatsa ⋅ Richa Singh
Tucson Ballroom & Prefunction Space 112
Boosting Unsupervised Video Instance Segmentation with Automatic Quality-Guided Self-Training Poster Session 6 + Refreshments
Kaixuan Lu ⋅ Mehmet Onurcan Kaya ⋅ Dim Papadopoulos
Tucson Ballroom & Prefunction Space 18
Where is the Watermark? Interpretable Watermark Detection at the Block Level Poster Session 6 + Refreshments
Maria Bulychev ⋅ Neil Grant Marchant ⋅ Benjamin Rubinstein
Tucson Ballroom & Prefunction Space 21
From Detection to Anticipation: Online Understanding of Struggles across Various Tasks and Activities Poster Session 3
Shijia Feng ⋅ Michael Wray ⋅ Walterio Mayol-Cuevas
Tucson Ballroom & Prefunction Space 107
Memoire: Learning User Personas from Gallery Tags for Personalized Photo Curation Poster Session 5
Praful Mathur ⋅ Mohsin Iftekhar ⋅ Aman Sharma ⋅ Sarvesh Tiwari ⋅ Meghali Deka ⋅ Sathish Cherukuri ⋅ Roopa Sheshadri ⋅ Rakesh Valusa
Tucson Ballroom & Prefunction Space 102
Zero-Shot Video Deraining with Video Diffusion Models Poster Session 1
Tuomas Varanka ⋅ Juan Bello Bello ⋅ Hyeongwoo Kim ⋅ Pablo Garrido ⋅ Xu Yao
Tucson Ballroom & Prefunction Space 65
RoadBench: A Vision-Language Foundation Model and Benchmark for Road Damage Understanding Poster Session 5
Xi Xiao ⋅ Yunbei Zhang ⋅ Janet Wang ⋅ Lin Zhao ⋅ Yuxiang Wei ⋅ Hengjia Li ⋅ Yanshu Li ⋅ Xiao Wang ⋅ Swalpa Roy ⋅ Hao Xu ⋅ Tianyang Wang
Tucson Ballroom & Prefunction Space 21
Ego-EXTRA: video-language Egocentric Dataset for EXpert-TRAinee assistance Poster Session 4 + Reception
Francesco Ragusa ⋅ Michele Mazzamuto ⋅ Rosario Forte ⋅ Irene D'Ambra ⋅ James Fort ⋅ Jakob Engel ⋅ Antonino Furnari ⋅ Giovanni Farinella
Tucson Ballroom & Prefunction Space 15
BREEN: Bridge Data-Efficient Encoder-Free Multimodal Learning with Learnable Queries Poster Session 4 + Reception
Tianle Li ⋅ Yongming Rao ⋅ Winston Hu ⋅ Yu Cheng
Tucson Ballroom & Prefunction Space 105
GAEA: A Geolocation Aware Conversational Assistant Poster Session 4 + Reception
Ron Campos ⋅ Ashmal Vayani ⋅ Parth Parag Kulkarni ⋅ Rohit Gupta ⋅ Aizan Zafar ⋅ Aritra Dutta ⋅ Mubarak Shah
Tucson Ballroom & Prefunction Space 91
Leveraging Sparsity for Privacy in Collaborative Inference Poster Session 6 + Refreshments
Maximilian Hoefler ⋅ Karsten Mueller ⋅ Wojciech Samek
Tucson Ballroom & Prefunction Space 38
Optimizing LVLMs with On-Policy Data for Effective Hallucination Mitigation Poster Session 4 + Reception
Chengzhi Yu ⋅ Yifan Xu ⋅ Yifan Chen ⋅ Wenyi Zhang
Tucson Ballroom & Prefunction Space 43
Eye-for-an-eye: Appearance Transfer with Dense Semantic Correspondence in Diffusion Models Poster Session 4 + Reception
Sooyeon Go ⋅ Kyungmook Choi ⋅ Minjung Shin ⋅ Youngjung Uh
Tucson Ballroom & Prefunction Space 34
Diffusion-Based Action Recognition Generalizes to Untrained Domains Poster Session 5
Rogério Guimarães ⋅ Frank Xiao ⋅ Pietro Perona ⋅ Markus Marks
Tucson Ballroom & Prefunction Space 12
Multimodal Medical Image Binding via Shared Text Embeddings Poster Session 2 + Refreshments
Yunhao Liu ⋅ Suyang Xi ⋅ Shiqi Liu ⋅ Hong Ding ⋅ Chicheng Jin ⋅ Zhong Chong ⋅ Junjun He ⋅ Catherine Liu ⋅ Yiqing Shen
Tucson Ballroom & Prefunction Space 19
ATM: Enhanced Alignment for Text-to-Motion Generation Poster Session 5
Ke Han ⋅ Yueming Lyu ⋅ Weichen Yu ⋅ Nicu Sebe
Tucson Ballroom & Prefunction Space 101
You May Speak Freely: Improving the Fine-Grained Visual Recognition Capabilities of Multimodal Large Language Models with Answer Extraction Poster Session 2 + Refreshments
Logan Lawrence ⋅ Oindrila Saha ⋅ Megan Wei ⋅ Chen Sun ⋅ Subhransu Maji ⋅ Grant Horn
Tucson Ballroom & Prefunction Space 2
Intraoperative 2D/3D Registration via Spherical Similarity Learning and Differentiable Levenberg-Marquardt Optimization Poster Session 6 + Refreshments
Minheng Chen ⋅ Youyong Kong
Tucson Ballroom & Prefunction Space 4
GRAPE (Gaussian Rendering for Accelerated Pixel Enhancement) Brings Fast and Lightweight Arbitrary Super-Resolution Poster Session 6 + Refreshments
Jung In Jang ⋅ Kyong Hwan Jin
Tucson Ballroom & Prefunction Space 52
Face-LLaVA: Facial Expression and Attribute Understanding through Instruction Tuning Poster Session 2 + Refreshments
Ashutosh Chaubey ⋅ Xulang Guan ⋅ Mohammad Soleymani
Tucson Ballroom & Prefunction Space 117
Revisiting Retentive Networks for Fast Range-View 3D LiDAR Semantic Segmentation Poster Session 2 + Refreshments
Simone Mosco ⋅ Daniel Fusaro ⋅ Wanmeng Li ⋅ Alberto Pretto
Tucson Ballroom & Prefunction Space 103
Diffusion-Based Authentication of Copy Detection Patterns: A Multimodal Framework with Printer Signature Conditioning Poster Session 2 + Refreshments
Bolutife Atoki ⋅ Iuliia Tkachenko ⋅ Bertrand Kerautret ⋅ Carlos Crispim-Junior
Tucson Ballroom & Prefunction Space 26
Pyramidal Spectrum: Frequency-based Hierarchically Vector Quantized VAE for Videos Poster Session 2 + Refreshments
Tushar Prakash ⋅ Onkar Susladkar ⋅ Inderjit Dhillon ⋅ Sparsh Mittal
Tucson Ballroom & Prefunction Space 63
Zero-Shot Audio-Visual Editing via Cross-Modal Delta Denoising Poster Session 6 + Refreshments
Yan-Bo Lin ⋅ Kevin Lin ⋅ Zhengyuan Yang ⋅ Linjie Li ⋅ Jianfeng Wang ⋅ Chung-Ching Lin ⋅ Xiaofei Wang ⋅ Gedas Bertasius ⋅ Lijuan Wang
Tucson Ballroom & Prefunction Space 14
FedSCAl: Leveraging Server and Client Alignment for Unsupervised Federated Source-Free Domain Adaptation Poster Session 3
M Yashwanth ⋅ Sampath Koti ⋅ Arunabh Singh ⋅ Shyam Marjit ⋅ Anirban Chakraborty
Tucson Ballroom & Prefunction Space 114
Human Pose Aggregation for Multi-View Temporal Video Alignment Poster Session 1
Fabien Delattre ⋅ Tsung-Wei Huang ⋅ Guan-Ming Su ⋅ Erik Learned-Miller
Tucson Ballroom & Prefunction Space 61
MEDAL: multi-modal MEta-space Distillation and ALignment for Visual Compatibility Learning Poster Session 1
Dween Sanny ⋅ Vinay Verma ⋅ Prateek Sircar ⋅ Deepak Gupta
Tucson Ballroom & Prefunction Space 85
FlowCLAS: Enhancing Normalizing Flow-Based Anomaly Segmentation Via Contrastive Learning Poster Session 5
Chang Won (John) Lee ⋅ Selina Leveugle ⋅ Paul Grouchy ⋅ Chris Langley ⋅ Svetlana Stolpner ⋅ Jonathan Kelly ⋅ Steven Waslander
Tucson Ballroom & Prefunction Space 114
Multimodal Graph Representation Learning over Arbitrary Sets of Modalities Poster Session 5
Santosh Patapati ⋅ Trisanth Srinivasan
Tucson Ballroom & Prefunction Space 124
RapidMV: Leveraging Spatio-Angular Latent Space for Efficient and Consistent Text-to-Multi-View Synthesis Poster Session 2 + Refreshments
Seungwook Kim ⋅ Yichun Shi ⋅ Kejie Li ⋅ Minsu Cho ⋅ Peng Wang
Tucson Ballroom & Prefunction Space 25
PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models Poster Session 1
Zilu Guo ⋅ Hongbin Lin ⋅ Zhihao Yuan ⋅ Chaoda Zheng ⋅ Pengshuo Qiu ⋅ Dongzhi Jiang ⋅ Renrui Zhang ⋅ Chun-Mei Feng ⋅ Zhen Li
Tucson Ballroom & Prefunction Space 122
DreamAnywhere: Object-Centric Panoramic 3D Scene Generation Poster Session 1
Edoardo Dominici ⋅ Jozef Hladký ⋅ Floor Verhoeven ⋅ Lukas Radl ⋅ Thomas Deixelberger ⋅ Stefan Ainetter ⋅ Philipp Drescher ⋅ Stefan Hauswiesner ⋅ Arno Coomans ⋅ Giacomo Nazzaro ⋅ Konstantinos Vardis ⋅ Markus Steinberger
Tucson Ballroom & Prefunction Space 1
mEOL: Training-Free Instruction-Guided Multimodal Embedder for Vector Graphics and Image Retrieval Poster Session 1
Kyeongseon Kim ⋅ Baek Seong-Eun ⋅ Lee Jung-Mok ⋅ Tae-Hyun Oh
Tucson Ballroom & Prefunction Space 114
TS-PCI: Point Cloud Frame Interpolation with Time-Aware Point Cloud Sampling and Self-Supervised Learning Strategy Poster Session 1
Kohei Matsuzaki ⋅ Keisuke Nonaka
Tucson Ballroom & Prefunction Space 6
Referring Change Detection in Remote Sensing Imagery Poster Session 1
Yilmaz Korkmaz ⋅ Jay Paranjape ⋅ Celso de Melo ⋅ Vishal Patel
Tucson Ballroom & Prefunction Space 11
GenHSI: Controllable Generation of Human-Scene Interaction Videos Poster Session 1
Zekun Li ⋅ Rui Zhou ⋅ Rahul Sajnani ⋅ Xiaoyan Cong ⋅ Daniel Ritchie ⋅ Srinath Sridhar
Tucson Ballroom & Prefunction Space 14
SAVeD: Learning to Denoise Low-SNR Video for Improved Downstream Performance Poster Session 5
Suzanne Stathatos ⋅ Michael Hobley ⋅ Pietro Perona ⋅ Markus Marks
Tucson Ballroom & Prefunction Space 100
Forget Less by Learning Together through Concept Consolidation Poster Session 1
Arjun Kaushik Kaushik ⋅ Naresh Kumar Devulapally ⋅ Vishnu Lokhande ⋅ Nalini Ratha ⋅ Venu Govindaraju
Tucson Ballroom & Prefunction Space 26
Training-free Multi-view 4D Human Motion Reconstruction Virtual Reality System Poster Session 1
Yijie Li ⋅ Ce Zheng ⋅ Yijie He ⋅ Joel Julin ⋅ Ryosuke Ichikari ⋅ Satoki Ogiso ⋅ Satoshi Nakae ⋅ Akihiro Sato ⋅ Takeshi Kurata ⋅ Laszlo Jeni
Tucson Ballroom & Prefunction Space 31
Cluster-Guided Adversarial Perturbations for Robust Contrastive Learning Poster Session 1
Seongyun Seo ⋅ Sungmin Han ⋅ Jeonghyun Lee ⋅ Sangkyun Lee
Tucson Ballroom & Prefunction Space 34
Eff-GRot: Efficient and Generalizable Rotation Estimation with Transformers Poster Session 1
Fanis Mathioulakis ⋅ Gorjan Radevski ⋅ Tinne Tuytelaars
Tucson Ballroom & Prefunction Space 40
Interleaved Vision-and-Language Generation via Generative Voken Poster Session 1
Kaizhi Zheng ⋅ Xuehai He ⋅ Xin Wang
Tucson Ballroom & Prefunction Space 46
CraftSVG: Multi-Object Text-to-SVG Synthesis via Layout Guided Diffusion Poster Session 2 + Refreshments
Ayan Banerjee ⋅ Nityanand Mathur ⋅ Josep Llados ⋅ Umapada Pal ⋅ Anjan Dutta
Tucson Ballroom & Prefunction Space 109
Network-agnostic distortion-robust projections for wide-angle image understanding Poster Session 1
Akshaya Athwale ⋅ Ola Ahmad ⋅ Jean-Francois Lalonde
Tucson Ballroom & Prefunction Space 57
PS3: Part level instance segmentation in 3D Poster Session 1
Hong-Xuan Yen ⋅ Chiamin Chen ⋅ Yanqing Wang ⋅ Yu-Lun Liu ⋅ Min Sun
Tucson Ballroom & Prefunction Space 86
Root Completion from Intraoral Scans of Tooth Crowns using Diffusion with Patch Perturbation Poster Session 1
Yohan Jang ⋅ In-Seok Song ⋅ Seung Baek
Tucson Ballroom & Prefunction Space 47
ZonUI-3B: Competitive GUI Grounding with a 3B VLM Trained on a Single Consumer GPU Poster Session 1
ZongHan Hsieh ⋅ Shengjing Yang ⋅ Tzer-Jen Wei
Tucson Ballroom & Prefunction Space 92
HyperPose: Hyper-pose Embeddings for 3D-Aware Generative Models with Self-Supervised Disentangling of Pose and Scene Poster Session 1
Mijeong Kim ⋅ Namgi Kim ⋅ Bohyung Han
Tucson Ballroom & Prefunction Space 97
Diverse Sketch Colorization with Content-Enhanced Style Representation and Recolorization Distillation Poster Session 1
Shuangming Mao ⋅ HaiXiang Zhu
Tucson Ballroom & Prefunction Space 102
BanglaProtha: Evaluating Vision Language Models in Underrepresented Long-tail Cultural Contexts Poster Session 1
Md Fahim ⋅ Md Sakib Ul Rahman Sourove ⋅ Akm Mazumder ⋅ Md Ishmam ⋅ Md Adib ⋅ Fariha Tanjim Shifat ⋅ Fabiha Haider ⋅ Md Bhuiyan
Tucson Ballroom & Prefunction Space 111
ProSkill: Segment-Level Skill Assessment in Procedural Videos Poster Session 4 + Reception
Michele Mazzamuto ⋅ Daniele Di Mauro ⋅ Gianpiero Francesca ⋅ Giovanni Farinella ⋅ Antonino Furnari
Tucson Ballroom & Prefunction Space 54
Towards Fast and Scalable Normal Integration using Continuous Components Poster Session 1
Francesco Milano ⋅ Jen Jen Chung ⋅ Lionel Ott ⋅ Roland Siegwart
Tucson Ballroom & Prefunction Space 23
GHOST: Getting to the Bottom of Hallucinations with A Multi-round Consistency Benchmark Poster Session 5
Vibashan VS ⋅ Nadine Chang ⋅ Jenny Schmalfuss ⋅ Vishal Patel ⋅ Zhiding Yu ⋅ Jose M. Alvarez
Tucson Ballroom & Prefunction Space 35
QuadraNet V2: Efficient and Sustainable Training of High-Order Neural Networks with Quadratic Adaptation Poster Session 1
Chenhui Xu ⋅ Fuxun Yu ⋅ Jinjun Xiong ⋅ Xiang Chen
Tucson Ballroom & Prefunction Space 131
Identity Verification from Human Scent using Channel Representation of 2D Gas Chromatography-Mass Spectrometry Data Poster Session 2 + Refreshments
Radim Spetlik ⋅ Jan Hlavsa ⋅ Jana Čechová ⋅ Petra Pojmanová ⋅ Jiri Matas ⋅ Štěpán Urban
Tucson Ballroom & Prefunction Space 6
BrightRate: Quality Assessment for User-Generated HDR Videos Poster Session 2 + Refreshments
Shreshth Saini ⋅ Bowen Chen ⋅ Yilin Wang ⋅ Neil Birkbeck ⋅ Balu Adsumilli ⋅ Alan Bovik
Tucson Ballroom & Prefunction Space 11
Timestamp Query Transformer for Temporal Action Segmentation Poster Session 4 + Reception
Tieqiao Wang ⋅ Sinisa Todorovic
Tucson Ballroom & Prefunction Space 70
Inpaint360GS: Efficient Object-Aware 3D Inpainting via Gaussian Splatting for 360° Scenes Poster Session 1
Shaoxiang Wang ⋅ Shihong Zhang ⋅ Christen Millerdurai ⋅ Rüdiger Westermann ⋅ Didier Stricker ⋅ Alain Pagani
Tucson Ballroom & Prefunction Space 12
QCFace: Image Quality Control for boosting Face Representation & Recognition Poster Session 2 + Refreshments
Duc-Phuong Doan-Ngo ⋅ Thanh-Dang Diep ⋅ Thanh Nguyen-Duc ⋅ Thanh-Sach Le ⋅ Nam Thoai
Tucson Ballroom & Prefunction Space 9
Test-Time Adaptation for Video Highlight Detection Using Meta-Auxiliary Learning and Cross-Modality Hallucinations Poster Session 5
Zahidul Islam ⋅ Sujoy Paul ⋅ Mrigank Rochan
Tucson Ballroom & Prefunction Space 104
CycleSL: Server-Client Cyclical Update Driven Scalable Split Learning Poster Session 2 + Refreshments
Mengdi Wang ⋅ Efe Bozkir ⋅ Enkelejda Kasneci
Tucson Ballroom & Prefunction Space 41
Roadside Monocular 3D Detection Prompted by 2D Detection Poster Session 2 + Refreshments
Yechi Ma ⋅ Wei Hua ⋅ Yanan Li ⋅ Shu Kong
Tucson Ballroom & Prefunction Space 46
ASC: Learning Augmentation Severity-Consistent Representations Improves Generalization via Augmentation Search Poster Session 2 + Refreshments
Amirhossein Alamdar ⋅ Hossein Jafarinia ⋅ Mahdi Nouri ⋅ Mohammad Rohban
Tucson Ballroom & Prefunction Space 49
Semi-Supervised Hierarchical Open-Set Classification Poster Session 2 + Refreshments
Erik Wallin ⋅ Fredrik Kahl ⋅ Lars Hammarstrand
Tucson Ballroom & Prefunction Space 55
DoTA: Latent Distribution Conditioned Data Attribution for Diffusion Models Poster Session 2 + Refreshments
Ninad Joshi ⋅ Vivek Srivastava ⋅ Shirish Karande
Tucson Ballroom & Prefunction Space 58
Narrating For You: Prompt-guided Audio-visual Narrating Face Generation Employing Multi-entangled Latent Space Poster Session 1
Aashish Chandra ⋅ Aashutosh A V ⋅ Abhijit Das
Tucson Ballroom & Prefunction Space 126
LightGazeNet: A Lightweight GNN-based Architecture for Gaze Estimation Poster Session 3
Heena Patel ⋅ Anirban Chowdhury ⋅ Pooja Choksy ⋅ Samiksha Pachade ⋅ Ajinkya Puar
Tucson Ballroom & Prefunction Space 76
Zero-Shot Coreset Selection via Iterative Subspace Sampling Poster Session 2 + Refreshments
Brent Griffin ⋅ Jacob Marks ⋅ Jason Corso
Tucson Ballroom & Prefunction Space 67
BAFIS: Dataset + Framework to assess occupational Bias and Human Preference in modern Text-to-image Models Poster Session 2 + Refreshments
Thomas Klassert ⋅ Adrian Ulges ⋅ Biying Fu
Tucson Ballroom & Prefunction Space 72
High-Rate Mixout: Revisiting Mixout for Robust Domain Generalization Poster Session 3
Masih Aminbeidokhti ⋅ Heitor Medeiros ⋅ Srikanth Muralidharan ⋅ Eric Granger ⋅ Marco Pedersoli
Tucson Ballroom & Prefunction Space 85
CVP: Central-Peripheral Vision-Inspired Multimodal Model for Spatial Reasoning Poster Session 2 + Refreshments
Zeyuan Chen ⋅ Xiang Zhang ⋅ Haiyang Xu ⋅ Jianwen Xie ⋅ Zhuowen Tu
Tucson Ballroom & Prefunction Space 84
Discrete Facial Encoding: A Framework for Data-driven Facial Display Discovery Poster Session 2 + Refreshments
Minh Tran ⋅ Maksim Siniukov ⋅ Zhangyu Jin ⋅ Mohammad Soleymani
Tucson Ballroom & Prefunction Space 89
ScoliGaitX: A Deep Multi-Modal Fusion Network for Scoliosis Assessment via Gait Video Analysis Poster Session 2 + Refreshments
Kaushik Vishwakarma ⋅ Aditya Nigam
Tucson Ballroom & Prefunction Space 94
FlowMorph: Revealing an Optimizable Flow Latent Space for Controlled Image Morphing Poster Session 2 + Refreshments
Yan Zheng ⋅ Yi Yang ⋅ Lanqing Guo ⋅ Zhangyang "Atlas" Wang
Tucson Ballroom & Prefunction Space 99
Zero-shot Hierarchical Plant Segmentation via Foundation Segmentation Models and Text-to-image Attention Poster Session 2 + Refreshments
Junhao Xing ⋅ Ryohei Miyakawa ⋅ Yang Yang ⋅ Xinpeng Liu ⋅ Risa Shinoda ⋅ Hiroaki Santo ⋅ Yosuke Toda ⋅ Fumio Okura
Tucson Ballroom & Prefunction Space 104
Moiré Zero: An Efficient and High-Performance Neural Architecture for Moiré Removal Poster Session 2 + Refreshments
Seungryong Lee ⋅ Woojeong Baek ⋅ Younghyun Kim ⋅ Eunwoo Kim ⋅ Haru Moon ⋅ Donggon Yoo ⋅ Eunbyung Park
Tucson Ballroom & Prefunction Space 105
A-V Representation Learning via Audio Shift Prediction for Multimodal Deepfake Detection and Temporal Localization Poster Session 2 + Refreshments
Ashutosh Anshul ⋅ Eng Chng ⋅ Deepu Rajan
Tucson Ballroom & Prefunction Space 108
MVAT: Multi-View Aware Teacher for Weakly Supervised 3D Object Detection Poster Session 5
Saad Lahlali ⋅ Alexandre Montgieux ⋅ Nicolas Granger ⋅ Hervé Le Borgne ⋅ Quoc Cuong Pham
Tucson Ballroom & Prefunction Space 29
Evaluating Text-to-Image and Text-to-Video Synthesis with a Conditional Frechet Distance Poster Session 2 + Refreshments
Jaywon Koo ⋅ Jefferson Hernandez ⋅ Moayed Haji-Ali ⋅ Ziyan Yang ⋅ Vicente Ordonez
Tucson Ballroom & Prefunction Space 61
CineVerse: Consistent Keyframe Synthesis for Cinematic Scene Composition Poster Session 2 + Refreshments
Quynh Phunh ⋅ Long Mai ⋅ Fabian Caba Heilbron ⋅ Feng Liu ⋅ Jia-Bin Huang ⋅ Cusuh Ham
Tucson Ballroom & Prefunction Space 115
ConsensusXAI: A framework to examine class-wise agreement in medical imaging Poster Session 2 + Refreshments
Abbas Haider ⋅ David Wright ⋅ Ruth Hogg ⋅ Hui Wang ⋅ Tunde Peto ⋅ Richard Gault
Tucson Ballroom & Prefunction Space 118
Matching Semantically Similar Non-Identical Objects Poster Session 2 + Refreshments
Yusuke Marumo ⋅ Kazuhiko Kawamoto ⋅ Satomi Tanaka ⋅ Shigenobu Hirano ⋅ Hiroshi Kera
Tucson Ballroom & Prefunction Space 127
What Happens When: Learning Temporal Orders of Events in Videos Poster Session 2 + Refreshments
Daechul Ahn ⋅ Yura Choi ⋅ Hyeonbeom Choi ⋅ Seongwon Cho ⋅ San Kim ⋅ Jonghyun Choi
Tucson Ballroom & Prefunction Space 130
DiRe: Diversity-promoting Regularization for Dataset Condensation Poster Session 2 + Refreshments
Saumyaranjan Mohanty ⋅ Aravind Reddy ⋅ Konda Reddy Mopuri
Tucson Ballroom & Prefunction Space 133
Improved Wildfire Spread Prediction with Time-Series Data and the WSTS+ Benchmark Poster Session 2 + Refreshments
Saad Lahrichi ⋅ Jake Bova ⋅ Jesse Johnson ⋅ Jordan Malof
Tucson Ballroom & Prefunction Space 140
RAVU: Retrieval Augmented Video Understanding with Compositional Reasoning over Graph Poster Session 2 + Refreshments
Sameer Malik ⋅ Ayush Singh ⋅ Moyuru Yamada ⋅ Dishank Aggarwal
Tucson Ballroom & Prefunction Space 138
StreetView-Waste: A Multi-Task Dataset for Urban Waste Management Poster Session 3
Diogo J. Paulo ⋅ João Martins ⋅ Hugo Proenca ⋅ João Neves
Tucson Ballroom & Prefunction Space 9
Evaluating the Capability of Video Question Generation for Expert Knowledge Elicitation Poster Session 3
Huaying Zhang ⋅ Atsushi Hashimoto ⋅ Tosho Hirasawa
Tucson Ballroom & Prefunction Space 12
GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving Poster Session 3
William Ljungbergh ⋅ Adam Lilja ⋅ Adam Tonderski ⋅ Arvid Ling ⋅ Carl Lindström ⋅ Willem Verbeke ⋅ Junsheng Fu ⋅ Christoffer Petersson ⋅ Lars Hammarstrand ⋅ Michael Felsberg
Tucson Ballroom & Prefunction Space 15
Gradient-Free Classifier Guidance for Diffusion Model Sampling Poster Session 3
Rahul Shenoy ⋅ Zhihong Pan ⋅ Kaushik Balakrishnan ⋅ Qisen Cheng ⋅ Yongmoon Jeon ⋅ Heejune Yang ⋅ Jaewon Kim
Tucson Ballroom & Prefunction Space 23
PointNet4D: A lightweight 4D Point Cloud Video Backbone for Online and Offline Perception in Robotic Applications Poster Session 3
Yunze Liu ⋅ Zifan Wang ⋅ Peiran Wu ⋅ Jiayang Ao
Tucson Ballroom & Prefunction Space 27
Show Me: Unifying Instructional Image and Video Generation with Diffusion Models Poster Session 3
Yujiang Pu ⋅ Zhanbo Huang ⋅ Vishnu Boddeti ⋅ Yu Kong
Tucson Ballroom & Prefunction Space 35
HEART-PFL: Stable Personalized Federated Learning under Heterogeneity with Hierarchical Directional Alignment and Adversarial Knowledge Transfer Poster Session 3
Minjun Kim ⋅ Minje Kim
Tucson Ballroom & Prefunction Space 43
Detecting Social Engagement of Elderly From Lifelog Image-streams to Identify Effective Cues for Autobiographic Recall Poster Session 3
Vengateswaran Subramaniam ⋅ Vigneshwaran Subbaraju ⋅ Debaditya Roy ⋅ Pramath Krishna ⋅ Thivya Kandappu ⋅ Qianli Xu
Tucson Ballroom & Prefunction Space 44
Data-Driven Loss Functions for Inference-Time Optimization in Text-to-Image Poster Session 3
Sapir Esther Yiflach ⋅ Yuval Atzmon ⋅ Gal Chechik
Tucson Ballroom & Prefunction Space 58
DOTGraph: CLIP-Driven Feature Disentanglement and Optimal Transport based Graph Learning for Few-Shot Segmentation Poster Session 3
Shreya Biswas ⋅ Zhaozheng Yin
Tucson Ballroom & Prefunction Space 69
ScoreNet: Netting Lightweight Quality Scores for Better Visual Assessment with Large Multi-Modality Models Poster Session 5
Bahador Rashidi ⋅ Kiarash Aghakasiri ⋅ Shupei Zhang ⋅ Amirmohsen Sattarifard ⋅ Yue Zhang ⋅ Chao Gao
Tucson Ballroom & Prefunction Space 115
A Little More Like This: Text-to-Image Retrieval with Vision-Language Models Using Relevance Feedback Poster Session 3
Bulat Khaertdinov ⋅ Mirela Popa ⋅ Nava Tintarev
Tucson Ballroom & Prefunction Space 87
Online Episodic Memory Visual Query Localization with Egocentric Streaming Object Memory Poster Session 3
Zaira Manigrasso ⋅ Matteo Dunnhofer ⋅ Antonino Furnari ⋅ Moritz Nottebaum ⋅ Antonio Finocchiaro ⋅ Marana Davide ⋅ Rosario Forte ⋅ Giovanni Farinella ⋅ Christian Micheloni
Tucson Ballroom & Prefunction Space 99
LVM-Lite: Training Large Vision Models with Efficient Sequential Modeling Poster Session 4 + Reception
Xianhang Li ⋅ Hongru Zhu ⋅ Sucheng Ren ⋅ Linjie Yang ⋅ Peng Wang ⋅ Heng Wang ⋅ Xiaohui Shen ⋅ Qing Liu ⋅ Cihang Xie
Tucson Ballroom & Prefunction Space 27
Domain Generalizing DINO for Visual Regression via Latent Distractor Subspace Consistency Poster Session 3
Nikhil Kumar Jangamreddy ⋅ Chetan Arora ⋅ Mahsa Baktashmotlagh
Tucson Ballroom & Prefunction Space 109
TalkingHeadBench: A Multi-Modal Benchmark & Analysis of Talking-Head DeepFake Detection Poster Session 3
Xinqi Xiong ⋅ Prakrut Patel ⋅ Qingyuan Fan ⋅ Amisha Wadhwa ⋅ Sarathy Selvam ⋅ Xiao Guo ⋅ Luchao Qi ⋅ Xiaoming Liu ⋅ Roni Sengupta
Tucson Ballroom & Prefunction Space 117
Guided Texture Segmentation via Coordinate-Aware Class-Ratio Mapping Poster Session 3
Bishal Swain ⋅ Kyung Cheoi ⋅ Jaepil Ko
Tucson Ballroom & Prefunction Space 128
OMeGa: Joint Optimization of Explicit Meshes and Gaussian Splats for Robust Scene-Level Surface Reconstruction Poster Session 4 + Reception
Yuhang Cao ⋅ Haojun Yan ⋅ Danya Yao
Tucson Ballroom & Prefunction Space 10
Similarity-aware Probabilistic Embeddings Modeling for Video-Text Retrieval Poster Session 4 + Reception
Yuliang Huang ⋅ Pengxu Wei ⋅ Zhicheng Dong ⋅ Liang Lin
Tucson Ballroom & Prefunction Space 16
SIAM: Synchronous Interaction Attention for Human Mesh Recovery Poster Session 4 + Reception
Niaz Ahmad ⋅ Saif Ullah ⋅ Youngmoon Lee ⋅ Guanghui Wang
Tucson Ballroom & Prefunction Space 24
Transformer-Based Inpainting for Real-Time 3D Streaming in Sparse Multi-Camera Setups Poster Session 4 + Reception
Leif V Holland ⋅ Domenic Zingsheim ⋅ Mana Takhsha ⋅ Hannah Dröge ⋅ Patrick Stotko ⋅ Markus Plack ⋅ Reinhard Klein
Tucson Ballroom & Prefunction Space 29
LiDAR-DHMT: LiDAR-Adaptive Dual Hierarchical Mask Transformer for Robust Freespace Detection and Semantic Segmentation Poster Session 1
Siyu Chen ⋅ Ting Han ⋅ Changshe Zhang ⋅ Xin Luo ⋅ Huan Chen ⋅ Meiliu Wu ⋅ Guorong Cai ⋅ Jinhe Su
Tucson Ballroom & Prefunction Space 120
LASER: Lip Landmark Assisted Speaker Detection for Robustness Poster Session 6 + Refreshments
Le Thien Phuc Nguyen ⋅ Zhuoran Yu ⋅ Yong Jae Lee
Tucson Ballroom & Prefunction Space 9
Generalization of Real World Video Deblurring By Image-to-Image Translation Poster Session 4 + Reception
Kassymzhomart Aitbek ⋅ Seungjoon Yang
Tucson Ballroom & Prefunction Space 40
More Than Memory Savings: Zeroth-Order Optimization Mitigates Forgetting in Continual Learning Poster Session 4 + Reception
Wanhao Yu ⋅ Zheng Wang ⋅ Shuteng Niu ⋅ Sen Lin ⋅ Li Yang
Tucson Ballroom & Prefunction Space 46
CoL2A: Convolution-free Local Linear Attention for SpatioTemporal Event Processing Poster Session 4 + Reception
Yusuke Sekikawa ⋅ Itsumi Araki ⋅ Jun Nagata ⋅ Andreu Girbau
Tucson Ballroom & Prefunction Space 56
Patch-wise Retrieval: A Bag of Practical Techniques for Instance-level Matching Poster Session 4 + Reception
Wonseok Choi ⋅ Sohwi Lim ⋅ Nam Hyeon-Woo ⋅ Moon Ye-Bin ⋅ Dong-ju Jeong ⋅ Jinyoung Hwang ⋅ Tae-Hyun Oh
Tucson Ballroom & Prefunction Space 61
Geo3DVQA: Evaluating Vision-Language Models for 3D Geospatial Reasoning from Aerial Imagery Poster Session 4 + Reception
Mai Tsujimoto ⋅ Junjue Wang ⋅ Weihao Xuan ⋅ Naoto Yokoya
Tucson Ballroom & Prefunction Space 68
GrowTAS: Progressive Expansion from Small to Large Subnets for Efficient ViT Architecture Search Poster Session 4 + Reception
Hyunju Lee ⋅ Youngmin Oh ⋅ Jeimin Jeon ⋅ Donghyeon Baek ⋅ Bumsub Ham
Tucson Ballroom & Prefunction Space 73
Curve Skeletonization in Continuous domain for Meshes and Point Clouds Poster Session 4 + Reception
Jai Bardhan ⋅ Ramya Hebbalaguppe ⋅ Aravind Udupa
Tucson Ballroom & Prefunction Space 76
ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models Poster Session 4 + Reception
Danae Sanchez Villegas ⋅ Ingo Ziegler ⋅ Desmond Elliott
Tucson Ballroom & Prefunction Space 81
ST-Think: How Multimodal Large Language Models Reason About 4D Worlds from Ego-Centric Videos Poster Session 4 + Reception
Peiran Wu ⋅ Yunze Liu ⋅ Miao Liu ⋅ Junxiao Shen
Tucson Ballroom & Prefunction Space 85
Improving Out-of-Distribution Detection Using Segmented Images and Cross-View Attention Fusion Poster Session 4 + Reception
Alexander Politowicz ⋅ Sahisnu Mazumder ⋅ Bing Liu
Tucson Ballroom & Prefunction Space 94
Revisiting Vision–Language Foundations for No-Reference Image Quality Assessment Poster Session 4 + Reception
Ankit Yadav ⋅ Ta Duc Huy ⋅ Lingqiao Liu
Tucson Ballroom & Prefunction Space 108
Learning Subglacial Bed Topography from Sparse Radar with Physics-Guided Residuals Poster Session 4 + Reception
Bayu Tama ⋅ Jianwu Wang ⋅ Vandana Janeja ⋅ Mostafa Cham
Tucson Ballroom & Prefunction Space 111
DPBridge: Latent Diffusion Bridge for Dense Prediction Poster Session 4 + Reception
Haorui Ji ⋅ Tao Jun Lin ⋅ Hongdong Li
Tucson Ballroom & Prefunction Space 118
CRISP: Cylindrical Rendering for In-Stream Point Clouds Poster Session 4 + Reception
Hyungwoo Kang ⋅ Seonyoung Jang ⋅ YeoJun Yoon ⋅ Byungtae Oh
Tucson Ballroom & Prefunction Space 121
KFS-Bench: Comprehensive Evaluation of Key Frame Sampling in Long Video Understanding Poster Session 4 + Reception
Zongyao Li ⋅ Kengo Ishida ⋅ Satoshi Yamazaki ⋅ Xiaotong Ji ⋅ Jianquan Liu
Tucson Ballroom & Prefunction Space 130
Style-Friendly SNR Sampler for Style-Driven Generation Poster Session 4 + Reception
Jooyoung Choi ⋅ Chaehun Shin ⋅ Yeongtak Oh ⋅ Heeseung Kim ⋅ Jungbeom Lee ⋅ Sungroh Yoon
Tucson Ballroom & Prefunction Space 136
ControlVP: Interactive Geometric Refinement of AI-Generated Images with Consistent Vanishing Points Poster Session 4 + Reception
Ryota Okumura ⋅ Kaede Shiohara ⋅ Toshihiko Yamasaki
Tucson Ballroom & Prefunction Space 140
Towards Egocentric 3D Hand Pose Estimation in Unseen Domains Poster Session 4 + Reception
Wiktor Mucha ⋅ Michael Wray ⋅ Martin Kampel
Tucson Ballroom & Prefunction Space 143
Motion-Aware Graph Fusion Network for 3D Human Pose Estimation Poster Session 5
Yen Pham ⋅ Xiaohui Yuan ⋅ Chengyuan Zhuang
Tucson Ballroom & Prefunction Space 1
SynchroRaMa: Lip-Synchronized and Emotion-Aware Talking Face Generation via Multi-Modal Emotion Embedding Poster Session 4 + Reception
Phyo Thet Yee ⋅ Dimitrios Kollias ⋅ Sudeepta Mishra ⋅ Abhinav Dhall
Tucson Ballroom & Prefunction Space 25
Conjuring Positive Pairs for Efficient Unification of Representation Learning and Image Synthesis Poster Session 1
Imanol Estepa ⋅ Jesús Rodríguez-de-Vera ⋅ Ignacio Sarasua ⋅ Bhalaji Nagarajan ⋅ Petia Radeva
Tucson Ballroom & Prefunction Space 72
IMKD: Intensity-Aware Multi-Level Knowledge Distillation for Camera-Radar Fusion Poster Session 5
Shashank Mishra ⋅ Karan Patil ⋅ Didier Stricker ⋅ Jason Rambach
Tucson Ballroom & Prefunction Space 22
MUSE: Model-based Uncertainty-aware Similarity Estimation for zero-shot 2D Object Detection and Segmentation Poster Session 5
Sungmin Cho ⋅ Sungbum Park ⋅ Insoo Oh
Tucson Ballroom & Prefunction Space 28
TM-Adapter: Temporal Merge Adapter for Efficient Global Temporal Modeling Poster Session 5
WooJoo Hahm ⋅ Seungwoo Jang ⋅ Hyeon Kim ⋅ Daeun Lee ⋅ Kwangsu Kim
Tucson Ballroom & Prefunction Space 31
SceneProp: Combining Neural Network and Markov Random Field for Scene-Graph Grounding Poster Session 5
Keita Otani ⋅ Tatsuya Harada
Tucson Ballroom & Prefunction Space 34
Reinforcement Learning-based Adaptive Control of Classifier-Free Guidance and Timestep Embeddings in Diffusion Models Poster Session 1
Haochen You ⋅ Baojing Liu ⋅ Hongyang He
Tucson Ballroom & Prefunction Space 5
Zero-Shot Domain Generalisation via Prompt-Driven Feature Refinement Poster Session 5
Tingrui Qiao ⋅ Di Zhao ⋅ Caroline Walker ⋅ Chris Cunningham ⋅ Yun Sing Koh
Tucson Ballroom & Prefunction Space 37
GFT-GCN: Privacy-Preserving 3D Face Mesh Recognition with Spectral Diffusion Poster Session 5
Hichem Felouat ⋅ Hanrui Wang ⋅ Isao Echizen
Tucson Ballroom & Prefunction Space 42
Video and Language Alignment in 2D Systems for 3D Multi-object Scenes with Multi-Information Derivative-Free Control Poster Session 5
Jason Armitage ⋅ Rico Sennrich
Tucson Ballroom & Prefunction Space 45
ViSTA: Visual Storytelling using Multi-modal Adapters for Text-to-Image Diffusion Models Poster Session 1
Sibo Dong ⋅ Ismail Shaheen ⋅ Maggie Shen ⋅ Rupayan Mallick ⋅ Sarah Bargal
Tucson Ballroom & Prefunction Space 2
PSDiffusion: Harmonized Multi-Layer Image Generation via Layout and Appearance Alignment Poster Session 3
Dingbang Huang ⋅ Wenbo Li ⋅ Yifei Zhao ⋅ Xinyu Pan ⋅ Yanhong Zeng ⋅ Bo Dai
Tucson Ballroom & Prefunction Space 30
Mean-Shift Distillation for Diffusion Mode Seeking Poster Session 5
Vikas Thamizharasan ⋅ Nikitas Chatzis ⋅ Iliyan Georgiev ⋅ Matthew Fisher ⋅ Evangelos Kalogerakis ⋅ Difan Liu ⋅ Nanxuan Zhao ⋅ Michal Lukáč
Tucson Ballroom & Prefunction Space 71
FAIR-SIGHT: Fairness Assurance in Image Recognition via Simultaneous Conformal Thresholding and Dynamic Output Repair Poster Session 5
Arya Fayyazi ⋅ Mehdi Kamal ⋅ Massoud Pedram
Tucson Ballroom & Prefunction Space 79
Guiding What Not to Generate: Automated Negative Prompting for Text-Image Alignment Poster Session 5
Sangha Park ⋅ Eunji Kim ⋅ Yeongtak Oh ⋅ Jooyoung Choi ⋅ Sungroh Yoon
Tucson Ballroom & Prefunction Space 82
Correcting and Quantifying Systematic Errors in 3D Box Annotations for Autonomous Driving Poster Session 5
Alexandre Justo Miro ⋅ Ludvig af Klinteberg ⋅ Bogdan Timus ⋅ Aron Asefaw ⋅ Ajinkya Khoche ⋅ Thomas Gustafsson ⋅ Sina Mansouri ⋅ Masoud Daneshtalab
Tucson Ballroom & Prefunction Space 88
S2O: Static to Openable Enhancement for Articulated 3D Objects Poster Session 5
Hanxiao Jiang ⋅ Yiming Zhang ⋅ Manolis Savva ⋅ Angel Chang
Tucson Ballroom & Prefunction Space 94
PoseAdapt: Sustainable Human Pose Estimation via Continual Learning Benchmarks and Toolkit Poster Session 5
Muhammad Saif Ullah Khan ⋅ Didier Stricker
Tucson Ballroom & Prefunction Space 99
Multimodal Adversarial Defense for Vision-Language Models by Leveraging One-To-Many Relationships Poster Session 5
Futa Waseda ⋅ Antonio Tejero-de-Pablos ⋅ Isao Echizen
Tucson Ballroom & Prefunction Space 111
ForestSplats: Deformable transient field for Gaussian Splatting in the Wild Poster Session 5
Wongi Park ⋅ Myeongseok Nam ⋅ Siwon Kim ⋅ Sangwoo Jo ⋅ Soomok Lee
Tucson Ballroom & Prefunction Space 112
SGD-Mix: Enhancing Domain-Specific Image Classification with Label-Preserving Data Augmentation Poster Session 5
Yixuan Dong ⋅ Fang-Yi Su ⋅ Jung-Hsien Chiang
Tucson Ballroom & Prefunction Space 119
PALMS+: Modular Image-Based Floor Plan Localization Leveraging Depth Foundation Model Poster Session 5
Yunqian Cheng ⋅ Benjamin Princen ⋅ Roberto Manduchi
Tucson Ballroom & Prefunction Space 122
Knowledge to Sight: Reasoning over Visual Attributes via Knowledge Decomposition for Abnormality Grounding Poster Session 2 + Refreshments
Jun Li ⋅ Che Liu ⋅ Wenjia Bai ⋅ Mingxuan Liu ⋅ Rossella Arcucci ⋅ Cosmin Bercea ⋅ Julia Schnabel
Tucson Ballroom & Prefunction Space 90
AuViRe: Audio-visual Speech Representation Reconstruction for Deepfake Temporal Localization Poster Session 5
Christos Koutlis ⋅ Symeon Papadopoulos
Tucson Ballroom & Prefunction Space 130
T2VWorldBench: A Benchmark for Evaluating World Knowledge in Text-to-Video Generation Poster Session 5
Yubin Chen ⋅ Xuyang Guo ⋅ Zhenmei Shi ⋅ Zhao Song ⋅ Jiahao Zhang
Tucson Ballroom & Prefunction Space 65
IPTQ-ViT: Post-Training Quantization of Non-linear Functions for Integer-only Vision Transformers Poster Session 6 + Refreshments
Gihwan Kim ⋅ Jemin Lee ⋅ Hyungshin Kim
Tucson Ballroom & Prefunction Space 16
Locally Explaining Prediction Behavior via Gradual Interventions and Measuring Property Gradients Poster Session 6 + Refreshments
Niklas Penzel ⋅ Joachim Denzler
Tucson Ballroom & Prefunction Space 19
DM3Net: Dual-Camera Super-Resolution via Domain Modulation and Multi-scale Matching Poster Session 6 + Refreshments
Cong Guan ⋅ Jiacheng Ying ⋅ Osamu Yoshie ⋅ Yuya Ieiri
Tucson Ballroom & Prefunction Space 26
3D Cell Oversegmentation Correction via Geo-Wasserstein Divergence Poster Session 6 + Refreshments
Peter Chen ⋅ Bryan Chang ⋅ Olivia Creasey ⋅ Julie Sneddon ⋅ Zev Gartner ⋅ Yining Liu
Tucson Ballroom & Prefunction Space 32
brat: Aligned Multi-View Embeddings for Brain MRI Analysis Poster Session 5
Maxime Kayser ⋅ Maksim Gridnev ⋅ Wanting Wang ⋅ Max Bain ⋅ Aneesh Rangnekar ⋅ Avijit Chatterjee ⋅ Aleksandr Petrov ⋅ Harini Veeraraghavan ⋅ Nathaniel Swinburne
Tucson Ballroom & Prefunction Space 7
MMHOI: Modeling Complex 3D Multi-Human Multi-Object Interactions Poster Session 2 + Refreshments
Kaen Kazawa (Kogashi) ⋅ Anoop Cherian ⋅ Meng-Yu Jennifer Kuo
Tucson Ballroom & Prefunction Space 10
Guided Model Merging for Hybrid Data Learning: Leveraging Centralized Data to Refine Decentralized Models Poster Session 3
Junyi Zhu ⋅ Ruicong Yao ⋅ Taha Ceritli ⋅ Savas Ozkan ⋅ Matthew Blaschko ⋅ Eunchung Noh ⋅ Jeongwon Min ⋅ Cho Min ⋅ Mete Ozay
Tucson Ballroom & Prefunction Space 25
MedPEFT-CL: Dual-Phase Parameter-Efficient Continual Learning with Medical Semantic Adapter and Bidirectional Memory Consolidation Poster Session 6 + Refreshments
Ziyuan Gao ⋅ Philippe Morel
Tucson Ballroom & Prefunction Space 48
Test-Time Consistency in Vision Language Models Poster Session 6 + Refreshments
Shih-Han Chou ⋅ Shivam Chandhok ⋅ James Little ⋅ Leonid Sigal
Tucson Ballroom & Prefunction Space 56
DualRes: Production-ready Dynamic Object Detection Poster Session 6 + Refreshments
Jibril Hassani ⋅ Thomas Verelst
Tucson Ballroom & Prefunction Space 61
FastPose-ViT: A Vision Transformer for Real-Time Spacecraft Pose Estimation Poster Session 6 + Refreshments
Pierre Ancey ⋅ Andrew Price ⋅ Saqib Javed ⋅ Mathieu Salzmann
Tucson Ballroom & Prefunction Space 64
SAVE: Sparse Autoencoder-Driven Visual Information Enhancement for Mitigating Object Hallucination Poster Session 6 + Refreshments
Sangha Park ⋅ Seungryong Yoo ⋅ Jisoo Mok ⋅ Sungroh Yoon
Tucson Ballroom & Prefunction Space 70
Generalizing Sports Feedback Generation by Watching Competitions and Reading Books: A Rock Climbing Case Study Poster Session 6 + Refreshments
Arushi Rai ⋅ Adriana Kovashka
Tucson Ballroom & Prefunction Space 89
TriaGS: Differentiable Triangulation-Guided Geometric Consistency for 3D Gaussian Splatting Poster Session 6 + Refreshments
Quan Hong ⋅ Tuan Dang
Tucson Ballroom & Prefunction Space 113
Any Detector Can Detect Anything Poster Session 6 + Refreshments
Thomas Huang ⋅ Siyuan Li ⋅ Martin Danelljan ⋅ Henghui Ding ⋅ Luc Van Gool ⋅ Fisher Yu
Tucson Ballroom & Prefunction Space 117
SafeguardGS: 3D Gaussian Primitive Pruning While Avoiding Catastrophic Scene Destruction Poster Session 6 + Refreshments
Yongjae Lee ⋅ Zhaoliang Zhang ⋅ Deliang Fan
Tucson Ballroom & Prefunction Space 121
Scalpel: Fine-Grained Alignment of Attention Activation Manifolds via Mixture Gaussian Bridges to Mitigate Multimodal Hallucination Poster Session 3
Ziqiang Shi ⋅ Rujie Liu ⋅ Shanshan Yu ⋅ Satoshi Munakata ⋅ Koichi Shirahata
Tucson Ballroom & Prefunction Space 5
UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models Poster Session 5
Lan Chen ⋅ Yuchao Gu ⋅ Qi Mao
Tucson Ballroom & Prefunction Space 91
ENCORE: A Neural Collapse Perspective on Out-of-Distribution Detection in Deep Neural Networks Poster Session 3
A. Q. M. Sazzad Sayyed ⋅ Nathaniel Bastian ⋅ Francesco Restuccia
Tucson Ballroom & Prefunction Space 3
FlyPose: Towards Robust Human Pose Estimation From Aerial Views Poster Session 6 + Refreshments
Hassaan Farooq ⋅ Marvin Brenner ⋅ Peter Stütz
Tucson Ballroom & Prefunction Space 134
SVS-GAN for Semantic Synthesis of Traffic Videos for Autonomous Driving Poster Session 6 + Refreshments
Khaled Seyam ⋅ Julian Wiederer ⋅ Markus Braun ⋅ Bin Yang
Tucson Ballroom & Prefunction Space 137
FairScene: Learning Class-Disentangled 2D/3D Representations for Semantic Scene Completion Poster Session 3
Dian Jia ⋅ Pei Yu ⋅ Wei Tang
Tucson Ballroom & Prefunction Space 81
Towards Fine-Grained Adaptation of CLIP via a Self-Trained Alignment Score Poster Session 5
Eman Ali ⋅ Sathira Silva ⋅ Chetan Arora ⋅ Muhammad Haris Khan
Tucson Ballroom & Prefunction Space 8
Rethinking Latent Variable in Learned Image Compression Poster Session 6 + Refreshments
Fangzhou Yi ⋅ Zhicheng Gong ⋅ Hui Zeng
Tucson Ballroom & Prefunction Space 126
One-Cycle Structured Pruning via Stability-Driven Subnetwork Search Poster Session 4 + Reception
Deepak Ghimire ⋅ Dayoung Kil ⋅ Sunghwan Jeong ⋅ Jaesik Park ⋅ Seong-heum Kim
Tucson Ballroom & Prefunction Space 113
Frequency Is What You Need: Considering Word Frequency When Text Masking Benefits Vision-Language Model Pre-training Poster Session 3
Mingliang Liang ⋅ Martha Larson
Tucson Ballroom & Prefunction Space 82
SSMRadNet: A Sample-wise State-Space Framework for Efficient and Ultra-Light Radar Segmentation and Object Detection Poster Session 4 + Reception
Anuvab Sen ⋅ Mir Sayeed Mohammad ⋅ Saibal Mukhopadhyay
Tucson Ballroom & Prefunction Space 8
HOLO: Holistic Lightweight Optimization for Scene Understanding with Auto-Annotation and Multimodal Learning Poster Session 6 + Refreshments
Xiaoyun Hu ⋅ Xiaohan Yan ⋅ Nan Wang ⋅ Gang Wei ⋅ Zhicheng Wang
Tucson Ballroom & Prefunction Space 50
AEON: Adaptive Embedding Optimized Noise for Robust Watermarking in Diffusion Models Poster Session 4 + Reception
Muhammad Muneer ⋅ Simon Woo
Tucson Ballroom & Prefunction Space 107
Memory-Augmented Representation for Efficient Event-based Visuomotor Policy Learning with Adaptive Perception and Control Poster Session 2 + Refreshments
Uday Kamal ⋅ Saibal Mukhopadhyay
Tucson Ballroom & Prefunction Space 112
Hierarchical Instance Tracking to Balance Privacy Preservation with Accessible Information Poster Session 5
Neelima Prasad ⋅ Jarek Reynolds ⋅ Neel Karsanbhai ⋅ Tanusree Sharma ⋅ Lotus Zhang ⋅ Abigale Stangl ⋅ Yang Wang ⋅ Leah Findlater ⋅ Danna Gurari
Tucson Ballroom & Prefunction Space 14
FairVLM: Enhancing Fairness and Prompt Sensitivity in Vision Language Models for Medical Image Segmentation Poster Session 6 + Refreshments
Md Motiur Rahman ⋅ Saeka Rahman ⋅ Smriti Bhatt ⋅ Miad Faezipour
Tucson Ballroom & Prefunction Space 24
A Dataset and Framework for Learning State-invariant Object Representations Poster Session 4 + Reception
Rohan Sarkar ⋅ Avinash Kak
Tucson Ballroom & Prefunction Space 41
SuperRivolution: Fine-Scale Rivers from Coarse Temporal Satellite Imagery Poster Session 6 + Refreshments
Rangel Daroya ⋅ Subhransu Maji
Tucson Ballroom & Prefunction Space 27
SegMo: Segment-aligned Text to 3D Human Motion Generation Poster Session 5
Bowen Dang ⋅ Lin Wu ⋅ Xiaohang Yang ⋅ Zheng Yuan ⋅ Zhixiang Chen
Tucson Ballroom & Prefunction Space 109
TRACE: Confounder-free Adversarial Fine-tuning for Robust Object Detection Poster Session 5
Wonho Lee ⋅ Jisu Lee ⋅ Hyunsik Na ⋅ Sohee Park ⋅ Daeseon Choi
Tucson Ballroom & Prefunction Space 86
GeneVA: A Dataset of Human Annotations for Generative Text to Video Artifacts Poster Session 5
Jenna Kang ⋅ Maria Silva ⋅ Patsorn Sangkloy ⋅ Kenneth Chen ⋅ Niall Williams ⋅ Qi Sun
Tucson Ballroom & Prefunction Space 36
UCDSC: Open Set UnCertainty aware Deep Simplex Classifier for Medical Image Datasets Poster Session 4 + Reception
Arnav Aditya ⋅ Nitin Kumar ⋅ Saurabh Shigwan
Tucson Ballroom & Prefunction Space 48
ART-ASyn: Anatomy-aware Realistic Texture-based Anomaly Synthesis Framework for Chest X-Rays Poster Session 3
Qinyi Cao ⋅ Jianan Fan ⋅ Weidong Cai
Tucson Ballroom & Prefunction Space 84
Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models Poster Session 3
Prin Phunyaphibarn ⋅ Phillip Lee ⋅ Jaihoon Kim ⋅ Minhyuk Sung
Tucson Ballroom & Prefunction Space 102
Temporal Object Captioning for Street Scene Videos from LiDAR Tracks Poster Session 2 + Refreshments
Vignesh Gopinathan ⋅ Urs Zimmermann ⋅ Michael Arnold ⋅ Matthias Rottmann
Tucson Ballroom & Prefunction Space 136
Hybrid State Representation for Video Procedure Planning Poster Session 3
Woo Suk Choi ⋅ Youwon Jang ⋅ Minsu Lee ⋅ Byoung-Tak Zhang
Tucson Ballroom & Prefunction Space 135