This page is cached for 1 hour. Changes to affiliation or name in your local profile may take up to 60 minutes to appear here.
|
FuLLaMa: Training-free Diffusion-based Object Removal with Context Preservation
Poster Session 6 + Refreshments
Ilke Demir ⋅ Umur Ciftci
|
Tucson Ballroom & Prefunction Space 129 | |
|
FLARES: Fast and Accurate LiDAR Multi-Range Semantic Segmentation
Poster Session 3
Bin Yang ⋅ Alexandru Condurache
|
Tucson Ballroom & Prefunction Space 51 | |
|
DocWaveDiff: A Predict-and-Refine approch for Document Image Enhancement with Wavelet U-Nets and Diffusion models
Poster Session 6 + Refreshments
Matteo Marulli ⋅ Marco Bertini
|
Tucson Ballroom & Prefunction Space 124 | |
|
Photo Dating by Facial Age Aggregation
Poster Session 6 + Refreshments
Jakub Paplham ⋅ Vojtech Franc
|
Tucson Ballroom & Prefunction Space 86 | |
|
MergeSlide: Continual Model Merging and Task-to-Class Prompt-Aligned Inference for Lifelong Learning on Whole Slide Images
Poster Session 4 + Reception
Bui Cao Doanh ⋅ Ba Ngo ⋅ Pham Luan ⋅ Khang Nguyen ⋅ Mai Nguyen ⋅ Yasuhiko Nakashima
|
Tucson Ballroom & Prefunction Space 55 | |
|
MorphXAI: An Explainable Framework for Morphological Analysis of Parasites in Blood Smear Images
Poster Session 2 + Refreshments
Aqsa Yousaf ⋅ Sint Sint Win ⋅ Megan Coffee ⋅ Habeeb Olufowobi
|
Tucson Ballroom & Prefunction Space 68 | |
|
Trajectory Tactics: When Transformers Learn Exploration to Generate Online Signature
Poster Session 2 + Refreshments
Anurag Pandey ⋅ Aditya Nigam ⋅ Arnav Bhavsar ⋅ Ashutosh Sharma ⋅ Basu Verma ⋅ Divya Acharya ⋅ Mohd Amir
|
Tucson Ballroom & Prefunction Space 85 | |
|
SCAdapter: Content-Style Disentanglement for Diffusion Style Transfer
Poster Session 6 + Refreshments
Luan Thanh Trinh
|
Tucson Ballroom & Prefunction Space 11 | |
|
Generalized Category Discovery for LiDAR Semantic Segmentation
Poster Session 6 + Refreshments
Minseok Kim ⋅ Jiyong Boo ⋅ Kuk-Jin Yoon
|
Tucson Ballroom & Prefunction Space 115 | |
|
ART: Actor-Related Tubelet for Detecting Complex-shaped Action Tubes
Poster Session 1
Jiaojiao Zhao
|
Tucson Ballroom & Prefunction Space 30 | |
|
MBTI: Metric-Based Textual Inversion for Fine-Grained Image Generation
Poster Session 1
ByungKwan Chae ⋅ Youngjae Choi ⋅ Heewon Kim
|
Tucson Ballroom & Prefunction Space 106 | |
|
VRAgent: Self-Refining Agent for Zero-Shot Multimodal Video Retrieval
Poster Session 6 + Refreshments
Ketul Shah ⋅ Pankaj Nathani ⋅ Rama Chellappa ⋅ Fabian Caba Heilbron
|
Tucson Ballroom & Prefunction Space 91 | |
|
FocalComm: Hard Instance-Aware Multi-Agent Perception
Poster Session 5
Dereje Shenkut ⋅ Vijayakumar Bhagavatula
|
Tucson Ballroom & Prefunction Space 46 | |
|
CommonForms: A Large, Diverse Dataset for Form Field Detection
Poster Session 1
Joe Barrow
|
Tucson Ballroom & Prefunction Space 112 | |
|
MuseDance: A Diffusion-based Music-Driven Image Animation System
Poster Session 3
Zhikang Dong ⋅ Weituo Hao ⋅ Ju-Chiang Wang ⋅ Peng Zhang ⋅ Pawel Polak
|
Tucson Ballroom & Prefunction Space 86 | |
|
ZebraPose: Zebra Detection and Pose Estimation using only Synthetic Data
Poster Session 5
Elia Bonetto ⋅ Aamir Ahmad
|
Tucson Ballroom & Prefunction Space 78 | |
|
SmoothDiffusion-VE: Real-time Generative Video Editing Using Adaptive Feature Cache
Poster Session 6 + Refreshments
Mustafa Munir ⋅ Sophia Zalewski ⋅ Shiqiu Liu ⋅ David Tarjan ⋅ Sushmitha Belede ⋅ Anjul Patney ⋅ Radu Marculescu
|
Tucson Ballroom & Prefunction Space 120 | |
|
An improved architecture for part-based animal re-identification through semantic segmentation distillation
Poster Session 4 + Reception
Eugênio Dias Ribeiro Neto ⋅ Marc Chaumont ⋅ Gérard Subsol ⋅ Michel Garine-Wichatitsky ⋅ Hélène Guis
|
Tucson Ballroom & Prefunction Space 95 | |
|
Towards High-Fidelity, Identity-Preserving Real-Time Makeup Transfer: Decoupling Style Generation
Poster Session 3
Kin Chau Lydia Chau ⋅ Zhi Yu ⋅ Ruowei Jiang
|
Tucson Ballroom & Prefunction Space 64 | |
|
MMCM: Multimodality-aware Metric using Clustering-based Modes for Probabilistic Human Motion Prediction
Poster Session 2 + Refreshments
Kyotaro Tokoro ⋅ Hiromu Taketsugu ⋅ Norimichi Ukita
|
Tucson Ballroom & Prefunction Space 116 | |
|
FARF-Net: Frequency-guided Adaptive Receptive Field Network for Edge-enhanced Polyp Segmentation
Poster Session 2 + Refreshments
Xue Li ⋅ Aiwen Jiang ⋅ Hongqian Yu ⋅ Xiao Yang
|
Tucson Ballroom & Prefunction Space 88 | |
|
VOCAL: Visual Odometry via ContrAstive Learning
Poster Session 3
Chi-Yao Huang ⋅ Zeel Bhatt ⋅ “YZ” Yezhou Yang
|
Tucson Ballroom & Prefunction Space 36 | |
|
Deep Image Decomposition for Medical Imaging Anonymization and Curation
Poster Session 6 + Refreshments
Yael Elkin ⋅ Gal Arie ⋅ Tammy Raviv Raviv
|
Tucson Ballroom & Prefunction Space 3 | |
|
Fast Vision Mamba: Pooling Spatial Dimensions for Accelerated Processing
Poster Session 3
Saarthak Kapse ⋅ Robin Betz ⋅ Srinivasan Sivanandan
|
Tucson Ballroom & Prefunction Space 1 | |
|
CoreCaption: Core Caption based Text-to-Video Retrieval
Poster Session 5
Junkyu Jang
|
Tucson Ballroom & Prefunction Space 77 | |
|
Subspace-Guided Knowledge Distillation for Efficient Model Transfer
Poster Session 4 + Reception
Zeeshan Hayder ⋅ Ali Cheraghian ⋅ Lars Petersson ⋅ Mehrtash Harandi
|
Tucson Ballroom & Prefunction Space 74 | |
|
AGENet: Adaptive Edge-aware Geodesic Distance Learning for Few-Shot Medical Image Segmentation
Poster Session 3
ZIYUAN GAO
|
Tucson Ballroom & Prefunction Space 131 | |
|
PerVL-Bench: Benchmarking Multimodal Personalization for Large Vision–Language Models
Poster Session 5
Minsung Kim
|
Tucson Ballroom & Prefunction Space 85 | |
|
Histopath-C: Towards Realistic Domain Shifts for Histopathology Vision-Language Adaptation
Poster Session 4 + Reception
Mehrdad Noori ⋅ Gustavo Vargas Hakim ⋅ David OSOWIECHI ⋅ Fereshteh Shakeri ⋅ Ali Bahri ⋅ Moslem Yazdanpanah ⋅ Sahar Dastani ⋅ Ismail Ayed ⋅ Christian Desrosiers
|
Tucson Ballroom & Prefunction Space 58 | |
|
Training-Free Few-Shot Segmentation via Vision-Language Guided Prompting
Poster Session 5
Euihyun Yoon ⋅ Taejin Park ⋅ Jaekoo Lee
|
Tucson Ballroom & Prefunction Space 69 | |
|
SimForce: Force and Surface Electromyography from Full Body Video with Graph Neural Nets
Poster Session 3
Esha Dasgupta ⋅ Boeun Kim ⋅ Sang-Hoon Yeo ⋅ Hyung Jin Chang
|
Tucson Ballroom & Prefunction Space 38 | |
|
Virtually Unrolling the Herculaneum Papyri by Diffeomorphic Spiral Fitting
Poster Session 5
Paul Henderson
|
Tucson Ballroom & Prefunction Space 58 | |
|
Adversarial Pseudo-replay for Exemplar-free Class-incremental Learning
Poster Session 6 + Refreshments
Hiroto Honda
|
Tucson Ballroom & Prefunction Space 28 | |
|
SAFER-AiD: Saccade-Assisted Foveal-peripheral vision Enhanced Reconstruction for Adversarial Defense
Poster Session 2 + Refreshments
Jiayang Liu ⋅ Daniel Tso ⋅ Yiming Bu ⋅ Qinru Qiu
|
Tucson Ballroom & Prefunction Space 30 | |
|
Towards Unconstrained Cross-View Pose Estimation
Poster Session 6 + Refreshments
Alexander Wollam ⋅ Kyle Ashley ⋅ Maxim Shugaev ⋅ Oliver Arend ⋅ Ilya Semenov ⋅ Hadis Dashtestani ⋅ Sumved Ravi ⋅ Nathan Jacobs
|
Tucson Ballroom & Prefunction Space 118 | |
|
PromptGAR: Flexible Promptive Group Activity Recognition
Poster Session 4 + Reception
Zhangyu Jin ⋅ Andrew Feng ⋅ Ankur Chemburkar ⋅ Celso de Melo
|
Tucson Ballroom & Prefunction Space 17 | |
|
Delta-LLaVA: Base-then-Specialize Alignment for Token-Efficient Vision-Language Models
Poster Session 3
Mohamad Zamini ⋅ Diksha Shukla
|
Tucson Ballroom & Prefunction Space 70 | |
|
Spec-Gloss Surfels and Normal–Diffuse Priors for Relightable Glossy Objects
Poster Session 4 + Reception
Georgios Kouros ⋅ Minye Wu ⋅ Tinne Tuytelaars
|
Tucson Ballroom & Prefunction Space 13 | |
|
Enhancing Reverse Distillation with Core Exemplar Learning for Unified Multi-Class Anomaly Detection
Poster Session 6 + Refreshments
Heechul Lim ⋅ Min-Soo Kim ⋅ Hyun-Boo Lee ⋅ Suk-Ju Kang ⋅ Kang-Wook Chon ⋅ Haeyun Lee
|
Tucson Ballroom & Prefunction Space 37 | |
|
Human knowledge integrated multi-modal learning for single source domain generalization
Poster Session 2 + Refreshments
Ayan Banerjee ⋅ Kuntal Thakur ⋅ Sandeep Gupta
|
Tucson Ballroom & Prefunction Space 92 | |
|
OpenCowID: Zero-Shot Visual Identification of Dairy Cows
Poster Session 2 + Refreshments
Omkar Prabhune ⋅ Younghyun Kim
|
Tucson Ballroom & Prefunction Space 8 | |
|
PaRaChute: Pathology-Radiology Cross-Modal Fusion for Missing-Modality-Robust Survival Prediction
Poster Session 1
Pietro Caforio ⋅ Isabella Poles ⋅ Marco Santambrogio
|
Tucson Ballroom & Prefunction Space 69 | |
|
3DSceneEditor: Controllable 3D Scene Editing with Gaussian Splatting
Poster Session 2 + Refreshments
Ziyang Yan ⋅ Yihua Shao ⋅ Minwen Liao ⋅ Siyu Chen ⋅ Nan Wang ⋅ Muyuan Lin ⋅ Jenq-Neng Hwang ⋅ Hao Zhao ⋅ Fabio Remondino ⋅ Lei Li
|
Tucson Ballroom & Prefunction Space 42 | |
|
Enhancing Visual Planning with Auxiliary Tasks and Multi-token Prediction
Poster Session 3
Ce Zhang ⋅ Yale Song ⋅ Ruta Desai ⋅ Michael Iuzzolino ⋅ Joseph Tighe ⋅ Gedas Bertasius ⋅ Satwik Kottur
|
Tucson Ballroom & Prefunction Space 122 | |
|
Alignment and Distillation: A Robust Framework for Multimodal Domain Generalizable Human Action Recognition
Poster Session 5
Hyeonbin Ji ⋅ Juyeob Lee ⋅ Eunil Park
|
Tucson Ballroom & Prefunction Space 106 | |
|
BAFLE-DCT: Bypassing Adversarial Filters via Frequency-Selective Embedding in the DCT Domain
Poster Session 5
Balapuwaduge Mendis ⋅ Farah Kandah ⋅ Sathya Aakur
|
Tucson Ballroom & Prefunction Space 16 | |
|
Grounding Descriptions in Images informs Zero-Shot Visual Recognition
Poster Session 4 + Reception
Shaunak Halbe ⋅ Junjiao Tian ⋅ Joseph J ⋅ James Smith ⋅ Katherine Stevo ⋅ Vineeth Balasubramanian ⋅ Zsolt Kira
|
Tucson Ballroom & Prefunction Space 133 | |
|
A Universal Self-Attention Enhancement for Bridging Low-bit Quantization and Vision Transformers
Poster Session 1
Jiahe Qian ⋅ Peisong Wang ⋅ Zhengyang Zhuge ⋅ Qinghao Hu ⋅ Jian Cheng
|
Tucson Ballroom & Prefunction Space 35 | |
|
Joint Optimization of Camera Model and Deep Neural Network for Image Recognition
Poster Session 6 + Refreshments
Youta Noboru ⋅ Yuko Ozasa ⋅ Masayuki Tanaka
|
Tucson Ballroom & Prefunction Space 41 | |
|
Low-Rank Expert Merging for Multi-Source Domain Adaptation in Person Re-Identification
Poster Session 2 + Refreshments
Taha Mustapha Nehdi ⋅ Nairouz Mrabah ⋅ ATIF BELAL ⋅ Marco Pedersoli ⋅ Eric Granger
|
Tucson Ballroom & Prefunction Space 38 | |
|
SasMamba: A Lightweight Structure-Aware Stride State Space Model for 3D Human Pose Estimation
Poster Session 2 + Refreshments
Hu Cui ⋅ Wenqiang Hua ⋅ Renjing Huang ⋅ ShuRui Jia ⋅ Tessai Hayama
|
Tucson Ballroom & Prefunction Space 124 | |
|
The Perceptual Observatory Characterizing Robustness and Grounding in MLLMs
Poster Session 2 + Refreshments
Tejas Anvekar ⋅ Fenil Bardoliya ⋅ Pavan Turaga ⋅ Chitta Baral ⋅ Vivek Gupta
|
Tucson Ballroom & Prefunction Space 23 | |
|
Procedure Learning via Regularized Gromov-Wasserstein Optimal Transport
Poster Session 5
Syed Mahmood ⋅ Ali Ali ⋅ Umer Ahmed ⋅ Fawad Fateh ⋅ Zeeshan Zia ⋅ Quoc-Huy Tran
|
Tucson Ballroom & Prefunction Space 107 | |
|
Optimal Transport for Rectified Flow Image Editing: Unifying Inversion-Based and Direct Methods
Poster Session 5
Marian Lupaşcu ⋅ Mihai-Sorin Stupariu
|
Tucson Ballroom & Prefunction Space 92 | |
|
PRISM-CAFO: Prior-conditioned Remote-sensing Infrastructure Segmentation and Mapping for CAFOs
Poster Session 2 + Refreshments
Oishee Bintey Hoque ⋅ Nibir Mandal ⋅ Kyle Luong ⋅ Mandy Wilson ⋅ Samarth Swarup ⋅ Madhav Marathe ⋅ Abhijin Adiga
|
Tucson Ballroom & Prefunction Space 64 | |
|
Diffusion Noise Optimization for Synthetic VLM Training
Poster Session 5
Ren Ohkubo ⋅ Rintaro Yanagi ⋅ Hirokatsu Kataoka ⋅ Yutaka Satoh
|
Tucson Ballroom & Prefunction Space 59 | |
|
Federated Model Synchronization for Diagnostic Redefinition through a Novel Selective Parameter Unlearning
Poster Session 1
Mayank Kundalwal Kundalwal ⋅ Mamta Mamta ⋅ Deepak Mishra ⋅ Asif Ekbal
|
Tucson Ballroom & Prefunction Space 134 | |
|
MapleGrasp: Mask-guided Feature Pooling for Language-driven Efficient Robotic Grasping
Poster Session 6 + Refreshments
Vineet Bhat ⋅ Naman Patel ⋅ Prashanth Krishnamurthy ⋅ Ramesh Karri ⋅ Farshad Khorrami
|
Tucson Ballroom & Prefunction Space 34 | |
|
Multi-view stereo with multiple projectors for oneshot entire shape scan based on Neural SDF and DSSS demultiplexing
Poster Session 4 + Reception
Kota Nishihara ⋅ Ryo Furukawa ⋅ Ryusuke Sagawa ⋅ Hiroshi Kawasaki
|
Tucson Ballroom & Prefunction Space 115 | |
|
Interaction-via-Actions: Cattle Interaction Detection with Joint Learning of Action-Interaction Latent Space
Poster Session 2 + Refreshments
Ren Nakagawa ⋅ Yang Yang ⋅ Risa Shinoda ⋅ Hiroaki Santo ⋅ Kenji Oyama ⋅ Fumio Okura ⋅ Takenao Ohkawa
|
Tucson Ballroom & Prefunction Space 54 | |
|
1LoRA: Summation Compression for Very-Low Rank Adaptation
Poster Session 2 + Refreshments
Alessio Quercia ⋅ Zhuo Cao ⋅ Arya Bangun ⋅ Richard Paul ⋅ Abigail Morrison ⋅ Ira Assent ⋅ Hanno Scharr
|
Tucson Ballroom & Prefunction Space 80 | |
|
SeqFeedNet: Sequential Feature Feedback Network for Background Subtraction
Poster Session 6 + Refreshments
Yu-Shun Huang ⋅ Yu-Shun Huang ⋅ Yi-Xiang Yang
|
Tucson Ballroom & Prefunction Space 95 | |
|
Understanding the Visual Projection Space of Multimodal LLMs
Poster Session 5
SungHeon Jeong ⋅ Yoojeong Song ⋅ Yoojeong Song
|
Tucson Ballroom & Prefunction Space 24 | |
|
Real-Time Tracking of Flexible Markers in Low-Contrast Fluoroscopy Using a Deep Neural Network Trained Solely on Synthetic Data
Poster Session 2 + Refreshments
Tomoki Uchiyama ⋅ Yukinobu Sakata ⋅ Ryusuke Hirai ⋅ Hitoshi Ishikawa ⋅ Shinichiro Mori
|
Tucson Ballroom & Prefunction Space 119 | |
|
DRWKV: Focusing on Object Edges for Low-Light Image Enhancement
Poster Session 2 + Refreshments
Xuecheng Bai ⋅ Yuxiang Wang ⋅ Boyu Hu ⋅ Qinyuan Jie ⋅ Chuanzhi Xu ⋅ Kechen Li ⋅ Hongru Xiao ⋅ Yuk Chung
|
Tucson Ballroom & Prefunction Space 14 | |
|
A Multi-Agent Diffusion Approach for MRI Anomaly Segmentation via Modality-Specific LoRA Specialization
Poster Session 1
Wafa Ghallabi ⋅ Muhammad Zaigham Zaheer ⋅ Ritesh Thawkar ⋅ Omkar Thawakar ⋅ Salman Khan ⋅ Fahad Khan
|
Tucson Ballroom & Prefunction Space 13 | |
|
Event-based Graph Representation with Spatial and Motion Vectors for Asynchronous Object Detection
Poster Session 3
Aayush Verma ⋅ Arpitsinh Vaghela ⋅ Bharatesh Chakravarthi ⋅ Kaustav Chanda ⋅ “YZ” Yezhou Yang
|
Tucson Ballroom & Prefunction Space 83 | |
|
OSEG: Improving Diffusion sampling through Orthogonal Smoothed Energy Guidance
Poster Session 5
Masud Fahim ⋅ Nazmus Saqib ⋅ JOON-MIN GIL
|
Tucson Ballroom & Prefunction Space 19 | |
|
SGPMIL: Sparse Gaussian Process Multiple Instance Learning
Poster Session 1
Andreas Lolos ⋅ Stergios Christodoulidis ⋅ Aris Moustakas ⋅ Jose Dolz ⋅ Maria Vakalopoulou
|
Tucson Ballroom & Prefunction Space 49 | |
|
CAST: Evaluating Multi-Object Trackers with Context-Aware Switch and Transfer Scores
Poster Session 6 + Refreshments
Jin Bai ⋅ Gregory Hager
|
Tucson Ballroom & Prefunction Space 6 | |
|
M-ErasureBench: A Comprehensive Multimodal Evaluation Benchmark for Concept Erasure in Diffusion Models
Poster Session 1
Ju-Hsuan Weng ⋅ Jia-Wei Liao ⋅ Cheng-Fu Chou ⋅ Jun-Cheng Chen
|
Tucson Ballroom & Prefunction Space 51 | |
|
Unified Control for Inference-Time Guidance of Denoising Diffusion Models
Poster Session 4 + Reception
Maurya Goyal ⋅ Anuj Singh ⋅ Hadi Rad
|
Tucson Ballroom & Prefunction Space 110 | |
|
EVTP-IVS: Effective Visual Token Pruning For Unifying Instruction Visual Segmentation In Multi-Modal Large Language Models
Poster Session 5
Wenhui Zhu ⋅ Xiwen Chen ⋅ Zhipeng Wang ⋅ Shao Tang ⋅ Sayan Ghosh ⋅ XUANZHAO DONG ⋅ Rajat Koner ⋅ Yalin Wang
|
Tucson Ballroom & Prefunction Space 129 | |
|
SceneEval: Evaluating Semantic Coherence in Text-Conditioned 3D Indoor Scene Synthesis
Poster Session 6 + Refreshments
Hou In Ivan Tam ⋅ Hou In Derek Pun ⋅ Austin Wang ⋅ Angel Chang ⋅ Manolis Savva
|
Tucson Ballroom & Prefunction Space 15 | |
|
InteracTalker: Prompt-Based Human-Object Interaction with Co-Speech Gesture Generation
Poster Session 2 + Refreshments
Sreehari Rajan ⋅ Kunal Bhosikar ⋅ Charu Sharma
|
Tucson Ballroom & Prefunction Space 3 | |
|
BrandFusion: Aligning Image Generation with Brand Styles
Poster Session 2 + Refreshments
Parul Gupta ⋅ Varun Khurana ⋅ Yaman Singla ⋅ Balaji Krishnamurthy ⋅ Abhinav Dhall
|
Tucson Ballroom & Prefunction Space 86 | |
|
From Cognitive Priors to Instance Semantics: A Unified Framework for Multi-task Affective Computing
Poster Session 6 + Refreshments
Guanyu Hu ⋅ Dimitrios Kollias ⋅ Xinyu Yang
|
Tucson Ballroom & Prefunction Space 128 | |
|
CalibBEV: LiDAR-Camera Calibration via BEV Alignment
Poster Session 4 + Reception
Filippo D'Addeo ⋅ Lorenzo Cipelli ⋅ Adriano Cardace ⋅ Emanuele Ghelfi ⋅ Andrea Zinelli ⋅ Massimo Bertozzi
|
Tucson Ballroom & Prefunction Space 6 | |
|
ITSELF: Attention Guided Fine-Grained Alignment for Vision–Language Retrieval
Poster Session 2 + Refreshments
TIEN-HUY NGUYEN ⋅ Huu-Loc Tran ⋅ Thanh Ngo
|
Tucson Ballroom & Prefunction Space 4 | |
|
Detecting Out-of-Distribution Objects through Class-Conditioned Inpainting
Poster Session 2 + Refreshments
Quang-Huy Nguyen ⋅ Jin Peng Zhou ⋅ Zhenzhen Liu ⋅ Khanh-Huyen Bui ⋅ Kilian Weinberger ⋅ Wei-Lun Chao ⋅ Dung Le
|
Tucson Ballroom & Prefunction Space 50 | |
|
Image-Guided Semantic Pseudo-LiDAR Point Generation for 3D Object Detection
Poster Session 5
MINSEUNG LEE ⋅ Seokha Moon ⋅ Seung Lee ⋅ Reza Mahjourian ⋅ Jinkyu Kim
|
Tucson Ballroom & Prefunction Space 127 | |
|
Structured Context Learning for Generic Event Boundary Detection
Poster Session 4 + Reception
Xin Gu ⋅ Congcong Li ⋅ Xinyao Wang ⋅ Dexiang Hong ⋅ Libo Zhang ⋅ Tiejian Luo ⋅ Longyin Wen ⋅ Heng Fan
|
Tucson Ballroom & Prefunction Space 50 | |
|
MooTrack360: A Novel Fisheye Camera Dataset for Robust Multi Diary Cow Detection and Tracking
Poster Session 1
Rasmus Christiansen ⋅ Toan Nguyen ⋅ Lasse Malskær ⋅ Leon Bodenhagen ⋅ Dirk Kraft
|
Tucson Ballroom & Prefunction Space 44 | |
|
Saliency-Guided DETR for Moment Retrieval and Highlight Detection
Poster Session 1
Aleksandr Gordeev ⋅ Vladimir Dokholyan ⋅ Irina Tolstykh ⋅ Maksim Kuprashevich
|
Tucson Ballroom & Prefunction Space 87 | |
|
Gaussian Representations for Video
Poster Session 1
Sachin Shah ⋅ Anustup Choudhury ⋅ Guan-Ming Su ⋅ Jaclyn Pytlarz ⋅ Christopher Metzler ⋅ Trisha Mittal
|
Tucson Ballroom & Prefunction Space 79 | |
|
SVD-Det: A Lightweight Framework for Video Forgery Detection Using Semantic and Visual Defect Cues
Poster Session 6 + Refreshments
Tsung-Shan Yang ⋅ Tianyu Zhang ⋅ Feng Qian ⋅ Bing Yan ⋅ Chung Chieh Kuo
|
Tucson Ballroom & Prefunction Space 40 | |
|
Semi-supervised Domain Adaptation via Mutual Alignment through Joint Error
Poster Session 4 + Reception
Dexuan Zhang ⋅ Thomas Westfechtel ⋅ Tatsuya Harada
|
Tucson Ballroom & Prefunction Space 109 | |
|
Lose Your Self (LoYS): an adversarial entropy-based unsupervised approach for model debiasing
Poster Session 4 + Reception
Vito Paolo Pastore ⋅ Massimiliano Ciranni ⋅ Vittorio Murino
|
Tucson Ballroom & Prefunction Space 137 | |
|
Learning Mask-Aware Offsets: Two-branch Deformable Attention Networks for Inpainting with Masked Region Avoidance
Poster Session 1
Hyeongseok Oh ⋅ Joonki Paik
|
Tucson Ballroom & Prefunction Space 98 | |
|
TiCLS : Tightly Coupled Language Text Spotter
Poster Session 3
Leeje Jang ⋅ Yijun Lin ⋅ Yao-Yi Chiang ⋅ Jerod Weinman
|
Tucson Ballroom & Prefunction Space 78 | |
|
EmojiDiff: Advanced Facial Expression Control with High Identity Preservation in Portrait Generation
Poster Session 1
Liangwei Jiang ⋅ Ruida Li ⋅ Zhifeng Zhang ⋅ Shuo Fang ⋅ Chenguang Ma
|
Tucson Ballroom & Prefunction Space 32 | |
|
Towards Reliable Test-Time Adaptation: Style Invariance as a Correctness Likelihood
Poster Session 3
Gilhyun Nam ⋅ Taewon Kim ⋅ Joonhyun Jeong ⋅ Eunho Yang
|
Tucson Ballroom & Prefunction Space 16 | |
|
4D Multimodal Co-attention Fusion Network with Latent Contrastive Alignment for Alzheimer's Diagnosis
Poster Session 4 + Reception
YUXIANG WEI ⋅ Yanteng Zhang ⋅ Xi Xiao ⋅ Tianyang Wang ⋅ Xiao Wang ⋅ Vince Calhoun
|
Tucson Ballroom & Prefunction Space 112 | |
|
DNA: Dual-branch Network with Adaptation for Open-Set Online Handwriting Generation
Poster Session 3
Tsai-Ling Huang ⋅ Nhat-Tuong Do-Tran ⋅ Ngoc-Hoang-Lam Le ⋅ Hong-Han Shuai ⋅ Ching-Chun Huang
|
Tucson Ballroom & Prefunction Space 120 | |
|
Advancing Multimodal LLMs by Large-Scale 3D Visual Instruction Dataset Generation
Poster Session 5
Liu He ⋅ Xiao Zeng ⋅ Yizhi Song ⋅ Albert Chen ⋅ Lu Xia ⋅ Shashwat Verma ⋅ Sankalp Dayal ⋅ Min Sun ⋅ Cheng-Hao Kuo ⋅ Daniel Aliaga
|
Tucson Ballroom & Prefunction Space 9 | |
|
Towards Photorealistic Style Transfer with Multimodal Guidance and Robustness to Content Images in Arbitrary Styles
Poster Session 4 + Reception
Ruikai Zhou ⋅ Yating Liu ⋅ Yi Xu
|
Tucson Ballroom & Prefunction Space 35 | |
|
Optimizing against Infeasible Inclusions from Data for Semantic Segmentation through Morphology
Poster Session 6 + Refreshments
Shamik Basu ⋅ Luc Van Gool ⋅ Christos Sakaridis
|
Tucson Ballroom & Prefunction Space 31 | |
|
Flood-LDM: Generalizable Latent Diffusion Models for rapid and accurate zero-shot High-Resolution Flood Mapping
Poster Session 6 + Refreshments
Sun Han Neo ⋅ Sachith Seneviratne ⋅ Herath Mudiyanselage Viraj Vidura Herath ⋅ Abhishek Saha ⋅ Sanka Rasnayaka ⋅ Lucy Marshall
|
Tucson Ballroom & Prefunction Space 82 | |
|
UniGaze: Towards Universal Gaze Estimation via Large-scale Pre-Training
Poster Session 5
Jiawei Qin ⋅ Xucong Zhang ⋅ Yusuke Sugano
|
Tucson Ballroom & Prefunction Space 2 | |
|
ODEt(ODEl): Shortcutting the Time and the Length in Diffusion and Flow Models for Faster Sampling
Poster Session 5
Denis Gudovskiy ⋅ Wenzhao Zheng ⋅ Tomoyuki Okuno ⋅ Yohei Nakata ⋅ Kurt Keutzer
|
Tucson Ballroom & Prefunction Space 30 | |
|
JOCA: Task-Driven Joint Optimisation of Camera Hardware and Adaptive Camera Control Algorithms
Poster Session 3
Chengyang Yan ⋅ Mitch Bryson ⋅ Donald Dansereau
|
Tucson Ballroom & Prefunction Space 97 | |
|
BiPO: Bidirectional Partial Occlusion Network for Text-to-Motion Synthesis
Poster Session 1
Seong-Eun Hong ⋅ SooBin Lim ⋅ JuYeong Hwang ⋅ Minwook Chang ⋅ Hyeongyeop Kang
|
Tucson Ballroom & Prefunction Space 4 | |
|
PHYSPLAT: a Framework for Photorealistic Hybrid Simulation of Real and Synthetic Elements using 3D Gaussian Splatting
Poster Session 2 + Refreshments
Mario Alfonso-Arsuaga ⋅ Henar Dominguez-Elvira ⋅ Jorge Guerrero ⋅ Andrea Castiella-Aguirrezabala ⋅ Lorenzo Domínguez ⋅ Jorge García-González ⋅ Maria Naranjo-Almeida ⋅ Marc Comino-Trinidad ⋅ Jorge Lopez-Moreno
|
Tucson Ballroom & Prefunction Space 20 | |
|
Uplifting Table Tennis: A Robust, Real-World Application for 3D Trajectory and Spin Estimation
Poster Session 6 + Refreshments
Daniel Kienzle ⋅ Katja Ludwig ⋅ Julian Lorenz ⋅ Shin'ichi Satoh ⋅ Rainer Lienhart
|
Tucson Ballroom & Prefunction Space 23 | |
|
AUTOCORRELATION-BASED FIDUCIAL MARKERS FOR TRACEABILITY
Poster Session 1
BENCHEIKH ISMAIL ⋅ Max Dunitz ⋅ Marie d'Autume ⋅ Marc Pic ⋅ Enric Meinhardt-Llopis ⋅ Gabriele Facciolo ⋅ Pablo Musé
|
Tucson Ballroom & Prefunction Space 129 | |
|
QC-SF: Improving Computer Vision for Airborne LiDAR Point Clouds of Boreal Forests with Quebec Simulated Forest Dataset
Poster Session 4 + Reception
Olivier Stocker ⋅ Reza Mahmoudi Kouhi ⋅ Omid Reisi Gahrouei ⋅ Thierry Badard ⋅ Eric Guilbert
|
Tucson Ballroom & Prefunction Space 71 | |
|
ControlEvents: Controllable Synthesis of Event Camera Data with Foundational Prior from Image Diffusion Models
Poster Session 4 + Reception
Yixuan Hu ⋅ Yuxuan Xue ⋅ Simon Klenk ⋅ Daniel Cremers ⋅ Gerard Pons-Moll
|
Tucson Ballroom & Prefunction Space 117 | |
|
SurfDist: Interpretable Three-Dimensional Instance Segmentation Using Curved Surface Patches
Poster Session 4 + Reception
Jackson Borchardt ⋅ Saul Kato
|
Tucson Ballroom & Prefunction Space 120 | |
|
ReBrain: Brain MRI Reconstruction from Sparse CT Slice via Retrieval-Augmented Diffusion
Poster Session 3
Junming Liu ⋅ Yifei Sun ⋅ Weihua Cheng ⋅ Yujin Kang ⋅ Yirong Chen ⋅ Ding Wang ⋅ Guosun Zeng
|
Tucson Ballroom & Prefunction Space 104 | |
|
MSRTrack: LLM-Powered Object Tracking with Motion and Semantic Reasoning
Poster Session 1
Tong Shen ⋅ Di Wang ⋅ José Moura
|
Tucson Ballroom & Prefunction Space 80 | |
|
CONCORD: Concept-Informed Diffusion for Dataset Distillation
Poster Session 4 + Reception
Jianyang Gu ⋅ Haonan Wang ⋅ Ruoxi Jia ⋅ Saeed Vahidian ⋅ Vyacheslav Kungurtsev ⋅ Wei Jiang ⋅ Yiran Chen
|
Tucson Ballroom & Prefunction Space 93 | |
|
Detection-Driven Object Count Optimization for Text-to-Image Diffusion Models
Poster Session 2 + Refreshments
Oz Zafar ⋅ Yuval Cohen ⋅ Lior Wolf ⋅ Idan Schwartz
|
Tucson Ballroom & Prefunction Space 45 | |
|
Accelerated Dose Generation in Gamma Knife Radiosurgery Using a Wavelet Diffusion Model for Sparse Representation
Poster Session 1
Sangyoon Lee ⋅ Shubhendu Mishra ⋅ Yoichi Watanabe
|
Tucson Ballroom & Prefunction Space 88 | |
|
A framework for real-time Surgical Phase Recognition with application to Robot-Assisted Partial Nephrectomy
Poster Session 1
Marco Mezzina ⋅ Tom Vercauteren ⋅ Tinne Tuytelaars ⋅ Matthew Blaschko
|
Tucson Ballroom & Prefunction Space 24 | |
|
4D-Animal: Freely Reconstructing Animatable 3D Animals from Videos
Poster Session 1
Shanshan Zhong ⋅ Jiawei Peng ⋅ Zehan Zheng ⋅ Zhongzhan Huang ⋅ Wufei Ma ⋅ Guofeng Zhang ⋅ Qihao Liu ⋅ Alan Yuille ⋅ Jieneng Chen
|
Tucson Ballroom & Prefunction Space 58 | |
|
A Novel Metric for Detecting Memorization in Generative Models for Brain MRI Synthesis
Poster Session 3
Antonio Scardace ⋅ Lemuel Puglisi ⋅ Francesco Guarnera ⋅ Sebastiano Battiato ⋅ Daniele Ravi
|
Tucson Ballroom & Prefunction Space 91 | |
|
VIZOR: Viewpoint-Invariant Zero-Shot Scene Graph Generation for 3D Scene Reasoning
Poster Session 6 + Refreshments
Madhavaram Vivek Vardhan ⋅ Vartika Sengar ⋅ Arkadipta De ⋅ Charu Sharma
|
Tucson Ballroom & Prefunction Space 131 | |
|
Fused Similarity Measure Based Alignment with Dual-Scale Adaptive Selection for Weakly Supervised Video Anomaly Detection
Poster Session 3
Yuegao Lu ⋅ Hong-Jie Xing ⋅ Chun-Guo Li
|
Tucson Ballroom & Prefunction Space 26 | |
|
Distilling Diversity and Control in Diffusion Models
Poster Session 1
Rohit Gandikota ⋅ David Bau
|
Tucson Ballroom & Prefunction Space 125 | |
|
Automated Pore Detection from In-Situ FDM 3D Printing Video: A Comparative Evaluation of Modern Segmentation Models
Poster Session 4 + Reception
Abdullah Al Ahad Khan ⋅ Md Islam ⋅ Lin Li ⋅ Lai Jiang ⋅ Noushin Ghaffari
|
Tucson Ballroom & Prefunction Space 37 | |
|
Better Safe Than Sorry? Overreaction Problem of Vision Language Models in Visual Emergency Recognition
Poster Session 4 + Reception
Dasol Choi ⋅ Seunghyun Lee ⋅ Youngsook Song
|
Tucson Ballroom & Prefunction Space 42 | |
|
FastHMR: Accelerating Human Mesh Recovery via Token and Layer Merging with Diffusion Decoding
Poster Session 5
Soroush Mehraban ⋅ Andrea Iaboni ⋅ Babak Taati
|
Tucson Ballroom & Prefunction Space 89 | |
|
Latent Uncertainty-Aware Multi-View SDF Scan Completion
Poster Session 3
Faezeh Zakeri ⋅ Lukas Ruppert ⋅ Raphael Braun ⋅ Hendrik Lensch
|
Tucson Ballroom & Prefunction Space 61 | |
|
SCALEX: Scalable Concept and Latent Exploration for Diffusion Models
Poster Session 3
Emily Zhixuan Zeng ⋅ Yuhao Chen ⋅ Alexander Wong
|
Tucson Ballroom & Prefunction Space 67 | |
|
Gen-AFFECT: Generation of Avatar Fine-grained Facial Expressions with Consistent identiTy
Poster Session 3
Hao Yu ⋅ Rupayan Mallick ⋅ Margrit Betke ⋅ Sarah Bargal
|
Tucson Ballroom & Prefunction Space 93 | |
|
UnderWater SLAM with Laser-light sectioning method using ST-GAT
Poster Session 1
Heyang Gao ⋅ Kazuto Ichimaru ⋅ Takafumi Iwaguchi ⋅ Hiroshi Kawasaki
|
Tucson Ballroom & Prefunction Space 9 | |
|
START: Spatial and Textual Learning for Chart Understanding
Poster Session 6 + Refreshments
Zhuoming Liu ⋅ Xiaofeng Gao ⋅ Feiyang Niu ⋅ Qiaozi Gao ⋅ Liu Liu ⋅ Robinson Piramuthu
|
Tucson Ballroom & Prefunction Space 90 | |
|
Co-STAR: Collaborative Curriculum Self-Training with Adaptive Regularization for Source-Free Video Domain Adaptation
Poster Session 6 + Refreshments
Amirhossein Dadashzadeh ⋅ Parsa Esmati ⋅ Majid Mirmehdi
|
Tucson Ballroom & Prefunction Space 59 | |
|
PDV: Prompt Directional Vectors for Zero-shot Composed Image Retrieval
Poster Session 6 + Refreshments
Osman Tursun ⋅ Sinan Kalkan ⋅ Simon Denman ⋅ Clinton Fookes
|
Tucson Ballroom & Prefunction Space 51 | |
|
How to Design and Train Your Implicit Neural Representation for Video Compression
Poster Session 1
Matthew Gwilliam ⋅ Roy Zhang ⋅ Namitha Padmanabhan ⋅ Hongyang Du ⋅ Abhinav Shrivastava
|
Tucson Ballroom & Prefunction Space 70 | |
|
Chain-of-Look Spatial Reasoning for Dense Surgical Instrument Counting
Poster Session 6 + Refreshments
Rishikesh Bhyri ⋅ Brian Quaranto ⋅ Junsong Yuan ⋅ Peter Kim ⋅ Nan Xi
|
Tucson Ballroom & Prefunction Space 125 | |
|
From Darkness to Detail: Frequency-Aware SSMs for Low-Light Vision
Poster Session 5
Eashan Adhikarla ⋅ Kai Zhang ⋅ Gong Chen ⋅ John Nicholson ⋅ Brian Davison
|
Tucson Ballroom & Prefunction Space 110 | |
|
Global Focal and Radial Distortion Averaging from Radial Fundamental Matrices for Robust Self-Calibration
Poster Session 4 + Reception
Sergei Solonets ⋅ Daniil Sinitsyn ⋅ Daniel Cremers
|
Tucson Ballroom & Prefunction Space 47 | |
|
Hymavi : A Hybrid Mamba-Attention Network in Multi-View Framework for Volumetric Medical Image Segmentation
Poster Session 5
Sy Tran ⋅ Jin Kyu Gahm
|
Tucson Ballroom & Prefunction Space 20 | |
|
OpenLVLM-MIA: A Controlled Benchmark Revealing the Limits of Membership Inference Attacks on Large Vision-Language Models
Poster Session 2 + Refreshments
Miyamoto Ryoto ⋅ Xin Fan ⋅ Fuyuko Kido ⋅ Tsuneo Matsumoto ⋅ Hayato Yamana
|
Tucson Ballroom & Prefunction Space 120 | |
|
Beyond Faces: A Multimodal Person Clustering for Unconstrained Environments
Poster Session 4 + Reception
Sahngmin Yoo ⋅ Sangwon Lee ⋅ Seongin Jo
|
Tucson Ballroom & Prefunction Space 33 | |
|
Fetal and Neonatal Cortical Surface Reconstruction with Anatomical Normal-guidance and Perceptual Enhancements
Poster Session 6 + Refreshments
Jiyang Lee ⋅ Woori Bae ⋅ U-Geun Ji ⋅ Hanyeol Yang ⋅ Jong-Min Lee
|
Tucson Ballroom & Prefunction Space 53 | |
|
Mitigating the Modality Gap: Few-Shot Out-of-Distribution Detection with Multi-modal Prototypes and Image Bias Estimation
Poster Session 2 + Refreshments
Yimu Wang ⋅ Evelien Riddell ⋅ Adrian Chow ⋅ Sean Sedwards ⋅ Krzysztof Czarnecki
|
Tucson Ballroom & Prefunction Space 126 | |
|
SPOC: Spatially-Progressing Object State Change Segmentation in Video
Poster Session 3
Priyanka Mandikal ⋅ Tushar Nagarajan ⋅ Alex Stoken ⋅ Zihui Xue ⋅ Kristen Grauman
|
Tucson Ballroom & Prefunction Space 56 | |
|
FAST-EQA: Efficient Embodied Question Answering with Global and Local Region Relevancy
Poster Session 2 + Refreshments
Haochen Zhang ⋅ Nirav Savaliya ⋅ Faizan Siddiqui ⋅ Enna Sachdeva
|
Tucson Ballroom & Prefunction Space 24 | |
|
Mobile-Oriented Video Diffusion: Enabling Text-to-Video Generation on Mobile Devices Without Retraining, Compression, or Pruning
Poster Session 3
Bosung Kim ⋅ Kyuhwan Lee ⋅ Isu Jeong ⋅ Jungmin Cheon ⋅ Yeojin Lee ⋅ Seulki Lee
|
Tucson Ballroom & Prefunction Space 100 | |
|
Understanding Generative AI Capabilities in Everyday Image Editing Tasks
Poster Session 2 + Refreshments
Brandon Collins ⋅ Mohammad Reza Taesiri ⋅ Logan Bolton ⋅ Viet Lai ⋅ Franck Dernoncourt ⋅ Trung Bui ⋅ Anh Nguyen
|
Tucson Ballroom & Prefunction Space 78 | |
|
TalkingPose: Efficient Face and Gesture Animation with Feedback-guided Diffusion Model
Poster Session 3
Alireza Javanmardi ⋅ Pragati Jaiswal ⋅ Tewodros Habtegebrial ⋅ Christen Millerdurai ⋅ Shaoxiang Wang ⋅ Alain Pagani ⋅ Didier Stricker
|
Tucson Ballroom & Prefunction Space 17 | |
|
Conversational Image Generation: Towards Multi-Round Personalized Generation with Multi-Modal Language Models
Poster Session 6 + Refreshments
Haochen Zhang ⋅ Animesh Sinha ⋅ Felix Juefei-Xu ⋅ Haoyu Ma ⋅ Kunpeng Li ⋅ Zhipeng Fan ⋅ Xiaoliang Dai ⋅ Tingbo Hou ⋅ Peizhao Zhang ⋅ Zecheng He
|
Tucson Ballroom & Prefunction Space 102 | |
|
UniCalib: Targetless LiDAR-camera Calibration via Probabilistic Flow on Unified Depth Representations
Poster Session 2 + Refreshments
Shu Han ⋅ Xubo Zhu ⋅ Ji Wu ⋅ Ximeng Cai ⋅ Wen Yang ⋅ Huai Yu ⋅ Gui-Song Xia
|
Tucson Ballroom & Prefunction Space 47 | |
|
DOODLE: Diffusion-based Out-of-Distribution Learning for Open-set LiDAR Semantic Segmentation
Poster Session 2 + Refreshments
Changgyoon Oh ⋅ Hyeonseong Kim ⋅ Daehyun We ⋅ Jongoh Jeong ⋅ Yujeong Chae ⋅ Kuk-Jin Yoon
|
Tucson Ballroom & Prefunction Space 82 | |
|
Logit-Adjusted Test-Time Adaptation under Partial Class Imbalance
Poster Session 5
Thilina Weerasinghe ⋅ Ruwan Tennakoon ⋅ WeiQin Chuah ⋅ Alireza Bab-Hadiashar
|
Tucson Ballroom & Prefunction Space 17 | |
|
Conditional Text-to-Image Generation with Reference Guidance
Poster Session 2 + Refreshments
Taewook Kim ⋅ Ze Wang ⋅ Zhengyuan Yang ⋅ Jiang Wang ⋅ Lijuan Wang ⋅ Zicheng Liu ⋅ Qiang Qiu
|
Tucson Ballroom & Prefunction Space 139 | |
|
From SAM to DINOv2: Towards Distilling Foundation Models to Lightweight Baselines for Generalized Polyp Segmentation
Poster Session 2 + Refreshments
Shivanshu Agnihotri ⋅ Snehashis Majhi ⋅ Deepak Nayak ⋅ Debesh Jha
|
Tucson Ballroom & Prefunction Space 33 | |
|
Leveraging Pretrained Representations for Cross-Modal Point Cloud Completion
Poster Session 1
Kshitij Kale ⋅ Hrishikesh U ⋅ V Sreenidhe ⋅ Shylaja S
|
Tucson Ballroom & Prefunction Space 10 | |
|
RPT-SR: Regional Prior attention Transformer for infrared image Super-Resolution
Poster Session 4 + Reception
Youngwan Jin ⋅ Incheol Park ⋅ Yagiz Nalcakan ⋅ Hyeongjin Ju ⋅ Sang Yeo ⋅ Shiho Kim
|
Tucson Ballroom & Prefunction Space 86 | |
|
CropAT: Leveraging Diffusion-Generated Target-Like Cropped Objects for Pseudo-Label Refinement in Domain-Adaptive Object Detection
Poster Session 4 + Reception
Chen-Che Huang ⋅ Tzuhsuan Huang ⋅ Jun-Cheng Chen
|
Tucson Ballroom & Prefunction Space 32 | |
|
ArchitectHead: Continuous Level of Detail Control for 3D Gaussian Head Avatars
Poster Session 2 + Refreshments
Peizhi Yan ⋅ Rabab Ward ⋅ Qiang Tang ⋅ Shan Du
|
Tucson Ballroom & Prefunction Space 21 | |
|
TimeRefine: Temporal Grounding with Time Refining Video LLM
Poster Session 4 + Reception
Xizi Wang ⋅ Feng Cheng ⋅ Ziyang Wang ⋅ Huiyu Wang ⋅ Md Mohaiminul Islam ⋅ Lorenzo Torresani ⋅ Mohit Bansal ⋅ Gedas Bertasius ⋅ David Crandall
|
Tucson Ballroom & Prefunction Space 75 | |
|
Reviving Unsupervised Optical Flow: Concept Reevaluation, Multi-Scale Advances and Full Open-Source Release
Poster Session 2 + Refreshments
Azin Jahedi ⋅ Marc Rivinius ⋅ Noah Senn ⋅ Andres Bruhn
|
Tucson Ballroom & Prefunction Space 12 | |
|
EllipssianNet: Image-guided Sampling of 2D Gaussians for Gaussian Splatting
Poster Session 2 + Refreshments
MyoungGon Kim ⋅ JeongHyeon Ahn ⋅ Seohyeon Park ⋅ Hyemi Kim ⋅ Seunghyun Park ⋅ Jung Hwang ⋅ JungHyun Han
|
Tucson Ballroom & Prefunction Space 66 | |
|
MaxInfo: A Training-Free Key-Frame Selection Method Using Maximum Volume for Enhanced Video Understanding
Poster Session 5
Pengyi Li ⋅ Irina Abdullaeva ⋅ Alexander Gambashidze ⋅ Andrei Kuznetsov ⋅ Ivan Oseledets
|
Tucson Ballroom & Prefunction Space 133 | |
|
Splatter Layout: Geometry-embedded 3D Reconstruction via Surface Unfolding
Poster Session 6 + Refreshments
Bryan Heryanto ⋅ Tackgeun You ⋅ Chanwoo Kim ⋅ Hwasup Lim
|
Tucson Ballroom & Prefunction Space 49 | |
|
Relevance-aware Multi-context Contrastive Decoding for Retrieval-augmented Visual Question Answering
Poster Session 6 + Refreshments
Jongha Kim ⋅ Byungoh Ko ⋅ Jeehye Na ⋅ Jinsung Yoon ⋅ Hyunwoo Kim
|
Tucson Ballroom & Prefunction Space 132 | |
|
Unsupervised Discovery of Long-Term Spatiotemporal Periodic Workflows in Human Activities
Poster Session 5
Fan Yang ⋅ Quanting Xie ⋅ Atsunori Moteki ⋅ Shoichi Masui ⋅ Shan Jiang ⋅ Kanji Uchino ⋅ Yonatan Bisk ⋅ Graham Neubig
|
Tucson Ballroom & Prefunction Space 3 | |
|
Ordinal-Aware Multimodal Engagement Recognition for Collaborative Learning
Poster Session 2 + Refreshments
Nha Tran ⋅ Dat Ly ⋅ Phi Ta ⋅ Hung Nguyen ⋅ Hien Nguyen
|
Tucson Ballroom & Prefunction Space 96 | |
|
MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities
Poster Session 6 + Refreshments
Tooba Tehreem Sheikh ⋅ Jean Lahoud ⋅ Rao Anwer ⋅ Fahad Khan ⋅ Salman Khan ⋅ Hisham Cholakkal
|
Tucson Ballroom & Prefunction Space 135 | |
|
Dragonite: Single-Step Drag-based Image Editing with Geometric-Semantic Guidance
Poster Session 3
Meng-Ting Jhong ⋅ Tai-Ming Huang ⋅ Shang-Fu Chen ⋅ Wen-Huang Cheng ⋅ Kailung Hua
|
Tucson Ballroom & Prefunction Space 13 | |
|
Action Anticipation at a Glimpse: To What Extent Can Multimodal Cues Replace Video?
Poster Session 1
Manuel Benavent-Lledo ⋅ Konstantinos Bacharidis ⋅ Victoria Manousaki ⋅ Konstantinos Papoutsakis ⋅ Antonis Argyros ⋅ José García-Rodríguez
|
Tucson Ballroom & Prefunction Space 27 | |
|
NERVE: Neighbourhood & Entropy-Guided Random-Walk for Training Free Open-Vocabulary Segmentation
Poster Session 3
KUNAL MAHATHA ⋅ Jose Dolz ⋅ Christian Desrosiers
|
Tucson Ballroom & Prefunction Space 31 | |
|
2S-CEDiff: A Two-Stage Diffusion Framework for Generating High-Fidelity Contrast-Enhanced CT Images from Non-Contrast Scans
Poster Session 3
Yi-Bang Wu ⋅ Tzung-Dau Wang ⋅ Shang-Hong Lai
|
Tucson Ballroom & Prefunction Space 96 | |
|
INRetouch: Context Aware Implicit Neural Representation for Photography Retouching
Poster Session 4 + Reception
Omar Elezabi ⋅ Marcos Conde ⋅ Zongwei Wu ⋅ Radu Timofte
|
Tucson Ballroom & Prefunction Space 122 | |
|
Optimization-Free Style Transfer for 3D Gaussian Splats
Poster Session 6 + Refreshments
Raphael DuSablon ⋅ David Hart
|
Tucson Ballroom & Prefunction Space 80 | |
|
Streaming Real-Time Trajectory Prediction Using Endpoint-Aware Modeling
Poster Session 3
Alexander Prutsch ⋅ David Schinagl ⋅ Horst Possegger
|
Tucson Ballroom & Prefunction Space 134 | |
|
Performance of Conformal Prediction in Capturing Aleatoric Uncertainty
Poster Session 3
Misgina Tsighe Hagos ⋅ Claes Lundström
|
Tucson Ballroom & Prefunction Space 4 | |
|
Distilling Offline Action Detection Models into Real-Time Streaming Models
Poster Session 5
Deep Patel ⋅ Yasunori Babazazki ⋅ YASUTO NAGASE ⋅ Iain Melvin ⋅ Martin Min
|
Tucson Ballroom & Prefunction Space 39 | |
|
Multi-Modal Soccer Scene Analysis with Masked Pre-Training
Poster Session 3
Marc Peral ⋅ Guillem Capellera ⋅ Luis Ferraz ⋅ Antonio Romano ⋅ Antonio Agudo
|
Tucson Ballroom & Prefunction Space 59 | |
|
GroupPortrait: Multi-ID Portrait Generation with High Identity Preservation and Fine-Grained Control
Poster Session 5
Meijia Huang ⋅ Ruida Li ⋅ Bing Ma ⋅ Liangwei Jiang ⋅ Shuo Fang ⋅ Chenguang Ma
|
Tucson Ballroom & Prefunction Space 41 | |
|
From Prompt to Production: Automating Brand-Safe Marketing Imagery with Text-to-Image Models
Poster Session 5
Parmida Atighehchain ⋅ Henry Wang ⋅ Andrei Kapustin ⋅ Boris Lerner ⋅ Tiancheng Jiang ⋅ Taylor Jensen ⋅ Negin Sokhandan
|
Tucson Ballroom & Prefunction Space 97 | |
|
GateFusion: Hierarchical Gated Cross-Modal Fusion for Active Speaker Detection
Poster Session 1
Yu Wang ⋅ Juhyung Ha ⋅ Frangil Ramirez ⋅ Yuchen Wang ⋅ David Crandall
|
Tucson Ballroom & Prefunction Space 103 | |
|
MemeTAG: Keyword-Driven Meme Classification through Tag Embedding Reconstruction
Poster Session 6 + Refreshments
Akshit Sharma ⋅ Prashant Patil
|
Tucson Ballroom & Prefunction Space 46 | |
|
IDEAL-M3D: Instance Diversity-Enriched Active Learning for Monocular 3D Detection
Poster Session 1
Johannes Meier ⋅ Florian Günther ⋅ Riccardo Marin ⋅ Oussema Dhaouadi ⋅ Jacques Kaiser ⋅ Daniel Cremers
|
Tucson Ballroom & Prefunction Space 18 | |
|
Gene-DML: Dual-Pathway Multi-Level Discrimination for Gene Expression Prediction from Histopathology Images
Poster Session 4 + Reception
Yaxuan Song ⋅ Jianan Fan ⋅ Hang Chang ⋅ Weidong Cai
|
Tucson Ballroom & Prefunction Space 77 | |
|
SceneEdited: A City-Scale Benchmark for 3D HD Map Updating via Image-Guided Change Detection
Poster Session 5
Chun-Jung Lin ⋅ Tat-Jun Chin ⋅ Sourav Garg ⋅ Feras Dayoub
|
Tucson Ballroom & Prefunction Space 51 | |
|
Sketch2Stitch: GANs for Abstract Sketch-Based Dress Synthesis
Poster Session 2 + Refreshments
Faizan Khan ⋅ Faizan Khan ⋅ Davide Morelli ⋅ Marcella Cornia ⋅ Rita Cucchiara ⋅ Mohamed Elhoseiny
|
Tucson Ballroom & Prefunction Space 76 | |
|
Mixed Diffusion for 3D Indoor Scene Synthesis
Poster Session 1
Siyi Hu ⋅ Diego Martín Arroyo ⋅ Stephanie Debats ⋅ Fabian Manhardt ⋅ Luca Carlone ⋅ Federico Tombari
|
Tucson Ballroom & Prefunction Space 121 | |
|
Predicting Task fMRI Contrasts from Resting-State fMRI Using Sparse 3D Convolutions
Poster Session 5
Ivan Sviridov ⋅ Maria Boyko ⋅ Maksim Sharaev
|
Tucson Ballroom & Prefunction Space 50 | |
|
FreeCond: Free Lunch in the Input Conditions of Text-Guided Inpainting
Poster Session 4 + Reception
Teng-Fang Hsiao ⋅ Bo-Kai Ruan ⋅ Sung-Lin Tsai ⋅ Yi-Lun Wu ⋅ Hong-Han Shuai
|
Tucson Ballroom & Prefunction Space 116 | |
|
Leveraging Semantic Attribute Binding for Free-Lunch Color Control in Diffusion Models
Poster Session 6 + Refreshments
Héctor Laria ⋅ Alexandra Gomez-Villa ⋅ Jiang Qin ⋅ Muhammad Atif Butt ⋅ Bogdan Raducanu ⋅ Javier Vazquez-Corral ⋅ Joost van de Weijer ⋅ Kai Wang
|
Tucson Ballroom & Prefunction Space 47 | |
|
Unified Alignment Protocol: Making Sense of the Unlabeled Data in New Domains
Poster Session 3
Sabbir Ahmed ⋅ Mamshad Nayeem Rizve ⋅ Abdullah Al Arafat ⋅ Jacqueline Liu ⋅ Rahim Hossain ⋅ Mohaiminul Nahian ⋅ Adnan Siraj Rakin
|
Tucson Ballroom & Prefunction Space 6 | |
|
AFRAgent : An Adaptive Feature Renormalization Based High Resolution Aware GUI agent
Poster Session 1
Neeraj Anand ⋅ Rishabh Jain ⋅ Sohan Patnaik ⋅ Balaji Krishnamurthy ⋅ Mausoom Sarkar
|
Tucson Ballroom & Prefunction Space 110 | |
|
Reconstructing Realistic and Relightable Eyes
Poster Session 2 + Refreshments
Wesley Khademi ⋅ Jogendra Nath Kundu ⋅ Yatong An ⋅ Alexander Fix ⋅ David Colmenares
|
Tucson Ballroom & Prefunction Space 79 | |
|
MomentMix Augmentation with Length-Aware DETR for Temporally Robust Moment Retrieval
Poster Session 1
Seojeong Park ⋅ Jiho Choi ⋅ Kyungjune Baek ⋅ Hyunjung Shim
|
Tucson Ballroom & Prefunction Space 108 | |
|
Learning Group Actions In Disentangled Latent Image Representations
Poster Session 3
Farhana Hossain Swarnali ⋅ Miaomiao Zhang ⋅ TONMOY HOSSAIN
|
Tucson Ballroom & Prefunction Space 21 | |
|
DMAT: An End-to-End Framework for Joint Atmospheric Turbulence Mitigation and Object Detection
Poster Session 2 + Refreshments
Paul Hill ⋅ Zhiming Liu ⋅ Alin Achim ⋅ David Bull ⋅ Nantheera Anantrasirichai
|
Tucson Ballroom & Prefunction Space 121 | |
|
Context-Preserving Dermoscopic Editing: Mask-Guided Lesion-Aware Diffusion for Attribute Modification
Poster Session 4 + Reception
Tao Sun ⋅ Yun Jiang ⋅ Yarong Jin ⋅ Huanting Guo ⋅ Zequn Zhang
|
Tucson Ballroom & Prefunction Space 103 | |
|
SceneShine: Illumination-aware Human Scene Gaussian Re-Splatting from Mobile Device Video
Poster Session 6 + Refreshments
Xuqian Ren ⋅ Wenjia Wang ⋅ Mai Nguyen ⋅ Juho Kannala ⋅ Esa Rahtu
|
Tucson Ballroom & Prefunction Space 104 | |
|
WarpRF: Multi-View Consistency for Training-Free Uncertainty Quantification and Applications in Radiance Fields
Poster Session 4 + Reception
Sadra Safadoust ⋅ Fabio Tosi ⋅ Fatma Güney ⋅ Matteo Poggi
|
Tucson Ballroom & Prefunction Space 90 | |
|
ChameleonTuner: Automatic ISP Color Tuning in Subjective Scenarios
Poster Session 1
Zijie Tan ⋅ Yuxin Yue ⋅ Bahador Rashidi
|
Tucson Ballroom & Prefunction Space 29 | |
|
Sketch-guided Cage-based 3D Gaussian Splatting Deformation
Poster Session 3
Tianhao Xie ⋅ Noam Aigerman ⋅ Eugene Belilovsky ⋅ Tiberiu Popa
|
Tucson Ballroom & Prefunction Space 71 | |
|
AugMapNet: Improving Spatial Latent Structure via BEV Grid Augmentation for Enhanced Vectorized Online HD Map Construction
Poster Session 6 + Refreshments
Thomas Monninger ⋅ Md Zafar Anwar ⋅ Stanislaw Antol ⋅ Steffen Staab ⋅ Sihao Ding
|
Tucson Ballroom & Prefunction Space 127 | |
|
DiT-VTON: Diffusion Transformer Framework for Unified Multi-Category Virtual Try-On and Virtual Try-All with Integrated Image Editing
Poster Session 1
Qi Li ⋅ Shuwen Qiu ⋅ Kee Kiat Koo ⋅ Julien Han ⋅ Karim Bouyarmane
|
Tucson Ballroom & Prefunction Space 20 | |
|
Denoise, Divide, Distill, and Predict (D3P): Towards Forecasting Long-horizon Real-world Anomaly from Normalcy
Poster Session 5
Quentin Mérilleau ⋅ Snehashis Majhi ⋅ Antitza Dantcheva ⋅ Quan Kong ⋅ Lorenzo Garattoni ⋅ Gianpiero Francesca ⋅ Francois Bremond
|
Tucson Ballroom & Prefunction Space 43 | |
|
Efficient Vision Transformers via Token Merging with Head-wise Attention Correction
Poster Session 3
Yuki Ichikawa ⋅ Masato Motomura ⋅ Thiem Chu ⋅ Daichi Fujiki
|
Tucson Ballroom & Prefunction Space 95 | |
|
Splannequin: Freezing Monocular Mannequin-Challenge Footage with Dual-Detection Splatting
Poster Session 6 + Refreshments
Hao-Jen Chien ⋅ Yi-Chuan Huang ⋅ Chung-Ho Wu ⋅ Wei-Lun Chao ⋅ Yu-Lun Liu
|
Tucson Ballroom & Prefunction Space 79 | |
|
MixER: From Cross-Modal to Mixed-Modal Visible-Infrared Re-Identification
Poster Session 3
Alehdaghi ⋅ Rajarshi Bhattacharya ⋅ Dai Yannick ⋅ Pourya Shamsolmoali ⋅ Rafael M. O. Cruz ⋅ Eric Granger
|
Tucson Ballroom & Prefunction Space 49 | |
|
BiNAR: A Bi-Modal Framework for Non-Aligned RGB-IR 3D Reconstruction via Gaussian Splatting
Poster Session 4 + Reception
Zhongwen Wang ⋅ Han Ling ⋅ Weihao Zhang ⋅ Yinghui Sun ⋅ Quansen Sun
|
Tucson Ballroom & Prefunction Space 12 | |
|
Uncertainty-Aware Vision-Language Segmentation for Medical Imaging
Poster Session 6 + Refreshments
Aryan Das ⋅ Tanishq Rachamalla ⋅ Koushik Biswas ⋅ Swalpa Roy ⋅ Vinay Verma
|
Tucson Ballroom & Prefunction Space 122 | |
|
Cluster-based Pseudo-labeling for Semi-Supervised LiDAR Semantic Segmentation
Poster Session 1
Qingju Guo ⋅ Shuang Li ⋅ Jing Geng ⋅ Binhui Xie ⋅ Jiawei Shan ⋅ Wei Li
|
Tucson Ballroom & Prefunction Space 60 | |
|
Semantic Map Guided Bird's-Eye View Learning for Online HD Map Construction
Poster Session 6 + Refreshments
Huantao Ren ⋅ Hesham Eraqi ⋅ ABM Musa ⋅ Mohamed Moustafa
|
Tucson Ballroom & Prefunction Space 62 | |
|
SilverLining: Data-First Mitigation of Spatial and Spectral Shortcuts Without Introducing New Confounders
Poster Session 1
Balagopal Unnikrishnan ⋅ Michael Brudno ⋅ Chris McIntosh
|
Tucson Ballroom & Prefunction Space 124 | |
|
HyPCA-Net: Advancing Multimodal Fusion in Medical Image Analysis
Poster Session 2 + Refreshments
Joy Dhar ⋅ Manish Pandey ⋅ Debashis Das Chakladar ⋅ Maryam Haghighat ⋅ Azadeh Alavi ⋅ Sajib Mistry ⋅ Nayyar Zaidi
|
Tucson Ballroom & Prefunction Space 40 | |
|
Causality-Driven Audits of Model Robustness
Poster Session 5
Nathan Drenkow ⋅ William Paul ⋅ Christopher Ribaudo ⋅ Mathias Unberath
|
Tucson Ballroom & Prefunction Space 15 | |
|
KD360-VoxelBEV: LiDAR and 360-degree Camera Cross Modality Knowledge Distillation for Bird’s-Eye-View Segmentation
Poster Session 3
Wenke E ⋅ Yixin Sun ⋅ Jiaxu Liu ⋅ Hubert P. H. Shum ⋅ Amir Atapour-Abarghouei ⋅ Toby Breckon
|
Tucson Ballroom & Prefunction Space 54 | |
|
Universal Neural Architecture Space: Covering ConvNets, Transformers and Everything in Between
Poster Session 3
Ondrej Tybl ⋅ Lukas Neumann
|
Tucson Ballroom & Prefunction Space 73 | |
|
FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs
Poster Session 1
Carlos Plou ⋅ Cesar Borja ⋅ Ruben Martinez-Cantin ⋅ Ana Murillo
|
Tucson Ballroom & Prefunction Space 128 | |
|
SmokeBench: Evaluating Multimodal Large Language Models for Wildfire Smoke Detection
Poster Session 1
Tianye Qi ⋅ Weihao Li ⋅ Nick Barnes
|
Tucson Ballroom & Prefunction Space 100 | |
|
Anatomically-guided masked autoencoder pre-training for aneurysm detection
Poster Session 4 + Reception
Alberto Mario Ceballos Arroyo ⋅ Jisoo Kim ⋅ Chu-Hsuan Lin ⋅ Lei Qin ⋅ Geoffrey Young ⋅ Huaizu Jiang
|
Tucson Ballroom & Prefunction Space 135 | |
|
Disentangle and Regularize: Sign Language Production with Articulator-Based Disentanglement and Channel-Aware Regularization
Poster Session 6 + Refreshments
Meryem Taşyürek ⋅ Tuğçe Kızıltepe ⋅ Hacer Keles
|
Tucson Ballroom & Prefunction Space 119 | |
|
AuthGuard: Generalizable Deepfake Detection via Language Guidance
Poster Session 5
Guangyu Shen ⋅ Zhihua Li ⋅ Xiang Xu ⋅ Tianchen Zhao ⋅ Zheng Zhang ⋅ DONGSHENG An ⋅ Zhuowen Tu ⋅ Yifan Xing ⋅ Qin ZHANG
|
Tucson Ballroom & Prefunction Space 40 | |
|
Single-step Diffusion for Image Compression at Ultra-Low Bitrates
Poster Session 5
Chanung Park ⋅ Joo Chan Lee ⋅ Jong Hwan Ko
|
Tucson Ballroom & Prefunction Space 57 | |
|
Odo: Depth-Guided Diffusion for Identity-Preserving Body Reshaping
Poster Session 1
Siddharth Khandelwal ⋅ Sridhar Kamath ⋅ Arjun Jain
|
Tucson Ballroom & Prefunction Space 3 | |
|
Color Bind: Exploring Color Perception in Text-to-Image Models
Poster Session 2 + Refreshments
Shay Shomer-Chai ⋅ Wenxuan Peng ⋅ Bharath Hariharan ⋅ Hadar Averbuch-Elor
|
Tucson Ballroom & Prefunction Space 48 | |
|
DenseBEV: Transforming BEV Grid Cells into 3D Objects
Poster Session 2 + Refreshments
Marius Dähling ⋅ Sebastian Krebs ⋅ J. Zöllner
|
Tucson Ballroom & Prefunction Space 91 | |
|
MIST: Multilingual Incidental Dataset for Scene Text Detection
Poster Session 6 + Refreshments
Saumya Vijay Mundra ⋅ Ajoy Mondal ⋅ Jawahar CV
|
Tucson Ballroom & Prefunction Space 44 | |
|
NeuroBridge: Few-Shot Cross-Modal Neuron Re-identification via Dual-Channel Deep Metric Learning
Poster Session 6 + Refreshments
Wenwei Li ⋅ Mingwei Liao ⋅ Lingyi Cai ⋅ Anan LI
|
Tucson Ballroom & Prefunction Space 139 | |
|
General and Domain-Specific Zero-shot Detection of Generated Images via Conditional Likelihood
Poster Session 6 + Refreshments
Roy Betser ⋅ Omer Hofman ⋅ Roman Vainshtein ⋅ Guy Gilboa
|
Tucson Ballroom & Prefunction Space 58 | |
|
Model-free Domain Adaptation for Concealed Multimodal Large-Language Models
Poster Session 1
Yu Mitsuzumi ⋅ Akisato Kimura ⋅ Hisashi Kashima
|
Tucson Ballroom & Prefunction Space 118 | |
|
Autoregressive Styled Text Image Generation, but Make it Reliable
Poster Session 3
Carmine Zaccagnino ⋅ Fabio Quattrini ⋅ Vittorio Pippi ⋅ Silvia Cascianelli ⋅ Alessio Tonioni ⋅ Rita Cucchiara
|
Tucson Ballroom & Prefunction Space 72 | |
|
Perception-Inspired Color Space Design for Photo White Balance Editing
Poster Session 3
Yang Cheng ⋅ Ziteng Cui ⋅ Lin Gu ⋅ Shenghan Su ⋅ Zenghui Zhang
|
Tucson Ballroom & Prefunction Space 79 | |
|
Beyond Realism: Learning the Art of Expressive Composition with StickerNet
Poster Session 1
Haoming Lu ⋅ David Kocharian ⋅ Humphrey Shi
|
Tucson Ballroom & Prefunction Space 83 | |
|
RobustGait: Robustness Analysis for Appearance Based Gait Recognition
Poster Session 2 + Refreshments
Reeshoon Sayera ⋅ Akash Kumar ⋅ Sirshapan Mitra ⋅ Prudvi Kamtam ⋅ Yogesh Rawat
|
Tucson Ballroom & Prefunction Space 107 | |
|
SHaSaM: Submodular Hard Sample Mining for Fair Facial Attribute Recognition
Poster Session 6 + Refreshments
Anay Majee ⋅ Rishabh Iyer
|
Tucson Ballroom & Prefunction Space 25 | |
|
MagicDrive3D: Controllable 3D Generation for Any-View Rendering in Street Scenes
Poster Session 4 + Reception
Ruiyuan Gao ⋅ Kai Chen ⋅ Zhihao Li ⋅ Lanqing HONG ⋅ Zhenguo Li ⋅ Qiang Xu
|
Tucson Ballroom & Prefunction Space 138 | |
|
Cosine Similarity is Almost All You Need (for Prototypical-Part Models)
Poster Session 2 + Refreshments
Luke Moffett ⋅ Frank Willard ⋅ Maximillian Machado ⋅ Emmanuel Mokel ⋅ Jon Donnelly ⋅ Zhicheng Guo ⋅ Adam Costarino ⋅ Julia Yang ⋅ Giyoung Kim ⋅ Alina Barnett ⋅ Cynthia Rudin
|
Tucson Ballroom & Prefunction Space 17 | |
|
M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models
Poster Session 1
Hongyu Wang ⋅ Jiayu Xu ⋅ Senwei Xie ⋅ Ruiping Wang ⋅ Jialin Li ⋅ Zhaojie Xie ⋅ Bin Zhang ⋅ Chuyan Xiong ⋅ Xilin CHEN
|
Tucson Ballroom & Prefunction Space 37 | |
|
Data-Driven Lipschitz Continuity: A Cost-Effective Approach to Improve Adversarial Robustness
Poster Session 1
Erh-Chung Chen ⋅ Pin-Yu Chen ⋅ I-Hsin Chung ⋅ Che-Rung Lee
|
Tucson Ballroom & Prefunction Space 67 | |
|
Q-Former Autoencoder: A Modern Framework for Medical Anomaly Detection
Poster Session 6 + Refreshments
Francesco Dalmonte ⋅ Emirhan Bayar ⋅ Emre Akbas ⋅ Iuliana Georgescu
|
Tucson Ballroom & Prefunction Space 75 | |
|
CAAC: Confidence-Aware Attention Calibration to Reduce Hallucinations in Large Vision-Language Models
Poster Session 1
Mehrdad Fazli ⋅ Bowen Wei ⋅ Ahmet Sari ⋅ Ziwei Zhu
|
Tucson Ballroom & Prefunction Space 119 | |
|
Test Time Adaptation Using Adaptive Quantile Recalibration
Poster Session 5
Paria Mehrbod ⋅ Pedro Vianna ⋅ Geraldin Nanfack ⋅ Guy Wolf ⋅ Eugene Belilovsky
|
Tucson Ballroom & Prefunction Space 18 | |
|
V2XScene: Multi-View Consistent 3D Scene Simulation for Collaborative Perception
Poster Session 5
Yanfei Li ⋅ Yi GONG ⋅ Yuan Zeng
|
Tucson Ballroom & Prefunction Space 74 | |
|
Point2Pose: A Generative Framework for 3D Human Pose Estimation with Multi-View Point Cloud Dataset
Poster Session 5
Hyunsoo Lee ⋅ Daeum Jeon ⋅ Hyeokjae Oh
|
Tucson Ballroom & Prefunction Space 90 | |
|
GeoHSAF: Geometric Hippocampus Shape Analysis Framework for Longitudinal Alzheimer's Disease Classification
Poster Session 2 + Refreshments
MUBARAK OLAOLUWA ⋅ HENI LOUKIL ⋅ Arafet Sbei ⋅ Hassen Drira
|
Tucson Ballroom & Prefunction Space 71 | |
|
Seeing is Believing (and Predicting): Context-Aware Multi-Human Behavior Prediction with Vision Language Models
Poster Session 2 + Refreshments
Utsav Panchal ⋅ Yuchen Liu ⋅ Luigi Palmieri ⋅ Ilche Georgievski ⋅ Marco Aiello
|
Tucson Ballroom & Prefunction Space 52 | |
|
Segmentation-Aware Latent Diffusion for Satellite Image Super-Resolution: Enabling Smallholder Farm Boundary Delineation
Poster Session 2 + Refreshments
Aditi Agarwal ⋅ Anjali Jain ⋅ Nikita Saxena ⋅ Ishan Deshpande ⋅ Michal Kazmierski ⋅ Abigail Annkah ⋅ Nadav Sherman ⋅ Karthikeyan Shanmugam ⋅ Alok Talekar ⋅ Vaibhav Rajan
|
Tucson Ballroom & Prefunction Space 43 | |
|
Learning from Unknown for Open-Set Test-Time Adaptation
Poster Session 3
Taki Hasan Rafi ⋅ Amit Agarwal ⋅ Hitesh Patel ⋅ Dong-Kyu Chae
|
Tucson Ballroom & Prefunction Space 8 | |
|
3D Gaussian Point Encoders
Poster Session 2 + Refreshments
Jim James ⋅ Benjamin Wilson ⋅ Simon Lucey ⋅ James Hays
|
Tucson Ballroom & Prefunction Space 36 | |
|
A Unified Diffusion-Based Framework for Multi-Agent Trajectory Prediction Integrating Structured Multi-Modal Representations
Poster Session 5
Chenxi yang ⋅ Suyang Xi ⋅ Hong Ding ⋅ Yiqing Shen ⋅ Yunhao Liu
|
Tucson Ballroom & Prefunction Space 62 | |
|
PointSt3R: Point Tracking through 3D Ground Correspondence
Poster Session 6 + Refreshments
Rhodri Guerrier ⋅ Adam Harley ⋅ Dima Damen
|
Tucson Ballroom & Prefunction Space 22 | |
|
False Alarm Rectification for Early Smoke Segmentation
Poster Session 2 + Refreshments
Hongjin Zhao ⋅ Weihao Li ⋅ Ge-Peng Ji ⋅ Nick Barnes
|
Tucson Ballroom & Prefunction Space 53 | |
|
Grounding Degradations in Natural Language for All-In-One Video Restoration
Poster Session 4 + Reception
Muhammad Kamran Janjua ⋅ Amirhosein Ghasemabadi ⋅ Kunlin Zhang ⋅ Mohammad Salameh ⋅ Chao Gao ⋅ Di Niu
|
Tucson Ballroom & Prefunction Space 139 | |
|
OracleGS: Grounding Generative Priors for Sparse-View Gaussian Splatting
Poster Session 1
Atakan Topaloğlu ⋅ Kunyi Li ⋅ Michael Niemeyer ⋅ Nassir Navab ⋅ Ahmet Tekalp ⋅ Federico Tombari
|
Tucson Ballroom & Prefunction Space 8 | |
|
CanKD: Cross-Attention-based Non-local operation for Feature-based Knowledge Distillation
Poster Session 6 + Refreshments
Shizhe Sun ⋅ Wataru Ohyama
|
Tucson Ballroom & Prefunction Space 133 | |
|
CLoCKDistill: Consistent Location-and-Context-aware Knowledge Distillation for DETRs
Poster Session 5
Qizhen Lan ⋅ Qing Tian
|
Tucson Ballroom & Prefunction Space 132 | |
|
Spacewalk-18: A Benchmark for Multimodal and Long-form Procedural Video Understanding in Novel Domains
Poster Session 4 + Reception
Zitian Tang ⋅ Rohan Krishnan ⋅ Zhiqiu Yu ⋅ Chen Sun
|
Tucson Ballroom & Prefunction Space 18 | |
|
Countering Multi-modal Representation Collapse through Rank-targeted Fusion
Poster Session 4 + Reception
Seulgi Kim ⋅ Kiran Kokilepersaud ⋅ Mohit Prabhushankar ⋅ Ghassan AlRegib
|
Tucson Ballroom & Prefunction Space 44 | |
|
Fine-grained Defocus Blur Control for Generative Image Models
Poster Session 4 + Reception
Ayush Shrivastava ⋅ Connelly Barnes ⋅ Cecilia Zhang ⋅ Lingzhi Zhang ⋅ Andrew Owens ⋅ Sohrab Amirghodsi ⋅ Eli Shechtman
|
Tucson Ballroom & Prefunction Space 5 | |
|
Lorentz Entailment Cone for Semantic Segmentation
Poster Session 4 + Reception
Zahid Hasan ⋅ Masud Ahmed ⋅ Nirmalya Roy
|
Tucson Ballroom & Prefunction Space 89 | |
|
FNOPT: Resolution-Agnostic, Self-Supervised Cloth Simulation using Meta-Optimization with Fourier Neural Operators
Poster Session 5
Ruochen Chen ⋅ Thuy Tran ⋅ Shaifali Parashar
|
Tucson Ballroom & Prefunction Space 125 | |
|
Gaussian Splatting Map Registration with Orthographic Bird's-Eye-View Renderings
Poster Session 5
Hugo LEBLOND ⋅ Gilles SIMON ⋅ Renato Martins ⋅ Cedric Demonceaux ⋅ Marie-odile Berger
|
Tucson Ballroom & Prefunction Space 27 | |
|
Boosting Medical Vision-Language Pretraining via Momentum Self-Distillation under Limited Computing Resources
Poster Session 1
Phuc Pham ⋅ Nhu Pham ⋅ Ngoc Ly
|
Tucson Ballroom & Prefunction Space 82 | |
|
WiSAR3D - Aerial LiDAR dataset for 3D object detection
Poster Session 5
Oren Shrout ⋅ Ori Nizan ⋅ Yizhak Ben-Shabat ⋅ Ayellet Tal
|
Tucson Ballroom & Prefunction Space 75 | |
|
LighthouseGS: Indoor Structure-aware 3D Gaussian Splatting for Panorama-Style Mobile Captures
Poster Session 3
Seungoh Han ⋅ Jaehoon Jang ⋅ Hyunsu Kim ⋅ Jaeheung Surh ⋅ Junhyung Kwak ⋅ Hyowon Ha ⋅ Kyungdon Joo
|
Tucson Ballroom & Prefunction Space 50 | |
|
Distilling What and Why: Enhancing Driver Intention Prediction with MLLMs
Poster Session 6 + Refreshments
SAINITHIN ARTHAM ⋅ Avijit Dasgupta ⋅ Shankar Gangisetty ⋅ Jawahar CV
|
Tucson Ballroom & Prefunction Space 8 | |
|
Modeling and Learning Multiple Hypotheses for Monocular 3D Object Detection
Poster Session 5
Hyeonjeong Park ⋅ Peixi Xiong ⋅ Pei Yu ⋅ Wei Tang
|
Tucson Ballroom & Prefunction Space 118 | |
|
Learning to Animate Images from A Few Videos to Portray Delicate Human Actions
Poster Session 1
Haoxin Li ⋅ Yingchen Yu ⋅ Qilong Wu ⋅ Hanwang Zhang ⋅ Song Bai ⋅ Boyang Li
|
Tucson Ballroom & Prefunction Space 53 | |
|
Towards Streaming LiDAR Object Detection with Point Clouds as Egocentric Sequences
Poster Session 3
Mellon Zhang ⋅ Glen Chou ⋅ Saibal Mukhopadhyay
|
Tucson Ballroom & Prefunction Space 34 | |
|
DUDA: Distilled Unsupervised Domain Adaptation for Lightweight Semantic Segmentation
Poster Session 6 + Refreshments
Beomseok Kang ⋅ Niluthpol Mithun ⋅ Abhinav Rajvanshi ⋅ Han-pang Chiu ⋅ Supun Samarasekera
|
Tucson Ballroom & Prefunction Space 88 | |
|
VADER: Towards Causal Video Anomaly Understanding with Relation-Aware Large Language Models
Poster Session 6 + Refreshments
Ying Cheng ⋅ Yu-Ho Lin ⋅ Min-Hung Chen ⋅ Fu-En Yang ⋅ Shang-Hong Lai
|
Tucson Ballroom & Prefunction Space 10 | |
|
QuEENet: Quantum-Enhanced Expressive Network for Image Classification
Poster Session 6 + Refreshments
Shashank Bayal ⋅ Dawane Govind ⋅ Komal Komal ⋅ SANTOSH VIPPARTHI ⋅ Subrahmanyam Murala
|
Tucson Ballroom & Prefunction Space 65 | |
|
ObjectCore -– Efficient Few-shot Logical Anomaly Detection using Object Representations
Poster Session 3
Matic Fučka ⋅ Vitjan Zavrtanik ⋅ Danijel Skocaj
|
Tucson Ballroom & Prefunction Space 90 | |
|
HodgeFormer: Transformers for Learnable Operators on Triangular Meshes through Data-Driven Hodge Matrices
Poster Session 5
Akis Nousias ⋅ Stavros Nousias
|
Tucson Ballroom & Prefunction Space 95 | |
|
OW-Rep: Open World Object Detection with Instance Representation Learning
Poster Session 1
SUNOH LEE ⋅ Minsik Jeon ⋅ Jihong Min ⋅ Junwon Seo
|
Tucson Ballroom & Prefunction Space 33 | |
|
Marshaled Learning: Bridging Large Neural Networks with Memory-Constrained Trusted Execution Environments in Federated Learning
Poster Session 1
Shiwei Ding ⋅ Xiaoyong Yuan ⋅ Zhenlin Wang ⋅ Lan Zhang ⋅ Giuseppe Ateniese
|
Tucson Ballroom & Prefunction Space 62 | |
|
DTMIR-Pro: Domain Translation with Prompt-based Latent-Space Generalization for Multi-Weather Image Restoration
Poster Session 3
Ashutosh Kulkarni ⋅ Prashant Patil ⋅ SANTOSH VIPPARTHI ⋅ Subrahmanyam Murala ⋅ Balasubramanian Raman
|
Tucson Ballroom & Prefunction Space 89 | |
|
SPAR-Det: Segmentation-guided and Prior-Aided Routing for Small Object Detection
Poster Session 2 + Refreshments
Seungchan Kwon ⋅ Gyuil Lim ⋅ Youngjoon Han
|
Tucson Ballroom & Prefunction Space 70 | |
|
TA-Prompting: Enhancing Video Large Language Models for Dense Video Captioning via Temporal Anchors
Poster Session 1
Wei-Yuan Cheng ⋅ Kai-Po Chang ⋅ Chi-Pin Huang ⋅ Fu-En Yang ⋅ Frank Wang
|
Tucson Ballroom & Prefunction Space 22 | |
|
Large Sign Language Models: Toward 3D American Sign Language Translation
Poster Session 3
Sen Zhang ⋅ Sen Zhang ⋅ Di Liu ⋅ Zhaoyang Xia ⋅ Mingyu Zhao ⋅ Chaowei Tan ⋅ Vivian Li ⋅ Bo Liu ⋅ Dimitri Metaxas ⋅ Mubbasir Kapadia
|
Tucson Ballroom & Prefunction Space 18 | |
|
CADE: Continual Weakly-supervised Video Anomaly Detection with Ensembles
Poster Session 1
Satoshi HASHIMOTO ⋅ Tatsuya Konishi ⋅ Tomoya Kaichi ⋅ Kazunori Matsumoto ⋅ Mori Kurokawa
|
Tucson Ballroom & Prefunction Space 68 | |
|
UNO: Unifying One-stage Video Scene Graph Generation via Object-Centric Visual Representation Learning
Poster Session 2 + Refreshments
Huy Le ⋅ Nhat Chung ⋅ Tung Kieu ⋅ Jingkang Yang ⋅ Ngan Le
|
Tucson Ballroom & Prefunction Space 131 | |
|
CSGaussian: Progressive Rate-Distortion Compression and Segmentation for 3D Gaussian Splatting
Poster Session 5
Yu-Jen Tseng ⋅ Chia-Hao Kao ⋅ Jing-Zhong Chen ⋅ Alessandro Gnutti ⋅ Shao-Yuan Lo ⋅ Yen-Yu Lin ⋅ Wen-Hsiao Peng
|
Tucson Ballroom & Prefunction Space 103 | |
|
RealDroneVision: Dataset and Architecture Advancements for Small-Object Drone Detection
Poster Session 5
Arun Kumar Sivapuram ⋅ Pranav Peddinti ⋅ Harish Puppala ⋅ Komuravelli Prashanth ⋅ Jaladi Sri Harsha ⋅ Gorthi Subrahmanyam
|
Tucson Ballroom & Prefunction Space 84 | |
|
AutoSew: A Geometric Approach to Stitching Prediction with Graph Neural Networks
Poster Session 1
Pablo Ríos ⋅ Elena Garces ⋅ Jorge Lopez-Moreno
|
Tucson Ballroom & Prefunction Space 132 | |
|
SDT-6D: Fully Sparse Depth-Transformer for Staged End-to-End 6D Pose Estimation in Industrial Multi-View Bin Picking
Poster Session 6 + Refreshments
Nico Leuze ⋅ Maximilian Hoh ⋅ Samed Doğan ⋅ Nicolas Rodriguez Pena ⋅ Alfred Schöttl
|
Tucson Ballroom & Prefunction Space 114 | |
|
Decomposition Sampling for Efficient Region Annotations in Active Learning
Poster Session 3
Jingna Qiu ⋅ Frauke Wilm ⋅ Mathias Oettl ⋅ Jonas Utz ⋅ Maja Schlereth ⋅ Moritz Schillinger ⋅ Marc Aubreville ⋅ Katharina Breininger
|
Tucson Ballroom & Prefunction Space 119 | |
|
Test-Time Adaptation through Semantically-guided Feature Decomposition for Few-shot Chest X-ray Diagnosis
Poster Session 2 + Refreshments
Jayant Mahawar ⋅ Angshuman Paul
|
Tucson Ballroom & Prefunction Space 98 | |
|
Hestia: Voxel-Face-Aware Hierarchical Next-Best-View Acquisition for Efficient 3D Reconstruction
Poster Session 4 + Reception
Cheng-You Lu ⋅ Zhuoli Zhuang ⋅ Nguyen Le ⋅ da xiao ⋅ Yu-Cheng Chang ⋅ Thomas Do ⋅ Srinath Sridhar ⋅ Chin-teng Lin
|
Tucson Ballroom & Prefunction Space 97 | |
|
Synthesizing Compositional Videos from Text Description
Poster Session 5
Prajwal Singh ⋅ Kuldeep Kulkarni ⋅ Shanmuganathan Raman ⋅ Harsh Rangwani
|
Tucson Ballroom & Prefunction Space 93 | |
|
SpikeRain: Towards Energy-Efficient Single Image Deraining with Spiking Neural Networks
Poster Session 1
Md Tanvir Islam ⋅ Inzamamul Alam ⋅ Sambit Bakshi ⋅ Khan Muhammad ⋅ Javier Del Ser ⋅ Sangtae Ahn
|
Tucson Ballroom & Prefunction Space 105 | |
|
Robust Multimodal Emotion Recognition from Incomplete Modalities via Query-Based Unimodal and Cross-Modal Learning
Poster Session 4 + Reception
Ryo Miyoshi ⋅ Mayu Otani ⋅ Yuki Okafuji
|
Tucson Ballroom & Prefunction Space 59 | |
|
ICONIC-444: A 3.1-Million-Image Dataset for OOD Detection Research
Poster Session 6 + Refreshments
Gerhard Krumpl ⋅ Henning Avenhaus ⋅ Horst Possegger
|
Tucson Ballroom & Prefunction Space 116 | |
|
Polymorph: Energy-Efficient Multi-Label Classification for Video Streams on Embedded Devices
Poster Session 5
Saeid Ghafouri ⋅ Mohsen Fayyaz ⋅ Xiangchen Li ⋅ Deepu John ⋅ Bo Ji ⋅ Dimitrios Nikolopoulos ⋅ Hans Vandierendonck
|
Tucson Ballroom & Prefunction Space 61 | |
|
mmWeaver: Environment-Specific mmWave Signal Synthesis from a Photo and Activity Description
Poster Session 2 + Refreshments
Mahathir Monjur ⋅ Shahriar Nirjon
|
Tucson Ballroom & Prefunction Space 44 | |
|
BlendCLIP: Bridging Synthetic and Real Domains for Zero-Shot 3D Object Classification with Multimodal Pretraining
Poster Session 4 + Reception
Ajinkya Khoche ⋅ Gergő Nagy ⋅ Maciej Wozniak ⋅ Thomas Gustafsson ⋅ Patric Jensfelt
|
Tucson Ballroom & Prefunction Space 142 | |
|
Sketch3R: Rapid and Realistic 3D VR Sketch Creation to Shape Retrieval
Poster Session 6 + Refreshments
Mritunjoy Halder ⋅ Shivam Shukla ⋅ Lokender Tiwari ⋅ Raghav Mittal ⋅ Brojeshwar Bhowmick
|
Tucson Ballroom & Prefunction Space 140 | |
|
Training-free Conditional Image Embedding Framework Leveraging Large Vision Language Models
Poster Session 6 + Refreshments
Masayuki Kawarada ⋅ Kosuke Yamada ⋅ Antonio Tejero-de-Pablos ⋅ Naoto Inoue
|
Tucson Ballroom & Prefunction Space 42 | |
|
DF-Mamba: Deformable State Space Modeling for 3D Hand Pose Estimation in Interactions
Poster Session 4 + Reception
Yifan Zhou ⋅ Takehiko Ohkawa ⋅ Guwenxiao Zhou ⋅ Kanoko Goto ⋅ Takumi Hirose ⋅ Yusuke Sekikawa ⋅ Nakamasa Inoue
|
Tucson Ballroom & Prefunction Space 102 | |
|
Semi-supervised Key-Point Estimation for Echocardiography Video
Poster Session 4 + Reception
Seok-Hwan Oh ⋅ hyeonjik lee ⋅ Guil Jung ⋅ Myeong-Gee Kim ⋅ Young-Min Kim ⋅ Hyuksool Kwon ⋅ Hyeon-min Bae
|
Tucson Ballroom & Prefunction Space 134 | |
|
CLUE: Bringing Machine Unlearning to Mobile Devices
Poster Session 3
A. Q. M. Sazzad Sayyed ⋅ Nathaniel Bastian ⋅ Michael Lucia ⋅ Ananthram Swami ⋅ Francesco Restuccia
|
Tucson Ballroom & Prefunction Space 80 | |
|
Snapmoji: Instant Generation of Animatable Dual-Stylized Avatars
Poster Session 2 + Refreshments
Eric Chen ⋅ Di Liu ⋅ Sizhuo Ma ⋅ Michael Vasilkovsky ⋅ Bing Zhou ⋅ Qiang Gao ⋅ Wenzhou Wang ⋅ Jiahao Luo ⋅ Dimitri Metaxas ⋅ Vincent Sitzmann ⋅ Jian Wang
|
Tucson Ballroom & Prefunction Space 51 | |
|
From Bands to Depth: Understanding Bathymetry Decisions on Sentinel-2
Poster Session 2 + Refreshments
Satyaki Roy Chowdhury ⋅ Aswathnarayan Radhakrishnan ⋅ Hari Subramoni
|
Tucson Ballroom & Prefunction Space 62 | |
|
From Street to Orbit: Training-Free Cross-View Retrieval via Location Semantics and LLM Guidance
Poster Session 1
Jeongho Min ⋅ Dongyoung Kim ⋅ Jaehyup Lee
|
Tucson Ballroom & Prefunction Space 55 | |
|
Codebook Knowledge with Mamba-Transformer For Low-Light Image Enhancement
Poster Session 3
Runhua Deng ⋅ Aiwen Jiang ⋅ Long Peng ⋅ Qiuhai Yan
|
Tucson Ballroom & Prefunction Space 77 | |
|
Morphing Through Time: Diffusion-Based Bridging of Temporal Gaps for Robust Alignment in Change Detection
Poster Session 1
Seyedehanita Madani ⋅ Vishal Patel
|
Tucson Ballroom & Prefunction Space 42 | |
|
FujiView: Multimodal Late-Fusion for Predicting Scenic Visibility
Poster Session 4 + Reception
Bryce Bible ⋅ Shah Hasnaeen ⋅ Hairong Qi
|
Tucson Ballroom & Prefunction Space 131 | |
|
Anatomy-VLM: A Fine-grained Vision-Language Model for Medical Interpretation
Poster Session 2 + Refreshments
Difei Gu ⋅ Yunhe Gao ⋅ Mu Zhou ⋅ Dimitri Metaxas
|
Tucson Ballroom & Prefunction Space 135 | |
|
Unsupervised Modular Adaptive Region Growing and RegionMix Classification for Wind Turbine Segmentation
Poster Session 3
Raül Pérez-Gonzalo ⋅ Riccardo Magro ⋅ Andreas Espersen ⋅ Antonio Agudo
|
Tucson Ballroom & Prefunction Space 92 | |
|
Learning Action Hierarchies via Hybrid Geometric Diffusion
Poster Session 3
Arjun Kaushik Kaushik ⋅ Nalini Ratha ⋅ Venu Govindaraju
|
Tucson Ballroom & Prefunction Space 20 | |
|
Self-Supervised Compression and Artifact Correction for Streaming Underwater Imaging Sonar
Poster Session 3
Rongsheng Qian ⋅ Chi Xu ⋅ Xiaoqiang Ma ⋅ Hao Fang ⋅ Yili Jin ⋅ William Atlas ⋅ Jiangchuan Liu
|
Tucson Ballroom & Prefunction Space 123 | |
|
BOP-Distrib: Revisiting 6D Pose Estimation Benchmarks for Better Evaluation under Visual Ambiguities
Poster Session 2 + Refreshments
Boris Meden ⋅ Asma Brazi ⋅ Fabrice Mayran de Chamisso ⋅ Steve Bourgeois ⋅ Vincent Lepetit
|
Tucson Ballroom & Prefunction Space 16 | |
|
VividAnimator: An End-to-End Audio and Pose-driven Half-Body Human Animation Framework
Poster Session 4 + Reception
Donglin Huang ⋅ Yongyuan Li ⋅ Tianhang Liu ⋅ Junming Huang ⋅ Xiaoda Yang ⋅ Chi Wang ⋅ Weiwei Xu
|
Tucson Ballroom & Prefunction Space 4 | |
|
Edge-Aware Image Manipulation via Diffusion Models with a Novel Structure-Preservation Loss
Poster Session 4 + Reception
Minsu Gong ⋅ Nuri Ryu ⋅ Jungseul Ok ⋅ Sunghyun Cho
|
Tucson Ballroom & Prefunction Space 82 | |
|
3D Superquadric Splatting
Poster Session 4 + Reception
Daniel MacSwayne ⋅ Ales Leonardis ⋅ Jianbo Jiao
|
Tucson Ballroom & Prefunction Space 83 | |
|
Learnable Query-Enhanced Pose Transformation
Poster Session 2 + Refreshments
Yi-Zhen Wang ⋅ Hong-Han Shuai
|
Tucson Ballroom & Prefunction Space 59 | |
|
VLMDiff: Leveraging Vision-Language Models for Multi-Class Anomaly Detection with Diffusion
Poster Session 5
Samet Hicsonmez ⋅ Abd El Rahman Shabayek ⋅ Djamila Aouada
|
Tucson Ballroom & Prefunction Space 49 | |
|
Bi-ICE: An Inner Interpretable Framework for Image Classification via Bi-directional Interactions between Concept and Input Embeddings
Poster Session 3
Jinyung Hong ⋅ Yearim Kim ⋅ Keun Hee Park ⋅ Sangyu Han ⋅ Nojun Kwak ⋅ Theodore Pavlic
|
Tucson Ballroom & Prefunction Space 88 | |
|
Bridging the Domain Gap in Small Multimodal Models: A Dual-level Alignment Perspective
Poster Session 6 + Refreshments
Aveen Dayal ⋅ Peketi Divya ⋅ Nidhi Tiwari ⋅ Linga Reddy Cenkeramaddi ⋅ C Mohan ⋅ Abhinav Kumar
|
Tucson Ballroom & Prefunction Space 100 | |
|
UniTabBank: A Large Scale Multi-Lingual, Multi-Layout, Multi-Type, Multi-Format Dataset for Table Detection
Poster Session 5
Ajoy Mondal ⋅ Saumya Vijay Mundra ⋅ Avijit Dasgupta ⋅ Jawahar CV
|
Tucson Ballroom & Prefunction Space 66 | |
|
Text Slider: Efficient and Plug-and-Play Continuous Concept Control for Image/Video Synthesis via LoRA Adapters
Poster Session 1
Pin-Yen Chiu ⋅ I-Sheng Fang ⋅ Jun-Cheng Chen
|
Tucson Ballroom & Prefunction Space 59 | |
|
VitaGlyph: Vitalizing Artistic Typography with Flexible Dual-branch Diffusion Models
Poster Session 6 + Refreshments
Kailai Feng ⋅ Yabo Zhang ⋅ Haodong Yu ⋅ Zhilong Ji ⋅ Jinfeng Bai ⋅ Hongzhi Zhang ⋅ Wangmeng Zuo
|
Tucson Ballroom & Prefunction Space 97 | |
|
PoseGaussian: Pose-Driven Novel View Synthesis for Robust 3D Human Reconstruction
Poster Session 4 + Reception
Ju Shen ⋅ Chen Chen ⋅ Tam Nguyen ⋅ Vijayan Asari
|
Tucson Ballroom & Prefunction Space 69 | |
|
STEG-AIW: Spatio-Temporal Gating and Adaptive-Timestep Inference for Efficient Spiking Neural Networks
Poster Session 3
Gulfam A Saju ⋅ Anton Spirkin ⋅ Felipe Marcelino ⋅ Yuchou Chang
|
Tucson Ballroom & Prefunction Space 121 | |
|
Workzone3D: A Multimodal Dataset for 3D Work Zone Perception in Autonomous Driving
Poster Session 3
Shounak Sural ⋅ Nishad Sahu ⋅ Ragunathan Rajkumar
|
Tucson Ballroom & Prefunction Space 101 | |
|
CasTex: Cascaded Text-to-Texture Synthesis via Explicit Texture Maps and Physically-Based Shading
Poster Session 3
Mishan Aliev ⋅ Dmitry Baranchuk ⋅ Kirill Struminsky
|
Tucson Ballroom & Prefunction Space 47 | |
|
Prompt-OT: An Optimal Transport Regularization Paradigm for Knowledge Preservation in Vision-Language Model Adaptation
Poster Session 1
Xiwen Chen ⋅ Wenhui Zhu ⋅ Peijie Qiu ⋅ Hao Wang ⋅ Huayu Li ⋅ Haiyu Wu ⋅ XUANZHAO DONG ⋅ Aris Sotiras ⋅ Yalin Wang ⋅ Abolfazl Razi
|
Tucson Ballroom & Prefunction Space 64 | |
|
HumanGuideNet: Adapter-Based Alignment of Deep Neural Networks with Human Similarity Judgments
Poster Session 2 + Refreshments
Xufu Liu ⋅ Yifan Yang ⋅ Zhengxin Zhang
|
Tucson Ballroom & Prefunction Space 37 | |
|
PrevMatch: Revisiting and Maximizing Temporal Knowledge in Semi-Supervised Semantic Segmentation
Poster Session 4 + Reception
Wooseok Shin ⋅ Hyun Joon Park ⋅ Jin Sob Kim ⋅ Juan Yun ⋅ Se Park ⋅ Sung Han
|
Tucson Ballroom & Prefunction Space 64 | |
|
Zero-Shot Table Extraction in Business Documents: A Unified Benchmark with Error Taxonomy and Ecological Analysis
Poster Session 4 + Reception
Eliott THOMAS ⋅ Mickael Coustaty ⋅ Aurélie JOSEPH ⋅ Tri-Cong Pham ⋅ Gaspar DELOIN ⋅ Elodie CAREL ⋅ Vincent d'Andecy ⋅ Jean-marc Ogier
|
Tucson Ballroom & Prefunction Space 66 | |
|
MAESTRO: Masked AutoEncoders for Multimodal, Multitemporal, and Multispectral Earth Observation Data
Poster Session 1
Antoine Labatie ⋅ Michael Vaccaro ⋅ Nina Lardiere ⋅ Anatol Garioud ⋅ Nicolas Gonthier
|
Tucson Ballroom & Prefunction Space 21 | |
|
IMPACT: Interpretable Most Important Person Analysis and Classification using Transformer-based Models
Poster Session 6 + Refreshments
Akshat Rampuria ⋅ Kamakshya Nayak ⋅ Kamalakar Thakare ⋅ Tushar Joshi ⋅ Aditya Singh ⋅ Haesol Park ⋅ Heeseung Choi ⋅ Debi Dogra ⋅ Ig-Jae Kim
|
Tucson Ballroom & Prefunction Space 93 | |
|
MapVerse: A Benchmark for Geospatial Question Answering on Diverse Real-World Maps
Poster Session 6 + Refreshments
Sharat Bhat ⋅ Harshita Khandelwal ⋅ Tushar Kataria ⋅ Vivek Gupta
|
Tucson Ballroom & Prefunction Space 92 | |
|
HistoMILKD: A Multiple Instance Learning based Multi-Teacher Knowledge Distillation Framework for Whole Slide Image Classification
Poster Session 3
Mayur Mallya ⋅ Ali Khajegili Mirabadi ⋅ Hossein Farahani ⋅ Ali Bashashati
|
Tucson Ballroom & Prefunction Space 45 | |
|
SymNet: A Multi-Task Network for Joint Radio Map Reconstruction and Transmitter Localization
Poster Session 1
Lyuzhou Ye ⋅ Thanh Le ⋅ Yan Huang
|
Tucson Ballroom & Prefunction Space 15 | |
|
Perceptually Guided 3DGS Streaming and Rendering for Mixed Reality
Poster Session 3
Yunxiang Zhang ⋅ Sai Mupparaju ⋅ Kenneth Chen ⋅ Jenna Kang ⋅ Xinyu Zhang ⋅ Maito Omori ⋅ Kazuyuki Arimatsu ⋅ Qi Sun
|
Tucson Ballroom & Prefunction Space 124 | |
|
Cycle-consistent Multi-graph Matching for Self-supervised Annotation of C. Elegans
Poster Session 6 + Refreshments
Sebastian Stricker ⋅ Christoph Karg ⋅ Lisa Hutschenreiter ⋅ Bogdan Savchynskyy ⋅ Dagmar Kainmueller
|
Tucson Ballroom & Prefunction Space 1 | |
|
R3: Reconstruction, Raw, and Rain: Deraining Directly in the Bayer Domain
Poster Session 4 + Reception
Nate Rothschild ⋅ Moshe Kimhi ⋅ Avi Mendelson ⋅ Chaim Baskin
|
Tucson Ballroom & Prefunction Space 98 | |
|
Sun-E: Dataset and Benchmark for Event-Based Sun Sensing
Poster Session 4 + Reception
Sydney Dolan ⋅ Alessandro Golkar
|
Tucson Ballroom & Prefunction Space 51 | |
|
Dressing the Imagination: A Dataset for AI-Powered Translation of Text into Fashion Outfits and A Novel NeRA Adapter for Enhanced Feature Adaptation
Poster Session 2 + Refreshments
Gayatri Deshmukh ⋅ Somsubhra De ⋅ Chirag Sehgal ⋅ Jishu Gupta ⋅ Sparsh Mittal
|
Tucson Ballroom & Prefunction Space 65 | |
|
Mitigating Object and Action Hallucinations in Multimodal LLMs via Self-Augmented Contrastive Alignment
Poster Session 3
Kai-Po Chang ⋅ Wei-Yuan Cheng ⋅ Chi-Pin Huang ⋅ Fu-En Yang ⋅ Frank Wang
|
Tucson Ballroom & Prefunction Space 24 | |
|
Yunheon Lee, Juncheol Ye, Jaehong Kim, Dongsu Han
NerVast: Compression-Efficient Scaling of Implicit Neural Video Representations via Scene-based Parameter-sharing
Poster Session 2 + Refreshments
Yunheon Lee ⋅ Juncheol Ye ⋅ Jaehong Kim ⋅ Dongsu Han
|
Tucson Ballroom & Prefunction Space 114 | |
|
End-to-End Fine-Tuning of 3D Texture Generation using Differentiable Rewards
Poster Session 1
Amirhossein Zamani ⋅ Tianhao Xie ⋅ Amir Aghdam ⋅ Tiberiu Popa ⋅ Eugene Belilovsky
|
Tucson Ballroom & Prefunction Space 17 | |
|
Reverse Personalization
Poster Session 1
Han-Wei Kung ⋅ Tuomas Varanka ⋅ Nicu Sebe
|
Tucson Ballroom & Prefunction Space 95 | |
|
DiffRegCD: Integrated Registration and Change Detection with Diffusion Features
Poster Session 6 + Refreshments
Seyedehanita Madani ⋅ Rama Chellappa ⋅ Vishal Patel
|
Tucson Ballroom & Prefunction Space 29 | |
|
FSP-DETR: Few-Shot Prototypical Parasitic Ova Detection
Poster Session 4 + Reception
Shubham Trehan ⋅ Udhav Ramachandran ⋅ Akash Rao ⋅ Ruth Scimeca ⋅ Sathya Aakur
|
Tucson Ballroom & Prefunction Space 101 | |
|
MIX-based Foreground and Background Patch Augmentation Guided by Physics and Material Properties for X-ray Detection
Poster Session 1
Xintong Liu ⋅ Dongliang Chang ⋅ Yujun Tong ⋅ Zhanyu Ma
|
Tucson Ballroom & Prefunction Space 94 | |
|
Controllable Long-term Motion Generation with Extended Joint Targets
Poster Session 4 + Reception
Eunjong Lee ⋅ Eunhee Kim ⋅ Sanghoon Hong ⋅ Eunho Jung ⋅ Jihoon Kim
|
Tucson Ballroom & Prefunction Space 84 | |
|
MuSACo: Multimodal Subject-Specific Selection and Adaptation for Expression Recognition with Co-Training
Poster Session 3
Muhammad Osama Zeeshan ⋅ Natacha Gillet ⋅ Alessandro Lameiras Koerich ⋅ Marco Pedersoli ⋅ Francois Bremond ⋅ Eric Granger
|
Tucson Ballroom & Prefunction Space 66 | |
|
A Deep Network for Object Detection on Inland Waters
Poster Session 5
Dennis Griesser ⋅ Bastian Goldluecke ⋅ Matthias Franz ⋅ Georg Umlauf
|
Tucson Ballroom & Prefunction Space 76 | |
|
Unsupervised Memorability Modeling from Tip-of-the-Tongue Retrieval Queries
Poster Session 3
Sree Bhattacharyya ⋅ Yaman Singla ⋅ Sudhir Yarram ⋅ Somesh Singh ⋅ Harini S I ⋅ James Wang
|
Tucson Ballroom & Prefunction Space 126 | |
|
VAST-ReID: A Low-Light Benchmark Dataset for Person Re-Identification with Visual and Attribute-Rich Semantic Tracking
Poster Session 5
Hammad Khan ⋅ Rakesh Giri ⋅ Kamalakar Thakare ⋅ Heeseung Choi ⋅ Hyungjoo Jung ⋅ Debi Dogra ⋅ Ig-Jae Kim
|
Tucson Ballroom & Prefunction Space 4 | |
|
CONSTANT: Towards High-Quality One-Shot Handwriting Generation with Patch Contrastive Enhancement and Style-Aware Quantization
Poster Session 4 + Reception
Anh-Duy Le ⋅ Van Pham ⋅ Thanh Vo ⋅ Mai Toan ⋅ Tuan-Anh Tran
|
Tucson Ballroom & Prefunction Space 1 | |
|
One-Shot Fine-Grained Re-Identification of Paint Marked Honey Bees using Vision Foundation Models
Poster Session 1
Luke Meyers ⋅ Josué A. Rodríguez-Cordero ⋅ Remi Megret
|
Tucson Ballroom & Prefunction Space 54 | |
|
Automated Suturing Skill Assessment in Robot-assisted Surgery from Endoscopic Videos using Clinically-guided Evaluation Criteria
Poster Session 6 + Refreshments
Atharva Deo ⋅ Ujjwal Pasupulety ⋅ Nicholas Matsumoto ⋅ Jay Moran ⋅ Cherine Yang ⋅ Jeanine Kim ⋅ Rafal Kocielnik ⋅ Aurash Naser-Tavakolian ⋅ Andrew Hung
|
Tucson Ballroom & Prefunction Space 2 | |
|
Enhancing Vision Language Corruption Robustness using Cross Distribution & Prompted Denoisers
Poster Session 4 + Reception
Sameer Shafayet Latif ⋅ Sadab Shiper ⋅ K. Kiran ⋅ Md Ishmam ⋅ MD HOSSAIN ⋅ Abu Kamal ⋅ Md. Ashmafee
|
Tucson Ballroom & Prefunction Space 141 | |
|
FCC: Fully Connected Correlation for One-Shot Segmentation
Poster Session 4 + Reception
Seonghyeon Moon ⋅ Haein Kong ⋅ Muhammad Haris Khan ⋅ Mubbasir Kapadia ⋅ Yuewei Lin
|
Tucson Ballroom & Prefunction Space 52 | |
|
UI-Styler: Ultrasound Image Style Transfer with Class-Aware Prompts for Cross-Device Diagnosis Using a Frozen Black-Box Inference Network
Poster Session 2 + Refreshments
Nhat-Tuong Do-Tran ⋅ Ngoc-Hoang-Lam Le ⋅ Ching-Chun Huang
|
Tucson Ballroom & Prefunction Space 128 | |
|
ISALux: Illumination and Semantics-Aware Transformer Employing Mixture of Experts for Low Light Image Enhancement
Poster Session 6 + Refreshments
Raul Balmez ⋅ Alexandru Brateanu ⋅ Ciprian Orhei ⋅ Codruta Ancuti ⋅ Cosmin Ancuti
|
Tucson Ballroom & Prefunction Space 63 | |
|
KMOPS: Keypoint-Driven Method for Multi-Object Pose and Metric Size Estimation from Stereo Images
Poster Session 3
Ying-Kun Wu ⋅ Yi Shen ⋅ Tzuhsuan Huang ⋅ I-Sheng Fang ⋅ Jun-Cheng Chen
|
Tucson Ballroom & Prefunction Space 132 | |
|
Learning Unified Spatio-temporal Representations for Efficient Compressed Video Understanding
Poster Session 4 + Reception
Shristi Biswas Biswas ⋅ Efstathia Soufleri ⋅ Arani Roy ⋅ Kaushik Roy
|
Tucson Ballroom & Prefunction Space 45 | |
|
HiGlassRM: Learning to Remove High-prescription Glasses via Synthetic Dataset Generation
Poster Session 4 + Reception
Sebin Lee ⋅ Heewon Kim
|
Tucson Ballroom & Prefunction Space 28 | |
|
Enhancing Object Detection Training via Joint Image-Annotation Generation
Poster Session 2 + Refreshments
Roy Uziel ⋅ Oded Bialer
|
Tucson Ballroom & Prefunction Space 31 | |
|
R-MMA: Enhancing Vision-Language Models with Recurrent Adapters for Few-Shot and Cross-Domain Generalization
Poster Session 5
Md Fahim ⋅ Md Ishmam ⋅ Mir Sazzat Hossain ⋅ M Ashraful Amin ⋅ Amin Ali ⋅ A K M Mahbubur Rahman
|
Tucson Ballroom & Prefunction Space 67 | |
|
OPFormer: Object Pose Estimation leveraging foundation model with geometric encoding
Poster Session 3
Artem Moroz ⋅ Vít Zeman ⋅ Martin Mikšík ⋅ Elizaveta Isianova ⋅ Miroslav David ⋅ Pavel Burget ⋅ Varun Burde
|
Tucson Ballroom & Prefunction Space 133 | |
|
Robust Scene Coordinate Regression via Geometrically-Consistent Global Descriptors
Poster Session 6 + Refreshments
Son Tung Nguyen ⋅ Alejandro Fontan ⋅ Michael Milford ⋅ Tobias Fischer
|
Tucson Ballroom & Prefunction Space 96 | |
|
SphereEdit: Spherical Semantic Editing in Diffusion Models
Poster Session 6 + Refreshments
Salamata Konate ⋅ Hassan Hamidi ⋅ Elham Dolatabadi ⋅ Frank Rudzicz ⋅ Laleh Seyyed-Kalantari
|
Tucson Ballroom & Prefunction Space 84 | |
|
NoHumansRequired: Autonomous High-Quality Image Editing Triplet Mining
Poster Session 5
Maksim Kuprashevich ⋅ Grigorii Alekseenko ⋅ Irina Tolstykh ⋅ Georgii Fedorov ⋅ Bulat Suleimanov ⋅ Vladimir Dokholyan ⋅ Aleksandr Gordeev
|
Tucson Ballroom & Prefunction Space 25 | |
|
ProtoGMVAE: A Variational Auto-Encoder with True Gaussian Mixture Prior for Prototypical-based Self-Explainability
Poster Session 4 + Reception
Martin Blanchard ⋅ Christophe Ducottet ⋅ Damien Muselet ⋅ Olivier Delézay
|
Tucson Ballroom & Prefunction Space 106 | |
|
Stabilizing Direct Training of Spiking Neural Networks: Membrane Potential Initialization and Threshold-robust Surrogate Gradient
Poster Session 6 + Refreshments
Hyunho Kook ⋅ Byeongho Yu ⋅ Jeong Oh ⋅ Eunhyeok Park
|
Tucson Ballroom & Prefunction Space 123 | |
|
MR-Pruner: Training-free Multi-resolution Visual Token Pruning for Multi-modal Large Language Models
Poster Session 1
Seunghoon Han ⋅ Hyewon Lee ⋅ Soyoung Park ⋅ Jong-Ryul Lee ⋅ Sungsu Lim
|
Tucson Ballroom & Prefunction Space 104 | |
|
Uncertainty-Aware Subset Selection for Robust Visual Explainability under Distribution Shifts
Poster Session 2 + Refreshments
Madhav Gupta ⋅ Vishak Prasad C ⋅ Ganesh Ramakrishnan
|
Tucson Ballroom & Prefunction Space 22 | |
|
SSMT-Net: A Semi-Supervised Multitask Transformer-Based Network for Thyroid Nodule Segmentation in Ultrasound Images
Poster Session 5
Muhammad Umar Farooq ⋅ Abd Ur Rehman ⋅ Azka Rehman ⋅ Muhammad Usman ⋅ Dong-Kyu Chae
|
Tucson Ballroom & Prefunction Space 26 | |
|
LooC: Effective Low-Dimensional Codebook for Compositional Vector Quantization
Poster Session 1
Jie Li ⋅ Kwan-Yee K. Wong ⋅ Kai Han
|
Tucson Ballroom & Prefunction Space 16 | |
|
Quantifying the Limits of Segmentation Foundation Models: Modeling Challenges in Segmenting Tree-Like and Low-Contrast Objects
Poster Session 4 + Reception
Yixin Zhang ⋅ Nicholas Konz ⋅ Kevin Kramer ⋅ Maciej Mazurowski
|
Tucson Ballroom & Prefunction Space 88 | |
|
Fully Unsupervised Self-debiasing of Text-to-Image Diffusion Models
Poster Session 1
Korada Sri Vardhana ⋅ Shrikrishna Lolla ⋅ Soma Biswas
|
Tucson Ballroom & Prefunction Space 117 | |
|
Beyond the Highlights: Video Retrieval with Salient and Surrounding Contexts
Poster Session 2 + Refreshments
Jaehun Bang ⋅ Moon Ye-Bin ⋅ Tae-Hyun Oh ⋅ Kyungdon Joo
|
Tucson Ballroom & Prefunction Space 74 | |
|
Histogram Assisted Quality Aware Generative Model for Resolution Invariant NIR Image Colorization
Poster Session 5
Abhinav Abhinav ⋅ Rajeev Ranjan Dwivedi ⋅ Samiran Das ⋅ Vinod Kurmi
|
Tucson Ballroom & Prefunction Space 60 | |
|
Revisiting an Old Perspective Projection for Monocular 3D Morphable Models Regression
Poster Session 6 + Refreshments
Toby Chong ⋅ Ryota Nakajima
|
Tucson Ballroom & Prefunction Space 57 | |
|
CAPE: A CLIP-Aware Pointing Ensemble of Complementary Heatmap Cues for Embodied Reference Understanding
Poster Session 3
Fevziye Irem Eyiokur ⋅ Dogucan Yaman ⋅ Hazım Ekenel ⋅ Alexander Waibel
|
Tucson Ballroom & Prefunction Space 98 | |
|
Augmenting with NeRFs: Fast Relocalization on Densified Datasets
Poster Session 3
Michael Tomadakis ⋅ Rebecca Borissova ⋅ Yuxuan Zhang ⋅ Sanjeev Koppal
|
Tucson Ballroom & Prefunction Space 14 | |
|
iMotion-LLM: Instruction-Conditioned Trajectory Generation
Poster Session 2 + Refreshments
Abdulwahab Felemban ⋅ Nussair Hroub ⋅ Jian Ding ⋅ Faizan Khan ⋅ Xiaoqian Shen ⋅ Abduallah Mohamed ⋅ Mohamed Elhoseiny
|
Tucson Ballroom & Prefunction Space 123 | |
|
DreamMakeup: Face Makeup Customization using Latent Diffusion Models
Poster Session 1
Geon Yeong Park ⋅ Inhwa Han ⋅ Serin Yang ⋅ Yeobin Hong ⋅ Seongmin Jeong ⋅ Heechan Jeon ⋅ Myeongjin Goh ⋅ Sung Yi ⋅ Jin Nam ⋅ Jong Ye
|
Tucson Ballroom & Prefunction Space 41 | |
|
An Efficient Multi-Rater Setup Towards Personalized and Diversified Medical Image Segmentation
Poster Session 4 + Reception
Sajed Almorsy ⋅ Ayman Khalafallah ⋅ Marwan Torki
|
Tucson Ballroom & Prefunction Space 99 | |
|
Salience-SGG: Enhancing Unbiased Scene Graph Generation with Iterative Salience Estimation
Poster Session 1
Runfeng Qu ⋅ Ole Hall ⋅ Pia Bideau ⋅ Julie Ouerfelli-Ethier ⋅ Martin Rolfs ⋅ Klaus Obermayer ⋅ Olaf Hellwich
|
Tucson Ballroom & Prefunction Space 99 | |
|
CURIO: Curvature-Aligned and Efficient OCR for Low-Resource Historical Manuscripts
Poster Session 2 + Refreshments
Sai Madhusudan Gunda ⋅ Tathagata Ghosh ⋅ Simran Sandral ⋅ Ravi Kiran Sarvadevabhatla
|
Tucson Ballroom & Prefunction Space 57 | |
|
SkelSplat: Robust Multi-view 3D Human Pose Estimation with Differentiable Gaussian Rendering
Poster Session 3
Laura Bragagnolo ⋅ Leonardo Barcellona ⋅ Stefano Ghidoni
|
Tucson Ballroom & Prefunction Space 11 | |
|
Learning spatio-temporal feature representations for video-based gaze estimation
Poster Session 4 + Reception
Alexandre Personnic ⋅ Mihai Bace
|
Tucson Ballroom & Prefunction Space 80 | |
|
VLMs Guided Interpretable Decision Making in Autonomous Driving
Poster Session 4 + Reception
Xin Hu ⋅ TAOTAO JING ⋅ Renran Tian ⋅ Zhengming Ding
|
Tucson Ballroom & Prefunction Space 20 | |
|
Enhancing Monocular 3D Hand Reconstruction with Learned Texture Priors
Poster Session 5
Giorgos Karvounas ⋅ Nikolaos Kyriazis ⋅ Iason Oikonomidis ⋅ Georgios Pavlakos ⋅ Antonis Argyros
|
Tucson Ballroom & Prefunction Space 121 | |
|
Systematic Analysis of the Unintentional CSAM-Generation-Potential of Text-to-Image Models
Poster Session 1
Nicolas Göller ⋅ Martin Steinebach
|
Tucson Ballroom & Prefunction Space 48 | |
|
Enhanced Back-Projection of Vision Features for 3D Symmetry Detection
Poster Session 1
Isaac Aguirre ⋅ Ivan Sipiran
|
Tucson Ballroom & Prefunction Space 7 | |
|
Descrip3D: Enhancing Large Language Model-based 3D Scene Understanding with Object-Level Text Descriptions
Poster Session 2 + Refreshments
Jintang Xue ⋅ Ganning Zhao ⋅ Jie-En Yao ⋅ Hong-En Chen ⋅ Yue Hu ⋅ Meida Chen ⋅ Suya You ⋅ Chung Chieh Kuo
|
Tucson Ballroom & Prefunction Space 32 | |
|
MARS: a Multimodal Alignment and Ranking System for Few-Shot Segmentation
Poster Session 1
Nico Catalano ⋅ Stefano Samele ⋅ Paolo Pertino ⋅ Matteo Matteucci
|
Tucson Ballroom & Prefunction Space 123 | |
|
Occlusion Boundary and Depth: Mutual Enhancement via Multi-Task Learning
Poster Session 4 + Reception
Lintao XU ⋅ Yinghao WANG ⋅ Chaohui Wang
|
Tucson Ballroom & Prefunction Space 14 | |
|
MoRe: Monocular Geometry Refinement via Graph Optimization for Cross-View Consistency
Poster Session 4 + Reception
Dongki Jung ⋅ Jaehoon Choi ⋅ Yonghan Lee ⋅ Sungmin Eum ⋅ Heesung Kwon ⋅ Dinesh Manocha
|
Tucson Ballroom & Prefunction Space 53 | |
|
Vision-informed Semantic Text Alignment for Open-set Recognition in Remote Sensing
Poster Session 2 + Refreshments
Siddhant Gole ⋅ Akash Pal ⋅ Ankit Jha ⋅ Subhasis Chaudhuri ⋅ Biplab Banerjee
|
Tucson Ballroom & Prefunction Space 134 | |
|
GrounDiff: Diffusion-Based Ground Surface Generation from Digital Surface Models
Poster Session 1
Oussema Dhaouadi ⋅ Johannes Meier ⋅ Jacques Kaiser ⋅ Daniel Cremers
|
Tucson Ballroom & Prefunction Space 130 | |
|
RampWatch: An In-the-Wild Dataset and Text-Guided Detection Framework for Recreational Vessels
Poster Session 6 + Refreshments
Malik Muhammad Asim ⋅ Claire Smallwood ⋅ Abdullah Tariq ⋅ Johnny Lo ⋅ Syed Zulqarnain Gilani
|
Tucson Ballroom & Prefunction Space 36 | |
|
AnyAnomaly: Zero-Shot Customizable Video Anomaly Detection with LVLM
Poster Session 3
Sunghyun Ahn ⋅ Youngwan Jo ⋅ Kijung Lee ⋅ Sein Kwon ⋅ Inpyo Hong ⋅ Sanghyun Park
|
Tucson Ballroom & Prefunction Space 10 | |
|
STARS: Self-supervised Tuning for 3D Action Recognition in Skeleton Sequences
Poster Session 2 + Refreshments
Soroush Mehraban ⋅ Javad Rajabi ⋅ Andrea Iaboni ⋅ Babak Taati
|
Tucson Ballroom & Prefunction Space 137 | |
|
ObjectMeshDeform : Towards recovering precise 3D geometry of real objects via image-guided mesh deformation of 3D generative priors
Poster Session 2 + Refreshments
Siddharth Katageri ⋅ SANJANA SINHA ⋅ Sourav Ghosh ⋅ Soumyadip Maity ⋅ Brojeshwar Bhowmick
|
Tucson Ballroom & Prefunction Space 111 | |
|
PADM: A Physics-aware Diffusion Model for Attenuation Correction
Poster Session 2 + Refreshments
Trung Pham ⋅ Hoang Vu ⋅ Anh Chu ⋅ Dac Thai Nguyen ⋅ Trung Thanh Nguyen ⋅ THAO TRUONG TRUONG ⋅ Mai Son ⋅ Thanh Nguyen ⋅ Phi Le Nguyen
|
Tucson Ballroom & Prefunction Space 113 | |
|
Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression
Poster Session 3
Roy Jennings ⋅ Genady Paikin ⋅ Roy Shaul ⋅ Evgeny Soloveichik
|
Tucson Ballroom & Prefunction Space 52 | |
|
D2Mamba: Dual Domain Guided Informed Search in State Space Model for Underwater Image Enhancement
Poster Session 5
Alik Pramanick ⋅ Soumajit Roy ⋅ ARIJIT SUR
|
Tucson Ballroom & Prefunction Space 126 | |
|
TopoRec: Point Cloud Recognition Using Topological Data Analysis
Poster Session 6 + Refreshments
Anirban Ghosh ⋅ Iliya Kulbaka ⋅ Ian Dahlin ⋅ Ayan Dutta
|
Tucson Ballroom & Prefunction Space 33 | |
|
AdaptViG: Adaptive Vision GNN with Exponential Decay Gating
Poster Session 1
Mustafa Munir ⋅ Mostafijur Rahman ⋅ Radu Marculescu
|
Tucson Ballroom & Prefunction Space 43 | |
|
DynaGSLAM: Real-Time Gaussian-Splatting SLAM for Online Rendering, Tracking, Motion Predictions of Moving Objects in Dynamic Scenes
Poster Session 2 + Refreshments
Runfa Li ⋅ Mahdi Shaghaghi ⋅ Keito Suzuki ⋅ Xinshuang Liu ⋅ Varun Moparthi ⋅ Bang Du ⋅ Walker Curtis ⋅ Martin Renschler ⋅ Ki Myung Brian Lee ⋅ Nikolay Atanasov ⋅ Truong Nguyen
|
Tucson Ballroom & Prefunction Space 97 | |
|
SD-CSFL: A Synthetic Data-Driven Conformity Scoring Framework for Robust Federated Learning
Poster Session 5
Ebtisaam Alharbi ⋅ Abdulrahman Kerim ⋅ Leandro Soriano Marcolino ⋅ Qiang Ni
|
Tucson Ballroom & Prefunction Space 105 | |
|
AirLock+: Scaling UAV-to-Satellite Image Registration for Target Geolocalization and Geospatial Augmented Reality
Poster Session 3
Zhiyun Deng ⋅ Austin Case ⋅ Luis Sentis
|
Tucson Ballroom & Prefunction Space 40 | |
|
Gaussian Swaying: Surface-Based Framework for Aerodynamic Simulation with 3D Gaussians
Poster Session 4 + Reception
Hongru Yan ⋅ Xiang Zhang ⋅ Zeyuan Chen ⋅ Fangyin Wei ⋅ Zhuowen Tu
|
Tucson Ballroom & Prefunction Space 62 | |
|
Overcoming Fine-Grained Visual Challenges in Animal Re-Identification via Semantic Feature Alignment
Poster Session 1
Yihao Wu ⋅ Di Zhao ⋅ Yuzhuo Li ⋅ Matthew Alajas ⋅ Alistair Glen ⋅ Jingfeng Zhang ⋅ Gillian Dobbie ⋅ Daniel Wilson ⋅ Yun Sing Koh
|
Tucson Ballroom & Prefunction Space 36 | |
|
UniDiff: Parameter-Efficient Adaptation of Diffusion Models for Land Cover Classification with Multi-Modal Remotely Sensed Imagery and Sparse Annotations
Poster Session 4 + Reception
Yuzhen Hu ⋅ Saurabh Prasad
|
Tucson Ballroom & Prefunction Space 31 | |
|
Zero-LEAD: Source-Free Universal Domain Adaptation for Abdominal Multi-Organ Segmentation
Poster Session 5
Ahmed El-Sayed ⋅ Marwan Torki
|
Tucson Ballroom & Prefunction Space 87 | |
|
Overcoming Small Data Limitations in Video-Based Infant Respiration Estimation
Poster Session 5
Liyang Song ⋅ Hardik Bishnoi ⋅ Sai Manne ⋅ Sarah Ostadabbas ⋅ Briana Taylor ⋅ Michael Wan
|
Tucson Ballroom & Prefunction Space 52 | |
|
SUGAR: A Sweeter Spot for Generative Unlearning of Many Identities
Poster Session 2 + Refreshments
Dung Nguyen ⋅ Quang Nguyen ⋅ Preston Robinette ⋅ Eli Jiang ⋅ Taylor Johnson ⋅ Kevin Leach
|
Tucson Ballroom & Prefunction Space 125 | |
|
One-shot Portrait Stylizaiton via Geometric Alignment
Poster Session 4 + Reception
Xinrui Wang ⋅ Zilin Guo ⋅ Zhuoru Li ⋅ Jinze Yu ⋅ Heng Zhang ⋅ Yusuke Iwasawa ⋅ Yutaka Matsuo ⋅ Jiaxian Guo
|
Tucson Ballroom & Prefunction Space 65 | |
|
RobuMTL: Enhancing Multi-Task Learning Robustness Against Weather Conditions
Poster Session 4 + Reception
Tasneem Shaffee ⋅ Sherief Reda
|
Tucson Ballroom & Prefunction Space 125 | |
|
Graph-Based Spectral Attention with Multi-Spectral Images for Illuminant Estimation
Poster Session 2 + Refreshments
Dong-Hoon Kang ⋅ Seung-Yeop Baek ⋅ Jong-Ok Kim
|
Tucson Ballroom & Prefunction Space 142 | |
|
BoxSplitGen: A Generative Model for 3D Part Bounding Boxes in Varying Granularity
Poster Session 2 + Refreshments
Juil Koo ⋅ Wei-Tung Lin ⋅ Chanho Park ⋅ Chanhyeok Park ⋅ Minhyuk Sung
|
Tucson Ballroom & Prefunction Space 35 | |
|
AD2: Analysis and Detection of Adversarial Threats in Visual Perception for End-to-End Autonomous Driving Systems
Poster Session 2 + Refreshments
Ishan Sahu ⋅ Somnath Hazra ⋅ Somak Aditya ⋅ Soumyajit Dey
|
Tucson Ballroom & Prefunction Space 27 | |
|
LASOR: Towards Clinically Transparent and Explainable Ophthalmic Report Generation via Lesion-Aware Segmentation
Poster Session 4 + Reception
Jian Park ⋅ Hyunseon Won ⋅ JeeEun Kim ⋅ JOON HWANG ⋅ Jeong Han ⋅ Ji Park ⋅ Daniel Hwang ⋅ Jinyoung Han
|
Tucson Ballroom & Prefunction Space 87 | |
|
Can We Challenge Open-Vocabulary Object Detectors with Generated Content in Street Scenes?
Poster Session 1
Annika Mütze ⋅ Sadia Ilyas ⋅ Christian Dörpelkus ⋅ Matthias Rottmann
|
Tucson Ballroom & Prefunction Space 71 | |
|
SOAF: Scene Occlusion-aware Neural Acoustic Field
Poster Session 3
Huiyu Gao ⋅ Jiahao Ma ⋅ David Ahmedt-Aristizabal ⋅ Chuong Nguyen ⋅ Miaomiao Liu
|
Tucson Ballroom & Prefunction Space 113 | |
|
SOPHY: Generating Simulation-Ready Objects with Physical Materials
Poster Session 4 + Reception
Junyi Cao ⋅ Evangelos Kalogerakis
|
Tucson Ballroom & Prefunction Space 39 | |
|
Diversity Preserving Coresets for Image Quality Assessment
Poster Session 6 + Refreshments
Arpita Nema ⋅ Hanwei Zhu ⋅ Xi Zhang ⋅ Weisi Lin
|
Tucson Ballroom & Prefunction Space 69 | |
|
SeaClips: A Video Dataset for Maritime Object Detection.
Poster Session 4 + Reception
Franziska Denk ⋅ Christian Rankl ⋅ Shaban ALMOUAHED ⋅ David Moser ⋅ Robert Sablatnig
|
Tucson Ballroom & Prefunction Space 30 | |
|
Tables Decoded: DELTA for Structure, TARQA for Understanding
Poster Session 2 + Refreshments
Jahanvi Rajput ⋅ Dhruv Kudale ⋅ Saikiran Kasturi ⋅ Utkarsh Verma ⋅ Ganesh Ramakrishnan
|
Tucson Ballroom & Prefunction Space 129 | |
|
DREAM: Dynamic Prompts and GuidedMix for Efficient Continual Adaptation of Visual-Language Models
Poster Session 5
Evelyn Chee ⋅ Mong-Li Lee ⋅ Wynne Hsu
|
Tucson Ballroom & Prefunction Space 6 | |
|
Blur2Sharp: Human Novel Pose and View Synthesis with Generative Prior Refinement
Poster Session 3
Chia Lai ⋅ I-Hsuan Lo ⋅ Yen-Ku Yeh ⋅ Thanh-Nguyen Truong ⋅ Ching-Chun Huang
|
Tucson Ballroom & Prefunction Space 41 | |
|
GorillaWatch: An Automated System for In-the-Wild Gorilla Re-Identification and Population Monitoring
Poster Session 6 + Refreshments
Maximilian Schall ⋅ Felix Knöfel ⋅ Noah König ⋅ Jan Kubeler ⋅ Maximilian von Klinski ⋅ Joan Linnemann ⋅ Xiaoshi Liu ⋅ Iven Schlegelmilch ⋅ Ole Woyciniuk ⋅ Alexandra Schild ⋅ Dante Wasmuht ⋅ Magdalena Bermejo Espinet ⋅ Germán Illera Basas ⋅ Gerard de Melo
|
Tucson Ballroom & Prefunction Space 110 | |
|
DATTA: Domain-Adversarial Test-Time Adaptation for Cross-Domain WiFi-Based Human Activity Recognition
Poster Session 3
Julian Strohmayer ⋅ Rafael Sterzinger ⋅ Matthias Wödlinger ⋅ Martin Kampel
|
Tucson Ballroom & Prefunction Space 48 | |
|
CLIP-IT: CLIP-based Pairing of Histology Images with Privileged Textual Information
Poster Session 3
Banafsheh Karimian ⋅ Giulia Avanzato ⋅ Soufiane Belharbi ⋅ Alexis Guichemerre ⋅ Luke McCaffrey ⋅ Mohammadhadi Shateri ⋅ Eric Granger
|
Tucson Ballroom & Prefunction Space 75 | |
|
Exploiting Label-Independent Regularization from Spatial Patterns for Whole Slide Image Analysis
Poster Session 6 + Refreshments
Weiyi Wu ⋅ Xinwen Xu ⋅ Chongyang Gao ⋅ Xingjian Diao ⋅ Siting Li ⋅ Jiang Gui
|
Tucson Ballroom & Prefunction Space 136 | |
|
Crafting Descriptive Information for a Zero-shot Method to Improve Knowledge-Based Visual Question Answering Performance
Poster Session 3
Mohammad Moradi ⋅ Sudhir Mudur
|
Tucson Ballroom & Prefunction Space 19 | |
|
From Few-Shot to Zero-Shot Pallet Load Recognition: A Deployed Embedding-Based Vision System for Industrial Logistics
Poster Session 2 + Refreshments
Juan Jesús Losada del Olmo ⋅ Emilio Ballesteros ⋅ Pedro Lopez-de-Teruel ⋅ Alberto Ruiz
|
Tucson Ballroom & Prefunction Space 141 | |
|
SaccadeX: Directed Acyclic Graph-based Semi-Supervised Learning of Continuous Ocular Dynamics from Sparse Neuromorphic Streams
Poster Session 1
Nuwan Bandara ⋅ Thivya Kandappu ⋅ Archan Misra
|
Tucson Ballroom & Prefunction Space 133 | |
|
See, Think, Learn: A Self-Taught Multimodal Reasoner
Poster Session 6 + Refreshments
Sourabh Sharma ⋅ Sonam Gupta ⋅ Sadbhawna Thakur
|
Tucson Ballroom & Prefunction Space 105 | |
|
PVeRA: Probabilistic Vector-Based Random Matrix Adaptation
Poster Session 2 + Refreshments
Leo Fillioux ⋅ Enzo Ferrante ⋅ Paul-Henry Cournède ⋅ Maria Vakalopoulou ⋅ Stergios Christodoulidis
|
Tucson Ballroom & Prefunction Space 100 | |
|
Non-Aligned Reference Image Quality Assessment for Novel View Synthesis
Poster Session 5
Abhijay Ghildyal ⋅ Rajesh Sureddi ⋅ Nabajeet Barman ⋅ Saman Zadtootaghaj ⋅ Alan Bovik
|
Tucson Ballroom & Prefunction Space 53 | |
|
View-aware Cross-modal Distillation for Multi-view Action Recognition
Poster Session 6 + Refreshments
Trung Thanh Nguyen ⋅ Yasutomo Kawanishi ⋅ Vijay John ⋅ Takahiro Komamizu ⋅ Ichiro Ide
|
Tucson Ballroom & Prefunction Space 54 | |
|
Beyond Real Weights: Hypercomplex Representations for Stable Quantization
Poster Session 1
Jawad Ibn Ahad ⋅ Maisha Rahman ⋅ Amrijit Biswas ⋅ Muhammad Kabir ⋅ Robin Krambroeckers ⋅ Sifat Momen ⋅ Nabeel Mohammed ⋅ Shafin Rahman
|
Tucson Ballroom & Prefunction Space 113 | |
|
Power of Boundary and Reflection: Semantic Transparent Object Segmentation using Pyramid Vision Transformer with Transparent Cues
Poster Session 3
Tuan-Anh Vu ⋅ Nguyen Hai ⋅ Ziqiang Zheng ⋅ Binh-Son Hua ⋅ Qing Guo ⋅ Ivor Tsang ⋅ Sai-Kit Yeung
|
Tucson Ballroom & Prefunction Space 42 | |
|
QAL : A Loss for Recall–Precision Balance in 3D Reconstruction
Poster Session 6 + Refreshments
Pranay Meshram ⋅ Yash Turkar ⋅ kartikeya singh ⋅ Praveen Raj Masilamani ⋅ Charuvahan Adhivarahan ⋅ Karthik Dantu
|
Tucson Ballroom & Prefunction Space 73 | |
|
Efficient Text-Guided Convolutional Adapter for the Diffusion Model
Poster Session 3
Aryan Das ⋅ Koushik Biswas ⋅ Swalpa Roy ⋅ Badri Patro ⋅ Vinay Verma
|
Tucson Ballroom & Prefunction Space 105 | |
|
ClusterMine: Robust Label-Free Visual Out-Of-Distribution Detection via Concept Mining from Text Corpora
Poster Session 2 + Refreshments
Nikolaos Adaloglou ⋅ Diana Petrusheva ⋅ Mohamed Asker ⋅ Felix Michels ⋅ Markus Kollmann
|
Tucson Ballroom & Prefunction Space 56 | |
|
Digital Forensic AI You Can Explain: A Case Study on Video Source Camera Identification
Poster Session 5
Maryna Veksler ⋅ Kemal Akkaya ⋅ Selcuk Uluagac
|
Tucson Ballroom & Prefunction Space 117 | |
|
Confidence Through Parallel Attention for Depth and Uncertainty Estimation in Dynamic Environments
Poster Session 4 + Reception
Onkar Susladkar ⋅ Rohit Pawar ⋅ Chirag Sehgal ⋅ Samaksh Ujjawal ⋅ Sparsh Mittal
|
Tucson Ballroom & Prefunction Space 11 | |
|
TED-4DGS: Temporally Activated and Embedding-based Deformation for 4DGS Compression
Poster Session 5
Cheng-Yuan Ho ⋅ He-Bi Yang ⋅ Jui-Chiu Chiang ⋅ Yu-Lun Liu ⋅ Wen-Hsiao Peng
|
Tucson Ballroom & Prefunction Space 55 | |
|
Improvise, Adapt, Overcome — Telescopic Adapters for Efficient fine-tuning of Vision Language Models in Medical Imaging
Poster Session 6 + Refreshments
Ujjwal Mishra ⋅ VINITA SHUKLA ⋅ Praful Hambarde ⋅ Amit Shukla
|
Tucson Ballroom & Prefunction Space 39 | |
|
FedEFC: Federated Learning Using Enhanced Forward Correction Against Noisy Labels
Poster Session 6 + Refreshments
Seunghun Yu ⋅ Jin-Hyun Ahn ⋅ Joonhyuk Kang
|
Tucson Ballroom & Prefunction Space 85 | |
|
Analysis of Text Accuracy and Visual Alignment in Vision-Language Models for Artistic Text Generation
Poster Session 1
Fatima Alderazi ⋅ Motaz Alfarraj
|
Tucson Ballroom & Prefunction Space 84 | |
|
MoSCo: Real-time and Efficient Text-to-Motion Synthesis via Delta Training
Poster Session 5
Zhiyuan Zhang ⋅ Lingqiao Liu
|
Tucson Ballroom & Prefunction Space 48 | |
|
GDoFS: Gaussian DoF Separation for Plausible 3D Geometry in Sparse-View 3DGS
Poster Session 5
Yongsung Kim ⋅ Jooyoung Choi ⋅ Sungroh Yoon
|
Tucson Ballroom & Prefunction Space 80 | |
|
DexAvatar: 3D Sign Language Reconstruction with Hand and Body Pose Priors
Poster Session 5
Kaustubh Kundu ⋅ Hrishav Barua ⋅ Lucy Robertson-Bell ⋅ Zhixi Cai ⋅ Kalin Stefanov
|
Tucson Ballroom & Prefunction Space 5 | |
|
Feature-Disentangling RGB-NIR Fusion Network for Remote Driver Physiological Measurement
Poster Session 1
Tayssir Bouraffa ⋅ Ziyuan Wang ⋅ Daniel Strüber
|
Tucson Ballroom & Prefunction Space 63 | |
|
WiSE-OD: Benchmarking Robustness in Infrared Object Detection
Poster Session 4 + Reception
Heitor Medeiros ⋅ ATIF BELAL ⋅ Masih Aminbeidokhti ⋅ Eric Granger ⋅ Marco Pedersoli
|
Tucson Ballroom & Prefunction Space 60 | |
|
Gated Temporal Fusion Transformers for Robust Multi-Object Tracking
Poster Session 4 + Reception
Jinho Kim ⋅ Kuk-Jin Yoon
|
Tucson Ballroom & Prefunction Space 23 | |
|
WALDO: Where Unseen Model-based 6D Pose Estimation Meets Occlusion
Poster Session 3
Sajjad Pakdamansavoji ⋅ Yintao Ma ⋅ Amir Rasouli ⋅ TONGTONG CAO
|
Tucson Ballroom & Prefunction Space 110 | |
|
Feedback Alignment Meets Low-Rank Manifolds: A Structured Recipe for Local Learning
Poster Session 3
Arani Roy ⋅ Marco P. E. Apolinario ⋅ Shristi Biswas Biswas ⋅ Kaushik Roy
|
Tucson Ballroom & Prefunction Space 7 | |
|
Learning Beyond Labels: Self-Supervised Handwritten Text Recognition
Poster Session 5
Shree Mitra ⋅ Ajoy Mondal ⋅ Jawahar CV
|
Tucson Ballroom & Prefunction Space 81 | |
|
FLoMo-Net: A Novel Task-Adaptive Mixture of Experts Routing Framework with Frequency and Uncertainty Correction for Medical Image Segmentation
Poster Session 3
Md Rayhan Ahmed ⋅ Patricia Lasserre
|
Tucson Ballroom & Prefunction Space 106 | |
|
VISTA: A Vision and Intent-Aware Social Attention Framework for Multi-Agent Trajectory Prediction
Poster Session 1
Stephane Da Silva Martins ⋅ Emanuel Aldea ⋅ Sylvie Le Hégarat-Mascle
|
Tucson Ballroom & Prefunction Space 28 | |
|
Orca: Object Recognition and Comprehension for Archiving Marine Species
Poster Session 2 + Refreshments
Yuk Kwan Wong ⋅ Liang Haixin ⋅ Zeyu Ma ⋅ Yiwei Chen ⋅ Ziqiang Zheng ⋅ Rinaldi Gotama ⋅ Pascal Sebastian ⋅ Lauren Sparks ⋅ Sai-Kit Yeung
|
Tucson Ballroom & Prefunction Space 18 | |
|
GaussianHeadTalk: Wobble-Free 3D Talking Heads with Audio Driven Gaussian Splatting
Poster Session 6 + Refreshments
Madhav Agarwal ⋅ Mingtian Zhang ⋅ Laura Sevilla-Lara ⋅ Steven McDonagh
|
Tucson Ballroom & Prefunction Space 78 | |
|
Pretraining Helps When Capacity Allows: Evidence from Ultra-Small ConvNets
Poster Session 6 + Refreshments
Srikanth Muralidharan ⋅ Heitor Medeiros ⋅ Masih Aminbeidokhti ⋅ Eric Granger ⋅ Marco Pedersoli
|
Tucson Ballroom & Prefunction Space 107 | |
|
Intra-Class Probabilistic Embeddings for Uncertainty Estimation in Vision-Language Models
Poster Session 2 + Refreshments
Zhenxiang Lin ⋅ Maryam Haghighat ⋅ Will Browne ⋅ Dimity Miller
|
Tucson Ballroom & Prefunction Space 87 | |
|
Do generative video models understand physical principles?
Poster Session 1
Saman Motamed ⋅ Laura Culp ⋅ Kevin Swersky ⋅ Priyank Jaini ⋅ Robert Geirhos
|
Tucson Ballroom & Prefunction Space 91 | |
|
RAT4D: Rig and Animate Objects without Surface Templates in 4D
Poster Session 1
Mosam Dabhi ⋅ Simon Lucey ⋅ Laszlo Jeni
|
Tucson Ballroom & Prefunction Space 38 | |
|
Mitigating Backdoor Attacks via Trigger Reconstruction and Model Hardening
Poster Session 1
Guanhong Tao ⋅ Siyuan Cheng ⋅ Guangyu Shen ⋅ Yingqi Liu ⋅ Shengwei An ⋅ ZHUO ZHANG ⋅ Zhenting Wang ⋅ Hanxi Guo ⋅ Xiangyu Zhang
|
Tucson Ballroom & Prefunction Space 56 | |
|
Divide and Refine: Enhancing Multimodal Representation and Explainability for Emotion Recognition in Conversation
Poster Session 2 + Refreshments
Tuan Mai ⋅ Cam-Van Thi Nguyen ⋅ Duc-Trong Le
|
Tucson Ballroom & Prefunction Space 122 | |
|
SSplain: Sparse and Smooth Explainer for Retinopathy of Prematurity Classification
Poster Session 2 + Refreshments
Elifnur Sunger ⋅ Tales Imbiriba ⋅ J. Campbell ⋅ Deniz Erdogmus ⋅ Stratis Ioannidis ⋅ Jennifer Dy
|
Tucson Ballroom & Prefunction Space 28 | |
|
Broadcast2Pitch: Game State Reconstruction from Unconstrained Soccer Videos
Poster Session 4 + Reception
Yin May Oo ⋅ Yewon Hwang ⋅ Muhammad Robbani ⋅ VANYI CHAO ⋅ Ankhzaya Jamsrandorj ⋅ Hoang Nguyen ⋅ Kyung-Ryoul Mun ⋅ Jinwook Kim
|
Tucson Ballroom & Prefunction Space 19 | |
|
Dronaquatics: Real-time Swimming Analytics Using Drone Captured Imagery
Poster Session 4 + Reception
Thu Tran ⋅ Harold Abraham Joseph ⋅ Kichang Lee ⋅ Kenny Choo ⋅ Dong Ma ⋅ Shaohui Foong ⋅ Thivya Kandappu ⋅ Jeonggil Ko ⋅ Rajesh Balan
|
Tucson Ballroom & Prefunction Space 57 | |
|
Clear Sights on Site: A Spatial-Adaptive Channel Network for Deblurring Construction Site Images
Poster Session 5
Bonyani ⋅ Maryam Soleymani ⋅ Chao Wang
|
Tucson Ballroom & Prefunction Space 108 | |
|
SynPlay: Large-Scale Synthetic Human Data with Real-World Diversity for Aerial-View Perception
Poster Session 1
Jinsub Yim ⋅ Hyungtae Lee ⋅ Sungmin Eum ⋅ Yi-Ting Shen ⋅ Yan Zhang ⋅ Heesung Kwon ⋅ Shuvra Bhattacharyya
|
Tucson Ballroom & Prefunction Space 90 | |
|
Beyond Paired Data: Self-Supervised UAV Geo-Localization from Reference Imagery Alone
Poster Session 6 + Refreshments
Tristan Amadei ⋅ Enric Meinhardt-Llopis ⋅ Benedicte Bascle ⋅ Corentin ABGRALL ⋅ Gabriele Facciolo
|
Tucson Ballroom & Prefunction Space 20 | |
|
Illuminating Darkness: Learning to Enhance Low-light Images In-the-Wild
Poster Session 2 + Refreshments
S Sharif ⋅ Abdur Rehman ⋅ Zain Abidin ⋅ Fayaz Ali ⋅ Radu Timofte ⋅ Rizwan Naqvi
|
Tucson Ballroom & Prefunction Space 81 | |
|
VideoSketcher: A Training-Free Approach for Coherent Video Sketch Transfer
Poster Session 6 + Refreshments
Huining Li ⋅ Bangzhen Liu ⋅ Rui Yang ⋅ Yang Zhou ⋅ Chenshu Xu ⋅ Xufang PANG ⋅ Shengfeng He
|
Tucson Ballroom & Prefunction Space 13 | |
|
Crash2DocAI: Automated Integration of Post-Crash Car Part Images into Technical Reports
Poster Session 6 + Refreshments
Václav Diviš ⋅ Jessica Giovagnola ⋅ Khalil Ben Chikha ⋅ Marek Hrúz
|
Tucson Ballroom & Prefunction Space 101 | |
|
TacticalCalib: End-to-End 6-DoF Camera Pose Regression for Tactical Camera Calibration
Poster Session 5
Liang Fan ⋅ Xiaoqian Liu ⋅ Zhi Chen ⋅ Lingkai Yang
|
Tucson Ballroom & Prefunction Space 72 | |
|
Joint Modeling of Corruption-Driven and Information-Limited Uncertainty for Robust 3D Gaussian Splatting
Poster Session 1
Zeji Hui ⋅ Amirali Khodadadian Gostar ⋅ WeiQin Chuah ⋅ Alireza Bab-Hadiashar ⋅ Ruwan Tennakoon
|
Tucson Ballroom & Prefunction Space 66 | |
|
No MoCap Needed: Post-Training Motion Diffusion Models with Reinforcement Learning using Only Textual Prompts
Poster Session 1
Girolamo Macaluso ⋅ Lorenzo Mandelli ⋅ Mirko Bicchierai ⋅ Stefano Berretti ⋅ Andrew Bagdanov
|
Tucson Ballroom & Prefunction Space 93 | |
|
Revisiting Layer Normalization for Point Cloud Test Time Adaptation
Poster Session 1
Moslem Yazdanpanah ⋅ Ali Bahri ⋅ Mehrdad Noori ⋅ Sahar Dastani ⋅ Samuel Barbeau ⋅ David OSOWIECHI ⋅ Gustavo Vargas Hakim ⋅ Ismail Ayed ⋅ Christian Desrosiers
|
Tucson Ballroom & Prefunction Space 52 | |
|
T2LF: LLM-Guided Multimodal Diffusion for Text-to-Light Field Synthesis
Poster Session 6 + Refreshments
Soyoung Yoon ⋅ Namhyuk Ahn ⋅ In Kyu Park
|
Tucson Ballroom & Prefunction Space 12 | |
|
SENCA-st: Integrating Spatial Transcriptomics and Histopathology with Cross Attention Shared Encoder for Region Identification in Cancer Pathology
Poster Session 3
Shanaka Liyanaarachchi ⋅ Chathurya Wijethunga ⋅ Shihab Ahamed ⋅ Akthas Absar ⋅ Ranga Rodrigo
|
Tucson Ballroom & Prefunction Space 63 | |
|
LogicCBMs: Logic-Enhanced Concept-Based Learning
Poster Session 5
Deepika Vemuri ⋅ Gautham Bellamkonda ⋅ Aditya Pola ⋅ Vineeth Balasubramanian
|
Tucson Ballroom & Prefunction Space 23 | |
|
SurgXBench: Explainable Vision-Language Model Benchmark for Surgery
Poster Session 6 + Refreshments
Jiajun Cheng ⋅ Xianwu Zhao ⋅ Sainan Liu ⋅ Xiaofan Yu ⋅ Ravi Prakash ⋅ Patrick Codd ⋅ Jonathan Katz ⋅ Shan Lin
|
Tucson Ballroom & Prefunction Space 94 | |
|
CountingDINO: A Training-free Pipeline for Class-Agnostic Counting using Unsupervised Backbones
Poster Session 1
Giacomo Pacini ⋅ Lorenzo Bianchi ⋅ Luca Ciampi ⋅ Nicola Messina ⋅ Giuseppe Amato ⋅ Fabrizio Falchi
|
Tucson Ballroom & Prefunction Space 77 | |
|
Personalized Image Privacy Advisors via Federated Daisy-Chaining
Poster Session 2 + Refreshments
Sourasekhar Banerjee ⋅ Vengateswaran Subramaniam ⋅ Debaditya Roy ⋅ Vigneshwaran Subbaraju ⋅ Monowar Bhuyan
|
Tucson Ballroom & Prefunction Space 132 | |
|
Reciprocal Teaching: Dynamic Multi-Model Teacher-Student Learning for Multiple Noisy Annotations
Poster Session 6 + Refreshments
Wenjie Ai ⋅ Cuong Nguyen ⋅ Adrian Hilton ⋅ Gustavo Carneiro
|
Tucson Ballroom & Prefunction Space 111 | |
|
WWE-UIE: A Wavelet & White Balance Efficient Network for Underwater Image Enhancement
Poster Session 2 + Refreshments
Ching-Heng Cheng ⋅ Jen-Wei Lee ⋅ Chia-Ming Lee ⋅ Chih-Chung Hsu
|
Tucson Ballroom & Prefunction Space 69 | |
|
CLIP’s Visual Embedding Projector is a Few-shot Cornucopia
Poster Session 3
Mohammad Fahes ⋅ Tuan-Hung VU ⋅ Andrei Bursuc ⋅ Patrick Perez ⋅ Raoul de Charette
|
Tucson Ballroom & Prefunction Space 32 | |
|
SFMNet: Sparse Focal Modulation for 3D Object Detection
Poster Session 5
Oren Shrout ⋅ Ayellet Tal
|
Tucson Ballroom & Prefunction Space 47 | |
|
UltraClean: A Simple Framework to Train Robust Neural Networks against Backdoor Attacks
Poster Session 6 + Refreshments
Bingyin Zhao ⋅ Yingjie Lao
|
Tucson Ballroom & Prefunction Space 109 | |
|
LangPose: Language-Aligned Motion for Robust 3D Human Pose Estimation
Poster Session 6 + Refreshments
Longyun Liao ⋅ Rong Zheng
|
Tucson Ballroom & Prefunction Space 83 | |
|
Restora-Flow: Mask-Guided Image Restoration with Flow Matching
Poster Session 4 + Reception
Arnela Hadzic ⋅ Franz Thaler ⋅ Lea Bogensperger ⋅ Simon Johannes Joham ⋅ Martin Urschler
|
Tucson Ballroom & Prefunction Space 63 | |
|
RegionAligner: Bridging Ego-Exo Views for Object Correspondence via Unified Text-Visual Learning
Poster Session 3
Yuhao Su ⋅ Ehsan Elhamifar
|
Tucson Ballroom & Prefunction Space 33 | |
|
Scalable Video Action Anticipation with Cross Linear Attentive Memory
Poster Session 6 + Refreshments
Zeyun Zhong ⋅ Manuel Martin ⋅ David Schneider ⋅ David Lerch ⋅ Chengzhi Wu ⋅ Frederik Diederichs ⋅ Juergen Gall ⋅ Jürgen Beyerer
|
Tucson Ballroom & Prefunction Space 87 | |
|
Learning Compact Video Representations for Efficient Long-form Video Understanding in Large Multimodal Models
Poster Session 3
Yuxiao Chen ⋅ Jue Wang ⋅ Zhikang Zhang ⋅ Jingru Yi ⋅ Xu Zhang ⋅ Yang Zou ⋅ Zhaowei Cai ⋅ Jianbo Yuan ⋅ Xinyu Li ⋅ Hao Yang ⋅ Davide Modolo
|
Tucson Ballroom & Prefunction Space 127 | |
|
CSF-Net: Context-Semantic Fusion Network for Large Mask Inpainting
Poster Session 6 + Refreshments
Chae-Yeon Heo ⋅ Yeong-Jun Cho
|
Tucson Ballroom & Prefunction Space 103 | |
|
ChartQA-X: Generating Explanations for Visual Chart Reasoning
Poster Session 5
Shamanthak Hegde ⋅ Pooyan Fazli ⋅ Hasti Seifi
|
Tucson Ballroom & Prefunction Space 63 | |
|
AnyBald: Toward Realistic Diffusion-Based Hair Removal In-The-Wild
Poster Session 2 + Refreshments
Yongjun Choi ⋅ Seungoh Han ⋅ Soomin Kim ⋅ Sumin Son ⋅ Mohsen Rohani ⋅ Edgar Maucourant ⋅ Dongbo Min ⋅ Kyungdon Joo
|
Tucson Ballroom & Prefunction Space 77 | |
|
FAE-Net: Fashion Attribute Editing via Disentangled Latent Conditioning in Diffusion Models
Poster Session 1
Parvatam Rajith Bhargav ⋅ Gaurab Bhattacharya ⋅ Vivek B S ⋅ Jayavardhana Gubbi
|
Tucson Ballroom & Prefunction Space 19 | |
|
NRGMark: Localized Watermarking for Energy Transparency in Images
Poster Session 6 + Refreshments
Shruti Agarwal ⋅ Élie Michel ⋅ Vishal Asnani ⋅ Tania Mathern ⋅ John Collomosse
|
Tucson Ballroom & Prefunction Space 55 | |
|
ACuRE: Accurate Continuity-Regularized SpO2 Estimation Using Liquid Time-Constant Networks
Poster Session 6 + Refreshments
Shahzad Ahmad ⋅ DR. MISHRA ⋅ Sania Bano ⋅ Sukalpa Chanda ⋅ Yogesh Rawat
|
Tucson Ballroom & Prefunction Space 5 | |
|
F-ViTA: Foundation Model Guided Visible to Infrared Translation
Poster Session 4 + Reception
Jay Paranjape ⋅ Celso de Melo ⋅ Vishal Patel
|
Tucson Ballroom & Prefunction Space 129 | |
|
Graph Query Networks for Object Detection with Automotive Radar
Poster Session 5
Loveneet Saini ⋅ Hasan Tercan ⋅ Tobias Meisen
|
Tucson Ballroom & Prefunction Space 113 | |
|
Multi-Grained Text-Guided Image Fusion for Multi-Exposure and Multi-Focus Scenarios
Poster Session 6 + Refreshments
Mingwei Tang ⋅ Jiahao Nie ⋅ Guang Yang ⋅ Ziqing Cui ⋅ Jie Li
|
Tucson Ballroom & Prefunction Space 45 | |
|
FB-4D: Spatial-Temporal Coherent Dynamic 3D Content Generation with Feature Banks
Poster Session 4 + Reception
Jinwei Li ⋅ Huan-ang Gao ⋅ Wenyi Li ⋅ Haohan Chi ⋅ Chenyu Liu ⋅ Chenxi Du ⋅ Yiqian Liu ⋅ Mingju Gao ⋅ Guiyu Zhang ⋅ Zongzheng Zhang ⋅ Li Yi ⋅ Yao Yao ⋅ Jingwei Zhao ⋅ Hongyang Li ⋅ Yikai Wang ⋅ Hao Zhao
|
Tucson Ballroom & Prefunction Space 96 | |
|
Neural Geometry Image-Based Representations with Optimal Transport (OT)
Poster Session 5
Xiang Gao ⋅ Yuanpeng Liu ⋅ Jiazhi Li ⋅ Xinmu Wang ⋅ Minghao Guo ⋅ Yu Guo ⋅ Xiyun Song ⋅ Heather Yu ⋅ Zhiqiang Lao ⋅ David Gu
|
Tucson Ballroom & Prefunction Space 83 | |
|
LENVIZ: A High-Resolution Low-Exposure Night Vision Benchmark Dataset
Poster Session 2 + Refreshments
Manjushree Aithal ⋅ Rosaura VidalMata ⋅ Manikandtan Kartha ⋅ Gong Chen ⋅ Eashan Adhikarla ⋅ Lucas Kirsten ⋅ Zhicheng Fu ⋅ Nikhil Madhusudhana ⋅ Joseph Nasti
|
Tucson Ballroom & Prefunction Space 106 | |
|
DICE: Discrete Inversion Enabling Controllable Editing for Masked Generative Models
Poster Session 1
Sen Zhang ⋅ Quan Dao ⋅ Ligong Han ⋅ Song Wen ⋅ Minhao Bai ⋅ Di Liu ⋅ Han Zhang ⋅ Felix Juefei-Xu ⋅ Chaowei Tan ⋅ Bo Liu ⋅ Martin Min ⋅ Kang Li ⋅ Faez Ahmed ⋅ Akash Srivastava ⋅ Hongdong Li ⋅ Junzhou Huang ⋅ Dimitri Metaxas
|
Tucson Ballroom & Prefunction Space 73 | |
|
High-Level Semantics and Low-Level Features Fusion for Multi-Scale Object Detection in Dynamic Construction Environments
Poster Session 5
Bonyani ⋅ Maryam Soleymani ⋅ Chao Wang
|
Tucson Ballroom & Prefunction Space 70 | |
|
F-INR: Functional Tensor Decomposition for Implicit Neural Representations
Poster Session 5
Sai Karthikeya Vemuri ⋅ Tim Büchner ⋅ Joachim Denzler
|
Tucson Ballroom & Prefunction Space 73 | |
|
Meta-YOLO: Metadata-Guided Real-Time Object Detector in Aerial Imagery
Poster Session 6 + Refreshments
Deukryeol Yoon ⋅ Seonghak KIM ⋅ Young Hwa Sung ⋅ Jinho Jung
|
Tucson Ballroom & Prefunction Space 74 | |
|
Understanding Human-Like Biases in VLMs via Subjective Face Analytics
Poster Session 1
Chaitanya Roygaga ⋅ Aparna Bharati
|
Tucson Ballroom & Prefunction Space 50 | |
|
Integrating Multi-scale and Multi-filtration Topological Features for Medical Image Classification
Poster Session 6 + Refreshments
Pengfei Gu ⋅ Huimin Li ⋅ Haoteng Tang ⋅ Dongkuan Xu ⋅ Erik Enriquez ⋅ Dongchul Kim ⋅ Bin Fu ⋅ Danny Chen
|
Tucson Ballroom & Prefunction Space 138 | |
|
Decoupling Shape and Texture in SAM-2 via Controlled Texture Replacement
Poster Session 3
Inbal Cohen ⋅ Boaz Meivar ⋅ Peihan Tu ⋅ Shai Avidan ⋅ Gal Oren
|
Tucson Ballroom & Prefunction Space 111 | |
|
PEaRL: Pathway-Enhanced Representation Learning for Gene and Pathway Expression Prediction from Histology
Poster Session 6 + Refreshments
Sejuti Majumder ⋅ Saarthak Kapse ⋅ Moinak Bhattacharya ⋅ Xuan Xu ⋅ Alisa Yurovsky ⋅ Prateek Prasanna
|
Tucson Ballroom & Prefunction Space 81 | |
|
VectorSynth: Fine-Grained Satellite Image Synthesis with Structured Semantics
Poster Session 5
Daniel Cher ⋅ Brian Wei ⋅ Srikumar Sastry ⋅ Nathan Jacobs
|
Tucson Ballroom & Prefunction Space 116 | |
|
Feature Inversion as a Lens on Vision Encoders
Poster Session 3
Eduard Allakhverdov ⋅ Dmitrii Tarasov ⋅ Elizaveta Goncharova ⋅ Andrei Kuznetsov
|
Tucson Ballroom & Prefunction Space 65 | |
|
SAIL: Self-supervised Learning of Lighting-Invariant Representations from Real Images with Latent Diffusion
Poster Session 3
Hala Djeghim ⋅ Céline Loscos ⋅ Désiré Sidibé
|
Tucson Ballroom & Prefunction Space 29 | |
|
Stroke Modeling Enables Vectorized Character Generation with Large Vectorized Glyph Model
Poster Session 3
Xinyue Zhang ⋅ Haolong Li ⋅ Jiawei Ma ⋅ Chen Ye
|
Tucson Ballroom & Prefunction Space 46 | |
|
CaRS: A Causal Intervention Segmentation Framework and Benchmark Dataset for Autonomous Driving under Transitional Weather Conditions
Poster Session 3
Madhavi Kondapally ⋅ Naveen Kumar K ⋅ C Mohan ⋅ Sobhan Babu
|
Tucson Ballroom & Prefunction Space 108 | |
|
DirectDrag: High-Fidelity, Mask-Free, Prompt-Free Drag-based Image Editing via Readout-Guided Feature Alignment
Poster Session 6 + Refreshments
Sheng-Hao Liao ⋅ Shang-Fu Chen ⋅ Tai-Ming Huang ⋅ Wen-Huang Cheng ⋅ Kailung Hua
|
Tucson Ballroom & Prefunction Space 99 | |
|
DMS2F-HAD: A Dual-branch Mamba-based Spatial–Spectral Fusion Network for Hyperspectral Anomaly Detection
Poster Session 4 + Reception
Aayushma Pant ⋅ Lakpa Tamang ⋅ Tsz-Kwan Lee ⋅ Sunil Aryal
|
Tucson Ballroom & Prefunction Space 128 | |
|
MANTA: Physics-Informed Generalized Underwater Object Tracking
Poster Session 3
Suhas Srinath ⋅ Hemang Jamadagni ⋅ Aditya Chandrasekar ⋅ Prathosh AP
|
Tucson Ballroom & Prefunction Space 53 | |
|
A Fast, Simple, and Flexible Scale Informative Feature Transform Module for Arbitrary Scale Image Super-Resolution
Poster Session 1
Aupendu Kar ⋅ Prabir Biswas
|
Tucson Ballroom & Prefunction Space 135 | |
|
DCText: Scheduled Attention Masking for Visual Text Generation via Divide-and-Conquer Strategy
Poster Session 4 + Reception
Jaewoo Song ⋅ Jooyoung Choi ⋅ Kanghyun Baek ⋅ Sangyub Lee ⋅ Daemin Park ⋅ Sungroh Yoon
|
Tucson Ballroom & Prefunction Space 2 | |
|
Visual Detector Compression via Location-Aware Discriminant Analysis
Poster Session 3
Qizhen Lan ⋅ Jung Choi Choi ⋅ Qing Tian
|
Tucson Ballroom & Prefunction Space 60 | |
|
ImageNet-sES: A First Systematic Study of Sensor–Environment Simulation Anchored by Real Recaptures
Poster Session 1
Ji-yoon Kim ⋅ Eunsu Baek ⋅ Hyung-Sin Kim
|
Tucson Ballroom & Prefunction Space 107 | |
|
Cross-Modal Event Encoder: Bridging Image–Text Knowledge to Event Streams
Poster Session 3
SungHeon Jeong ⋅ Hanning Chen ⋅ Sanggeon Yun ⋅ Suhyeon Cho ⋅ Wenjun Huang ⋅ Xiangjian Liu ⋅ Mohsen Imani
|
Tucson Ballroom & Prefunction Space 28 | |
|
Exploring Automated Recognition of Instructional Activity and Discourse from Multimodal Classroom Data
Poster Session 5
Ivo Bueno ⋅ Ruikun Hou ⋅ Babette Bühler ⋅ Tim Fütterer ⋅ James Drimalla ⋅ Jonathan Foster ⋅ Peter Youngs ⋅ Peter Gerjets ⋅ Ulrich Trautwein ⋅ Enkelejda Kasneci
|
Tucson Ballroom & Prefunction Space 96 | |
|
WSSSP-Net: Weakly Supervised Semantic Segmentation Plugin Network for Face Anti-Spoofing
Poster Session 4 + Reception
Krzysztof Galus ⋅ Piotr Syga ⋅ Piotr Kawa
|
Tucson Ballroom & Prefunction Space 92 | |
|
NAPP: Noise-Adaptive Prototype Perturbation for Few-Shot Learning
Poster Session 6 + Refreshments
Il Kim ⋅ Sang Yun ⋅ Dongheon Lee ⋅ Seong Kim Kim ⋅ Joonki Paik
|
Tucson Ballroom & Prefunction Space 77 | |
|
Being Positive about Negative Queries: Exclusion Aware Multimodal Retrieval using Disentangled Representations
Poster Session 6 + Refreshments
Prachi Jha ⋅ Sumit Bhatia ⋅ Srikanta Bedathur
|
Tucson Ballroom & Prefunction Space 60 | |
|
PredMapNet: Future and Historical Reasoning for Consistent Online HD Vectorized Map Construction
Poster Session 4 + Reception
Bo Lang ⋅ Nirav Savaliya ⋅ Zhihao Zheng ⋅ Jinglun Feng ⋅ Zheng-Hang Yeh ⋅ Mooi Choo Chuah
|
Tucson Ballroom & Prefunction Space 114 | |
|
Inpainting of Sparse Depth Maps from Monocular Depth-from-Focus on Pixel Processor Arrays
Poster Session 4 + Reception
Maciej Lewandowski ⋅ Piotr Dudek
|
Tucson Ballroom & Prefunction Space 127 | |
|
Shift-Equivariant Complex-Valued Convolutional Neural Networks
Poster Session 2 + Refreshments
Quentin Gabot ⋅ Teck-Yian Lim ⋅ Jeremy Fix ⋅ Joana Frontera-Pons ⋅ Chengfang Ren ⋅ Jean-Philippe Ovarlez
|
Tucson Ballroom & Prefunction Space 110 | |
|
Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis
Poster Session 5
Thang-Anh-Quan Nguyen ⋅ Laurent Caraffa ⋅ Jean-Philippe Tarel ⋅ Roland Brémond
|
Tucson Ballroom & Prefunction Space 54 | |
|
ExDDV: A New Dataset for Explainable Deepfake Detection in Video
Poster Session 3
Vlad Hondru ⋅ Eduard Hogea ⋅ Darian Onchis ⋅ Radu Ionescu
|
Tucson Ballroom & Prefunction Space 130 | |
|
SCORE: Soft Label Compression-Centric Dataset Condensation via Coding Rate Optimization
Poster Session 2 + Refreshments
Bowen Yuan ⋅ Yuxia Fu ⋅ Zijian Wang ⋅ Yadan Luo ⋅ Zi Huang
|
Tucson Ballroom & Prefunction Space 75 | |
|
Direct Visual Grounding by Directing Attention of Visual Tokens
Poster Session 4 + Reception
Parsa Esmaeilkhani ⋅ Longin Jan Latecki
|
Tucson Ballroom & Prefunction Space 144 | |
|
MDUNet: Multimodal Decoding UNet for Passive Occluder-Aided Non-line-of-sight 3D Imaging
Poster Session 1
Fadlullah Raji ⋅ John Murray-Bruce
|
Tucson Ballroom & Prefunction Space 45 | |
|
One Model, Many Behaviors: Training-Induced Effects on Out-of-Distribution Detection
Poster Session 3
Gerhard Krumpl ⋅ Henning Avenhaus ⋅ Horst Possegger
|
Tucson Ballroom & Prefunction Space 116 | |
|
Imitating the Functionality of Image-to-Image Models Using a Single Example
Poster Session 2 + Refreshments
Nurit Spingarn ⋅ Tomer Michaeli
|
Tucson Ballroom & Prefunction Space 73 | |
|
NavMapFusion: Diffusion-based Fusion of Navigation Maps for Online Vectorized HD Map Construction
Poster Session 6 + Refreshments
Thomas Monninger ⋅ Zihan Zhang ⋅ Steffen Staab ⋅ Sihao Ding
|
Tucson Ballroom & Prefunction Space 71 | |
|
RobustFormer: Noise-Robust Pre-training for Images and Videos
Poster Session 2 + Refreshments
Ashish Bastola ⋅ Nishant Luitel ⋅ Hao Wang ⋅ Danda Pani Paudel ⋅ Roshni Poudel ⋅ Abolfazl Razi
|
Tucson Ballroom & Prefunction Space 83 | |
|
Rethinking Real Image Editing: Unleashing Diverse Editing Operators via Multi-Objective Optimization
Poster Session 3
Siyuan Wang ⋅ Xi Yang ⋅ Zihao Zhou ⋅ Huiru Shao ⋅ Rui Zhang ⋅ Qiufeng Wang ⋅ Guangliang Cheng ⋅ Kaizhu Huang
|
Tucson Ballroom & Prefunction Space 118 | |
|
SpecGen: Neural Spectral BRDF Generation via Spectral-Spatial Tri-plane Aggregation
Poster Session 6 + Refreshments
Jin Zhenyu ⋅ Wenjie Li ⋅ Zhanyu Ma ⋅ Heng Guo
|
Tucson Ballroom & Prefunction Space 106 | |
|
Surgical Gaussian Surfels: Highly Accurate Real-time Surgical Scene Rendering using Gaussian Surfels
Poster Session 4 + Reception
Idris Sunmola ⋅ Zhenjun Zhao ⋅ Samuel Schmidgall ⋅ Yumeng Wang ⋅ Paul Maria Scheikl ⋅ Viet Pham ⋅ Axel Krieger
|
Tucson Ballroom & Prefunction Space 22 | |
|
SCATR: Mitigating New Instance Suppression in LiDAR-based Tracking-by-Attention via Second Chance Assignment and Track Query Dropout
Poster Session 3
Brian Cheong ⋅ Letian Wang ⋅ Sandro Papais ⋅ Steven Waslander
|
Tucson Ballroom & Prefunction Space 39 | |
|
VFace: A Training-Free Approach for Diffusion-Based Video Face Swapping
Poster Session 4 + Reception
Sanoojan Baliah ⋅ Yohan Abeysinghe ⋅ Rusiru Thushara ⋅ Khan Muhammad ⋅ Abhinav Dhall ⋅ Karthik Nandakumar ⋅ Muhammad Haris Khan
|
Tucson Ballroom & Prefunction Space 3 | |
|
SegMango: Early Deep Mango Yield Prediction based on Flower Segmentation and Weather Data
Poster Session 4 + Reception
Janaksinh Ven ⋅ Charu Sharma ⋅ Azeemuddin Syed
|
Tucson Ballroom & Prefunction Space 67 | |
|
Diagnose Like A REAL Pathologist: An Uncertainty-Focused Approach for Trustworthy Multi-Resolution Multiple Instance Learning
Poster Session 5
Sungrae Hong ⋅ Sol Lee ⋅ Jisu Shin ⋅ Jiwon Jeong ⋅ Mun Yi
|
Tucson Ballroom & Prefunction Space 32 | |
|
Isolating the Role of Temporal Information in Video Saliency: A Controlled Experimental Analysis
Poster Session 5
Peter El-Jiz ⋅ Matthias Kuemmerer ⋅ Matthias Tangemann ⋅ Matthias Bethge ⋅ Andreas Bartels ⋅ Michael Bannert
|
Tucson Ballroom & Prefunction Space 11 | |
|
Safe Vision-Language Models via Unsafe Weights Manipulation
Poster Session 4 + Reception
Moreno D'Incà ⋅ Elia Peruzzo ⋅ Xingqian Xu ⋅ Humphrey Shi ⋅ Nicu Sebe ⋅ Massimiliano Mancini
|
Tucson Ballroom & Prefunction Space 38 | |
|
Structure-Aware Feature Rectification with Region Adjacency Graphs for Training-free Open-Vocabulary Semantic Segmentation
Poster Session 3
Qiming Huang ⋅ Hao Ai ⋅ Jianbo Jiao
|
Tucson Ballroom & Prefunction Space 115 | |
|
DCSHARP: 3D Gaussian Splatting with Direction Cosine Spherical Harmonics and Shape-Aware Pruning
Poster Session 3
Ahmed Hasssan ⋅ Jian Meng ⋅ Yuanbo Xiangli ⋅ Jae-sun Seo
|
Tucson Ballroom & Prefunction Space 68 | |
|
PSA-MIL: A Probabilistic Spatial Attention-Based Multiple Instance Learning for Whole Slide Image Classification
Poster Session 1
Sharon Peled ⋅ Yosef Maruvka ⋅ Moti Freiman
|
Tucson Ballroom & Prefunction Space 116 | |
|
Unsupervised Segmentation by Diffusing, Walking and Cutting
Poster Session 4 + Reception
Daniela Ivanova ⋅ Marco Aversa ⋅ Paul Henderson ⋅ John Williamson
|
Tucson Ballroom & Prefunction Space 79 | |
|
GAITGen: Disentangled Motion-Pathology Impaired Gait Generative Model -- Bringing Motion Generation to the Clinical Domain
Poster Session 3
Vida Adeli ⋅ Soroush Mehraban ⋅ Majid Mirmehdi ⋅ Alan Whone ⋅ Benjamin Filtjens ⋅ Amirhossein Dadashzadeh ⋅ Alfonso Fasano ⋅ Andrea Iaboni ⋅ Babak Taati
|
Tucson Ballroom & Prefunction Space 22 | |
|
milliMamba: Specular-Aware Human Pose Estimation via Dual mmWave Radar with Multi-Frame Mamba Fusion
Poster Session 2 + Refreshments
Niraj Prakash Kini ⋅ Shiau-Rung Tsai ⋅ Guan-Hsun Lin ⋅ Wen-Hsiao Peng ⋅ Ching-Wen Ma ⋅ Jenq-Neng Hwang
|
Tucson Ballroom & Prefunction Space 7 | |
|
Improving Animal Pose Estimation through Species Similarity Measures and Rigorous Label Definition
Poster Session 4 + Reception
Medhashree Parhy ⋅ Shaan Chanchani ⋅ Claire Kim ⋅ Joshua Mansky ⋅ Parth Thakre ⋅ Zian Pan ⋅ Haoyu Chen ⋅ Amy Reibman
|
Tucson Ballroom & Prefunction Space 132 | |
|
Comp4D: Compositional 4D Scene Generation
Poster Session 3
Hanwen Liang ⋅ Dejia Xu ⋅ Neel Bhatt ⋅ Hezhen Hu ⋅ Hanxue Liang ⋅ Konstantinos Plataniotis
|
Tucson Ballroom & Prefunction Space 62 | |
|
Food Image Generation on Multi-Noun Categories
Poster Session 4 + Reception
Xinyue Pan ⋅ Yuhao Chen ⋅ Jiangpeng He ⋅ Fengqing Zhu
|
Tucson Ballroom & Prefunction Space 124 | |
|
GraspDiffusion: Synthesizing Realistic Whole-body Hand-Object Interaction
Poster Session 2 + Refreshments
Patrick Kwon ⋅ Chen Chen ⋅ Hanbyul Joo
|
Tucson Ballroom & Prefunction Space 93 | |
|
Mem-MLP: Real-Time 3D Human Motion Generation from Sparse Inputs
Poster Session 6 + Refreshments
Sinan Mutlu ⋅ Georgios Fotios Angelis ⋅ Savas Ozkan ⋅ Paul Wisbey ⋅ Anastasios Drosou ⋅ Mete Ozay
|
Tucson Ballroom & Prefunction Space 108 | |
|
X-JEPA: A Novel Joint Learning Cross-Modal Predictive Alignment Framework for Remote Sensing Image Retrieval
Poster Session 4 + Reception
Shabnam Choudhury ⋅ Yash Salunkhe ⋅ Vaibhav Rajan ⋅ Subhasis Chaudhuri ⋅ Biplab Banerjee
|
Tucson Ballroom & Prefunction Space 7 | |
|
SOLAR: Switchable Output Layer for Accuracy and Robustness in Once-for-All Training
Poster Session 6 + Refreshments
Shaharyar Ahmed Khan Tareen ⋅ Lei Fan ⋅ Xiaojing Yuan ⋅ Qin Lin ⋅ Bin Hu
|
Tucson Ballroom & Prefunction Space 66 | |
|
Advancing Player Identification and Tracking with Global ID Fusion (GIF)
Poster Session 6 + Refreshments
Karol Wojtulewicz ⋅ Minxing Liu ⋅ Niklas Carlsson
|
Tucson Ballroom & Prefunction Space 7 | |
|
Line Art Colorization with Offset Prior-based Diffusion Model
Poster Session 4 + Reception
Xuan Zhu ⋅ Miao Cao ⋅ Fang-Lue Zhang ⋅ Yu-Kun Lai ⋅ Paul Rosin
|
Tucson Ballroom & Prefunction Space 123 | |
|
STRinGS: Selective Text Refinement in Gaussian Splatting
Poster Session 6 + Refreshments
Abhinav Raundhal ⋅ Gaurav Behera ⋅ P Narayanan ⋅ Ravi Kiran Sarvadevabhatla ⋅ Makarand Tapaswi
|
Tucson Ballroom & Prefunction Space 130 | |
|
Remote Sensing Forestry Similarity Convolution
Poster Session 6 + Refreshments
Shikuan Wang ⋅ Yuangong Chen ⋅ Jianzhou Gong ⋅ Lingyi Meng ⋅ Mengquan Wu ⋅ Longxing Liu ⋅ Haiwei Yuan ⋅ Guo Mingbin
|
Tucson Ballroom & Prefunction Space 35 | |
|
Unlocking Vision-Language Models for Video Anomaly Detection via Fine-Grained Prompting
Poster Session 3
Shu Zou ⋅ Xinyu Tian ⋅ Lukas Wesemann ⋅ Fabian Waschkowski ⋅ Zhaoyuan Yang ⋅ Jing Zhang
|
Tucson Ballroom & Prefunction Space 125 | |
|
RemEdit: Efficient Diffusion Editing with Riemannian Geometry
Poster Session 4 + Reception
Eashan Adhikarla ⋅ Brian Davison
|
Tucson Ballroom & Prefunction Space 72 | |
|
AusSmoke meets MultiNatSmoke: a fully-labelled diverse smoke segmentation dataset
Poster Session 6 + Refreshments
Weihao Li ⋅ Hongjin Zhao ⋅ Gao Zhu ⋅ Ge-Peng Ji ⋅ Nicholas Wilson ⋅ Marta Yebra ⋅ Nick Barnes
|
Tucson Ballroom & Prefunction Space 76 | |
|
Equivariant Sampling for Improving Diffusion Model-based Image Restoration
Poster Session 5
Chenxu Wu ⋅ Qingpeng Kong ⋅ Peiang Zhao ⋅ Wendi Yang ⋅ Wenxin ma ⋅ Fenghe Tang ⋅ Zihang Jiang ⋅ S Kevin Zhou
|
Tucson Ballroom & Prefunction Space 98 | |
|
FlowEO: Generative Unsupervised Domain Adaptation for Earth Observation
Poster Session 3
Georges Le Bellier ⋅ Nicolas Audebert
|
Tucson Ballroom & Prefunction Space 94 | |
|
Deepfake Detection that Generalizes Across Benchmarks
Poster Session 1
Andrii Yermakov ⋅ Jan Čech ⋅ Jiri Matas ⋅ Mario Fritz
|
Tucson Ballroom & Prefunction Space 74 | |
|
HDR Reconstruction Boosting with Training-Free and Exposure-Consistent Diffusion
Poster Session 6 + Refreshments
Yo-Tin Lin ⋅ Sykai Chen ⋅ Hou-Ning Hu ⋅ Yen-Yu Lin ⋅ Yu-Lun Liu
|
Tucson Ballroom & Prefunction Space 30 | |
|
HiMix : Hierarchical Visual-Textual Mixing Network for Lesion Segmentation
Poster Session 4 + Reception
Soojin Hwang ⋅ Jaeyoon Sim ⋅ Won Hwa Kim
|
Tucson Ballroom & Prefunction Space 100 | |
|
Visibility guided Self-Supervised Occlusion Resilient Human Pose Estimation
Poster Session 1
Arindam Dutta ⋅ Sarosij Bose ⋅ Rohit Kundu ⋅ Calvin-Khang Ta ⋅ Saketh Bachu ⋅ Konstantinos Karydis ⋅ Amit Roy-Chowdhury
|
Tucson Ballroom & Prefunction Space 101 | |
|
Exploring the Boundaries of Diffusion Models for Offline Writer Identification with Sparse and Intra-Variable Data
Poster Session 5
Aritra Dey ⋅ Chandranath Adak ⋅ Kumari Priya ⋅ Soumi Chattopadhyay ⋅ Sukalpa Chanda
|
Tucson Ballroom & Prefunction Space 131 | |
|
A Woman with a Knife or A Knife with a Woman? Measuring Directional Bias Amplification in Image Captions
Poster Session 1
Rahul Nair ⋅ Bhanu Tokas ⋅ Hannah Kerner
|
Tucson Ballroom & Prefunction Space 25 | |
|
Non‑Contact Blood Pressure Estimation from Face Videos via Physiology‑Aware Contrastive Learning
Poster Session 2 + Refreshments
JaeHyuk Son ⋅ Young-Seok Choi
|
Tucson Ballroom & Prefunction Space 95 | |
|
DuPLUS: Dual-Prompt Vision-Language Framework for Universal Medical Image Segmentation and Prognosis
Poster Session 6 + Refreshments
Numan Saeed ⋅ Tausifa Jan Saleem ⋅ Fadillah Maani ⋅ Muhammad Ridzuan ⋅ Hu Wang ⋅ Mohammad Yaqub
|
Tucson Ballroom & Prefunction Space 112 | |
|
UniCoRN: Latent Diffusion-based Unified Controllable Image Restoration Network across Multiple Degradations
Poster Session 2 + Refreshments
Debabrata Mandal ⋅ Soumitri Chattopadhyay ⋅ Guansen Tong ⋅ Praneeth Chakravarthula
|
Tucson Ballroom & Prefunction Space 13 | |
|
PatchEAD: Unifying Industrial Visual Prompting Frameworks for Patch-Exclusive Anomaly Detection
Poster Session 4 + Reception
Po-Han Huang ⋅ Jeng-Lin Li ⋅ Po-Hsuan Huang ⋅ Ming-Ching Chang ⋅ Wei-Chao Chen
|
Tucson Ballroom & Prefunction Space 119 | |
|
EndoPBR: Photorealistic Synthetic Data for Surgical 3D Vision via Physically-based Rendering
Poster Session 4 + Reception
John Han ⋅ Jie Ying Wu
|
Tucson Ballroom & Prefunction Space 126 | |
|
Beyond the Encoder: Joint Encoder-Decoder Contrastive Pre-Training Improves Dense Prediction
Poster Session 1
Sébastien Quetin ⋅ Tapotosh Ghosh ⋅ Farhad Maleki
|
Tucson Ballroom & Prefunction Space 96 | |
|
Tables Guide Vision: Learning to See the Heart through Tabular Data
Poster Session 2 + Refreshments
Marta Hasny ⋅ Maxime Di Folco ⋅ Keno Bressem ⋅ Julia Schnabel
|
Tucson Ballroom & Prefunction Space 29 | |
|
Pose-Diverse Multi-View Virtual Try-on from a Single Frontal Image via Diffusion Transformer
Poster Session 3
Seonghee Han ⋅ Minchang Chung ⋅ Gyeongsu Cho ⋅ Kyungdon Joo ⋅ Taehwan Kim
|
Tucson Ballroom & Prefunction Space 37 | |
|
Dual-Domain Multimodal Hyperbolic Fusion for Cardiopulmonary Disease Diagnosis in Emergency Care
Poster Session 6 + Refreshments
Ke Nan ⋅ Maggie Samaan ⋅ Benjamin Burns ⋅ Xia Ning ⋅ Yuchi Han ⋅ Yuan Xue
|
Tucson Ballroom & Prefunction Space 142 | |
|
Enabling High-Quality In-the-Wild Imaging from Severely Aberrated Metalens Bursts
Poster Session 1
Debabrata Mandal ⋅ Zhihan Peng ⋅ Yujie Wang ⋅ Praneeth Chakravarthula
|
Tucson Ballroom & Prefunction Space 81 | |
|
FG-TRACER: Tracing Information Flow in Multimodal Large Language Models in Free-Form Generation
Poster Session 6 + Refreshments
Alessia Saporita ⋅ Vittorio Pipoli ⋅ Federico Bolelli ⋅ Lorenzo Baraldi ⋅ Andrea Acquaviva ⋅ ELISA FICARRA
|
Tucson Ballroom & Prefunction Space 67 | |
|
ReFineVQA: Iterative Refinement of Video Description via Feedback Generation for Video Question Answering
Poster Session 6 + Refreshments
Jeongwan Shin ⋅ Chan Hur ⋅ Seongmin Cho ⋅ Jae-Ho Choi ⋅ Hyeyoung Park
|
Tucson Ballroom & Prefunction Space 43 | |
|
From Lightweight CNNs to SpikeNets: Benchmarking Accuracy–Energy Tradeoffs with Pruned Spiking SqueezeNet
Poster Session 1
Radib Kabir ⋅ Tawsif Tashwar Dipto ⋅ Mehedi Ahamed ⋅ Sabbir Ahmed ⋅ Md Hasanul Kabir
|
Tucson Ballroom & Prefunction Space 109 | |
|
MAFM³: Modular Adaptation of Foundation Models for Multi-Modal Medical AI
Poster Session 3
Mohammad Qazi ⋅ Munachiso Nwadike ⋅ Ibrahim Almakky ⋅ Mohammad Yaqub ⋅ Numan Saeed
|
Tucson Ballroom & Prefunction Space 55 | |
|
Align Video Diffusion Model with Online Video-Centric Preference Optimization
Poster Session 5
Jiacheng Zhang ⋅ Jie Wu ⋅ Weifeng Chen ⋅ Yatai Ji ⋅ Xuefeng Xiao ⋅ Weilin Huang ⋅ Kai Han
|
Tucson Ballroom & Prefunction Space 33 | |
|
HABIT: Human Action Benchmark for Interactive Traffic in CARLA
Poster Session 5
Mohan Ramesh ⋅ Mark Azer ⋅ Fabian Flohr
|
Tucson Ballroom & Prefunction Space 128 | |
|
Explaining the Unseen: Multimodal Vision-Language Reasoning for Situational Awareness in Underground Mining Disasters
Poster Session 1
Mizanur Rahman Jewel ⋅ Mohamed Elmahallawy ⋅ Sanjay Madria ⋅ Samuel Frimpong
|
Tucson Ballroom & Prefunction Space 127 | |
|
Color Preserving CMOS-SPAD Fusion for Multi-Frame HDR
Poster Session 4 + Reception
Aleksi Suonsivu ⋅ Lauri Salmela ⋅ Lassi Helin ⋅ Leevi Uosukainen ⋅ Giacomo Boracchi
|
Tucson Ballroom & Prefunction Space 78 | |
|
Sea-CLIP: Mining Semantic-Aware Representations for Few-Shot Anomaly Detection with CLIP
Poster Session 3
Xiao Guo ⋅ Zhimin Chen ⋅ Carlos Castillo ⋅ Hongcheng Wang ⋅ Xiaoming Liu
|
Tucson Ballroom & Prefunction Space 74 | |
|
Unified Video Anomaly Detection Model for Detecting Different Anomaly Types
Poster Session 1
Kijung Lee ⋅ Youngwan Jo ⋅ Sunghyun Ahn ⋅ Sanghyun Park
|
Tucson Ballroom & Prefunction Space 75 | |
|
MageBench: Bridging Large Multimodal Models to Agents
Poster Session 2 + Refreshments
Miaosen Zhang ⋅ Qi Dai ⋅ Yifan Yang ⋅ Jianmin Bao ⋅ Dongdong Chen ⋅ Kai Qiu ⋅ Chong Luo ⋅ Xin Geng ⋅ Baining Guo
|
Tucson Ballroom & Prefunction Space 1 | |
|
DermEVAL: A Dermatologist-Reviewed Benchmark for Multimodal Large Language Models
Poster Session 1
Hongjin Zhao ⋅ Weihao Li ⋅ Zhenyue Qin ⋅ Ge-Peng Ji ⋅ Yang Liu ⋅ Tom Gedeon ⋅ Nick Barnes
|
Tucson Ballroom & Prefunction Space 89 | |
|
CAMP-VQA: Caption-Embedded Multimodal Perception for No-Reference Quality Assessment of Compressed Video
Poster Session 2 + Refreshments
Xinyi Wang ⋅ Angeliki Katsenou ⋅ Junxiao Shen ⋅ David Bull
|
Tucson Ballroom & Prefunction Space 60 | |
|
TaxonRL: Reinforcement Learning with Intermediate Rewards for Interpretable Fine-Grained Visual Reasoning
Poster Session 2 + Refreshments
Maximilian von Klinski ⋅ Maximilian Schall
|
Tucson Ballroom & Prefunction Space 102 | |
|
Patch Your Matcher: Correspondence-Aware Image-to-Image Translation Unlocks Cross-Modal Matching via Single-Modality Priors
Poster Session 6 + Refreshments
Anton Frolov ⋅ Volker Rodehorst
|
Tucson Ballroom & Prefunction Space 68 | |
|
MarineEval: Assessing the Marine Intelligence of Vision-Language Models
Poster Session 2 + Refreshments
Yuk Kwan Wong ⋅ Tuan-An To ⋅ Jipeng Zhang ⋅ Ziqiang Zheng ⋅ Sai-Kit Yeung
|
Tucson Ballroom & Prefunction Space 5 | |
|
CLIP-UP: CLIP-Based Unanswerable Problem Detection for Visual Question Answering
Poster Session 5
Ben Vardi ⋅ Oron Nir ⋅ Ariel Shamir
|
Tucson Ballroom & Prefunction Space 10 | |
|
CaFlow: Enhancing Long-Term Action Quality Assessment with Causal Counterfactual Flow
Poster Session 4 + Reception
Ruisheng Han ⋅ Kanglei Zhou ⋅ Shuang Chen ⋅ Amir Atapour-Abarghouei ⋅ Hubert P. H. Shum
|
Tucson Ballroom & Prefunction Space 145 | |
|
Layout Anything: One Transformer for Universal Room Layout Estimation
Poster Session 2 + Refreshments
Md Sohag Mia ⋅ Muhammad Abdullah Adnan
|
Tucson Ballroom & Prefunction Space 15 | |
|
Not Like Transformers: Drop the Beat Representation for Dance Generation with Mamba-Based Diffusion Model
Poster Session 2 + Refreshments
Sangjune Park ⋅ Inhyeok Choi ⋅ Donghyeon Soon ⋅ Youngwoo Jeon ⋅ Kyungdon Joo
|
Tucson Ballroom & Prefunction Space 34 | |
|
Distribution Highlighted Reference-based Label Distribution Learning for Facial Age Estimation
Poster Session 5
Satoshi Suzuki ⋅ Shin'ya Yamaguchi ⋅ Shoichiro Takeda ⋅ Takuhiro Kaneko ⋅ Shota Orihashi ⋅ Ryo Masumura
|
Tucson Ballroom & Prefunction Space 64 | |
|
Can Image Splicing and Copy-Move Forgery Be Detected by the Same Model? Forensim: An Attention-Based State-Space Approach
Poster Session 5
Soumyaroop Nandi ⋅ Prem Natarajan
|
Tucson Ballroom & Prefunction Space 38 | |
|
Rank-based Geographical Regularization: Revisiting Contrastive Self-Supervised Learning for Multispectral Remote Sensing Imagery
Poster Session 4 + Reception
Tom Burgert ⋅ Leonard Hackel ⋅ Paolo Rota ⋅ Begüm Demir
|
Tucson Ballroom & Prefunction Space 9 | |
|
AortaDiff: A Unified Multitask Diffusion Framework for Contrast-Free AAA Imaging
Poster Session 6 + Refreshments
Yuxuan Ou ⋅ NING BI ⋅ Jiazhen Pan ⋅ Jiancheng Yang ⋅ Boliang Yu ⋅ Usama Zidan ⋅ Regent Lee ⋅ Vicente Grau
|
Tucson Ballroom & Prefunction Space 98 | |
|
DARB-Splatting: Generalizing Splatting with Decaying Anisotropic Radial Basis Functions
Poster Session 4 + Reception
Hashiru Pramuditha ⋅ Vinasirajan Viruthshaan ⋅ Vishagar Arunan ⋅ Saeedha Nazar ⋅ Sameera Ramasinghe ⋅ Simon Lucey ⋅ Ranga Rodrigo
|
Tucson Ballroom & Prefunction Space 21 | |
|
Hierarchical Adaptive networks with Task vectors for Test-Time Adaptation
Poster Session 4 + Reception
Sameer Ambekar ⋅ Marta Hasny ⋅ Laura Daza ⋅ Daniel Lang ⋅ Julia Schnabel
|
Tucson Ballroom & Prefunction Space 36 | |
|
GFT: Graph Feature Tuning for Efficient Point Cloud Analysis
Poster Session 6 + Refreshments
Manish Dhakal ⋅ Venkat Dasari ⋅ Rajshekhar Sunderraman ⋅ Yi Ding
|
Tucson Ballroom & Prefunction Space 72 | |
|
IPCD: Intrinsic Point-Cloud Decomposition
Poster Session 5
Shogo Sato ⋅ Takuhiro Kaneko ⋅ Shoichiro Takeda ⋅ Tomoyasu Shimada ⋅ Kazuhiko Murasaki ⋅ Taiga Yoshida ⋅ Ryuichi Tanida ⋅ Akisato Kimura
|
Tucson Ballroom & Prefunction Space 123 | |
|
See, Record, Do: Automated Generation of UI Workflows from Tutorial Videos
Poster Session 5
Adam Beauchaine ⋅ Craig Shue
|
Tucson Ballroom & Prefunction Space 44 | |
|
Empowering Source-Free Domain Adaptation via MLLM-Guided Reliability-Based Curriculum Learning
Poster Session 3
Dongjie Chen ⋅ Kartik Patwari ⋅ Zhengfeng Lai ⋅ Xiaoguang Zhu ⋅ Sen-ching Cheung ⋅ Chen-Nee Chuah
|
Tucson Ballroom & Prefunction Space 129 | |
|
QUOTA: Quantifying Objects with Text-to-Image Models for Any Domain
Poster Session 5
Wenfang Sun ⋅ Yingjun Du ⋅ Gaowen Liu ⋅ Yefeng Zheng ⋅ Cees Snoek
|
Tucson Ballroom & Prefunction Space 56 | |
|
Extreme Amodal Face Detection
Poster Session 3
Changlin Song ⋅ Yunzhong Hou ⋅ Michael Barnes ⋅ Rahul Shome ⋅ Dylan Campbell
|
Tucson Ballroom & Prefunction Space 2 | |
|
Contrastive Integrated Gradients: A Feature Attribution-Based Method for Explaining Whole Slide Image Classification
Poster Session 1
Anh Vu ⋅ Tuan Vo ⋅ Ngoc Bui ⋅ Nam Le ⋅ AKASH AWASTHI ⋅ Huy Vo ⋅ Thanh-Huy Nguyen ⋅ Zhu Han ⋅ Chandra Mohan ⋅ Hien Nguyen
|
Tucson Ballroom & Prefunction Space 115 | |
|
MEGA-PCC: A Mamba-based Efficient Approach for Joint Geometry and Attribute Point Cloud Compression
Poster Session 2 + Refreshments
Kai-Hsiang Hsieh ⋅ Monyneath Yim ⋅ Wen-Hsiao Peng ⋅ Jui-Chiu Chiang
|
Tucson Ballroom & Prefunction Space 39 | |
|
CORA: Consistency-Guided Semi-Supervised Framework for Reasoning Segmentation
Poster Session 5
Prantik Howlader ⋅ Hoang Nguyen-Canh ⋅ Srijan Das ⋅ Jingyi Xu ⋅ Hieu Le ⋅ Dimitris Samaras
|
Tucson Ballroom & Prefunction Space 13 | |
|
DODA: Adapting Object Detectors to Dynamic Agricultural Environments in Real-Time with Diffusion
Poster Session 4 + Reception
Shuai Xiang ⋅ Pieter Blok ⋅ James Burridge ⋅ Haozhou Wang ⋅ Wei Guo
|
Tucson Ballroom & Prefunction Space 49 | |
|
Training-free Detection of Text-to-video Generations via Over-coherence
Poster Session 3
Jonathan Brokman ⋅ Oren Rachmil ⋅ Omer Hofman ⋅ Roy Betser ⋅ Amit Giloni ⋅ Roman Vainshtein ⋅ Hisashi Kojima
|
Tucson Ballroom & Prefunction Space 103 | |
|
MM-TS: Multi-Modal Temperature and Margin Schedules for Contrastive Learning with Long-Tail Data
Poster Session 6 + Refreshments
Siarhei Sheludzko ⋅ Dhimitrios Duka ⋅ Bernt Schiele ⋅ Hilde Kühne ⋅ Anna Kukleva
|
Tucson Ballroom & Prefunction Space 17 | |
|
AFL-PRF: Adaptive Federated Learning for Low-Quality Data: Enhancing Performance, Robustness, and Fairness
Poster Session 1
Pinrui Yu ⋅ Yiming Xie ⋅ Longtian Ye ⋅ Geng Yuan ⋅ Ningfang Mi ⋅ Xue Lin
|
Tucson Ballroom & Prefunction Space 39 | |
|
Harnessing Object Grounding for Time-Sensitive Video Understanding
Poster Session 2 + Refreshments
Tz-Ying Wu ⋅ Sharath Nittur Sridhar ⋅ Subarna Tripathi
|
Tucson Ballroom & Prefunction Space 101 | |
|
Are All Marine Species Created Equal? Performance Disparities in Underwater Object Detection
Poster Session 4 + Reception
Melanie Wille ⋅ Tobias Fischer ⋅ Scarlett Raine
|
Tucson Ballroom & Prefunction Space 26 | |
|
ViGG: Robust RGB-D Point Cloud Registration using Visual-Geometric Mutual Guidance
Poster Session 1
Congjia Chen ⋅ Shen Yan ⋅ Yufu Qu
|
Tucson Ballroom & Prefunction Space 78 | |
|
SCORP: Scene-Consistent Object Refinement via Proxy Generation and Tuning
Poster Session 1
Ziwei Chen ⋅ Ziling Liu ⋅ Zitong Huang ⋅ Mingqi Gao ⋅ Feng Zheng
|
Tucson Ballroom & Prefunction Space 76 | |
|
How I Met Your Bias: Investigating Bias Amplification in Diffusion Models
Poster Session 4 + Reception
Nathan Roos ⋅ Ekaterina Iakovleva ⋅ Ani Gjergji ⋅ Vito Paolo Pastore ⋅ Enzo Tartaglione
|
Tucson Ballroom & Prefunction Space 104 | |
|
PhysEduVideo: A Benchmark for Evaluating Text-to-Video Models for Physics Education
Poster Session 6 + Refreshments
Megha Mariam K M ⋅ Aditya Arun ⋅ Zakaria Laskar ⋅ Jawahar CV
|
Tucson Ballroom & Prefunction Space 141 | |
|
DreamCatcher: Efficient Multi-Concept Customization via Representation Finetuning
Poster Session 5
Jungwon Lee ⋅ Changhun Lee ⋅ Eunhyeok Park
|
Tucson Ballroom & Prefunction Space 120 | |
|
Self-Supervised Visual Prompting for Cross-Domain Road Damage Detection
Poster Session 3
Xi Xiao ⋅ Zhuxuanzi Wang ⋅ Mingqiao Mo ⋅ Chen Liu ⋅ Chenrui Ma ⋅ Yanshu Li ⋅ Smita Krishnaswamy ⋅ Xiao Wang ⋅ Tianyang Wang
|
Tucson Ballroom & Prefunction Space 57 | |
|
HumanBench: Two Heads, No Legs, But Mostly Human, the State of Generative Capabilities in T2I Models
Poster Session 3
Anubhooti Jain ⋅ Mayank Vatsa ⋅ Richa Singh
|
Tucson Ballroom & Prefunction Space 112 | |
|
Boosting Unsupervised Video Instance Segmentation with Automatic Quality-Guided Self-Training
Poster Session 6 + Refreshments
Kaixuan Lu ⋅ Mehmet Onurcan Kaya ⋅ Dim Papadopoulos
|
Tucson Ballroom & Prefunction Space 18 | |
|
Where is the Watermark? Interpretable Watermark Detection at the Block Level
Poster Session 6 + Refreshments
Maria Bulychev ⋅ Neil Grant Marchant ⋅ Benjamin Rubinstein
|
Tucson Ballroom & Prefunction Space 21 | |
|
From Detection to Anticipation: Online Understanding of Struggles across Various Tasks and Activities
Poster Session 3
Shijia Feng ⋅ Michael Wray ⋅ Walterio Mayol-Cuevas
|
Tucson Ballroom & Prefunction Space 107 | |
|
Memoire: Learning User Personas from Gallery Tags for Personalized Photo Curation
Poster Session 5
Praful Mathur ⋅ Mohsin Iftekhar ⋅ Aman Sharma ⋅ Sarvesh Tiwari ⋅ Meghali Deka ⋅ Sathish Cherukuri ⋅ Roopa Sheshadri ⋅ Rakesh Valusa
|
Tucson Ballroom & Prefunction Space 102 | |
|
Zero-Shot Video Deraining with Video Diffusion Models
Poster Session 1
Tuomas Varanka ⋅ Juan Bello Bello ⋅ Hyeongwoo Kim ⋅ Pablo Garrido ⋅ Xu YAO
|
Tucson Ballroom & Prefunction Space 65 | |
|
RoadBench: A Vision-Language Foundation Model and Benchmark for Road Damage Understanding
Poster Session 5
Xi Xiao ⋅ Yunbei Zhang ⋅ Janet Wang ⋅ Lin Zhao ⋅ YUXIANG WEI ⋅ Hengjia Li ⋅ Yanshu Li ⋅ Xiao Wang ⋅ Swalpa Roy ⋅ Hao Xu ⋅ Tianyang Wang
|
Tucson Ballroom & Prefunction Space 21 | |
|
Ego-EXTRA: video-language Egocentric Dataset for EXpert-TRAinee assistance
Poster Session 4 + Reception
Francesco Ragusa ⋅ Michele Mazzamuto ⋅ Rosario Forte ⋅ Irene D'Ambra ⋅ James Fort ⋅ Jakob Engel ⋅ Antonino Furnari ⋅ Giovanni Farinella
|
Tucson Ballroom & Prefunction Space 15 | |
|
BREEN: Bridge Data-Efficient Encoder-Free Multimodal Learning with Learnable Queries
Poster Session 4 + Reception
Tianle Li ⋅ Yongming Rao ⋅ Winston Hu ⋅ Yu Cheng
|
Tucson Ballroom & Prefunction Space 105 | |
|
GAEA: A Geolocation Aware Conversational Assistant
Poster Session 4 + Reception
Ron Campos ⋅ Ashmal Vayani ⋅ Parth Parag Kulkarni ⋅ Rohit Gupta ⋅ Aizan Zafar ⋅ Aritra Dutta ⋅ Mubarak Shah
|
Tucson Ballroom & Prefunction Space 91 | |
|
Leveraging Sparsity for Privacy in Collaborative Inference
Poster Session 6 + Refreshments
Maximilian Hoefler ⋅ Karsten Mueller ⋅ Wojciech Samek
|
Tucson Ballroom & Prefunction Space 38 | |
|
Optimizing LVLMs with On-Policy Data for Effective Hallucination Mitigation
Poster Session 4 + Reception
Chengzhi Yu ⋅ Yifan Xu ⋅ Yifan Chen ⋅ Wenyi Zhang
|
Tucson Ballroom & Prefunction Space 43 | |
|
Eye-for-an-eye: Appearance Transfer with Dense Semantic Correspondence in Diffusion Models
Poster Session 4 + Reception
Sooyeon Go ⋅ Kyungmook Choi ⋅ Minjung Shin ⋅ Youngjung Uh
|
Tucson Ballroom & Prefunction Space 34 | |
|
Diffusion-Based Action Recognition Generalizes to Untrained Domains
Poster Session 5
Rogério Guimarães ⋅ Frank Xiao ⋅ Pietro Perona ⋅ Markus Marks
|
Tucson Ballroom & Prefunction Space 12 | |
|
Multimodal Medical Image Binding via Shared Text Embeddings
Poster Session 2 + Refreshments
Yunhao Liu ⋅ Suyang Xi ⋅ Shiqi Liu ⋅ Hong Ding ⋅ Chicheng Jin ⋅ Zhong Chong ⋅ Junjun He ⋅ Catherine Liu ⋅ Yiqing Shen
|
Tucson Ballroom & Prefunction Space 19 | |
|
ATM: Enhanced Alignment for Text-to-Motion Generation
Poster Session 5
Ke Han ⋅ Yueming Lyu ⋅ Weichen Yu ⋅ Nicu Sebe
|
Tucson Ballroom & Prefunction Space 101 | |
|
You May Speak Freely: Improving the Fine-Grained Visual Recognition Capabilities of Multimodal Large Language Models with Answer Extraction
Poster Session 2 + Refreshments
Logan Lawrence ⋅ Oindrila Saha ⋅ Megan Wei ⋅ Chen Sun ⋅ Subhransu Maji ⋅ Grant Horn
|
Tucson Ballroom & Prefunction Space 2 | |
|
Intraoperative 2D/3D Registration via Spherical Similarity Learning and Differentiable Levenberg-Marquardt Optimization
Poster Session 6 + Refreshments
Minheng Chen ⋅ Youyong Kong
|
Tucson Ballroom & Prefunction Space 4 | |
|
GRAPE (Gaussian Rendering for Accelerated Pixel Enhancement) Brings Fast and Lightweight Arbitrary Super-Resolution
Poster Session 6 + Refreshments
Jung In Jang ⋅ Kyong Hwan Jin
|
Tucson Ballroom & Prefunction Space 52 | |
|
Face-LLaVA: Facial Expression and Attribute Understanding through Instruction Tuning
Poster Session 2 + Refreshments
Ashutosh Chaubey ⋅ Xulang Guan ⋅ Mohammad Soleymani
|
Tucson Ballroom & Prefunction Space 117 | |
|
Revisiting Retentive Networks for Fast Range-View 3D LiDAR Semantic Segmentation
Poster Session 2 + Refreshments
Simone Mosco ⋅ Daniel Fusaro ⋅ Wanmeng Li ⋅ Alberto Pretto
|
Tucson Ballroom & Prefunction Space 103 | |
|
Diffusion-Based Authentication of Copy Detection Patterns: A Multimodal Framework with Printer Signature Conditioning
Poster Session 2 + Refreshments
Bolutife Atoki ⋅ Iuliia Tkachenko ⋅ Bertrand Kerautret ⋅ Carlos Crispim-Junior Crispim-Junior
|
Tucson Ballroom & Prefunction Space 26 | |
|
Pyramidal Spectrum: Frequency-based Hierarchically Vector Quantized VAE for Videos
Poster Session 2 + Refreshments
Tushar Prakash ⋅ Onkar Susladkar ⋅ Inderjit Dhillon ⋅ Sparsh Mittal
|
Tucson Ballroom & Prefunction Space 63 | |
|
Zero-Shot Audio-Visual Editing via Cross-Modal Delta Denoising
Poster Session 6 + Refreshments
Yan-Bo Lin ⋅ Kevin Lin ⋅ Zhengyuan Yang ⋅ Linjie Li ⋅ Jianfeng Wang ⋅ Chung-Ching Lin ⋅ Xiaofei Wang ⋅ Gedas Bertasius ⋅ Lijuan Wang
|
Tucson Ballroom & Prefunction Space 14 | |
|
FedSCAl: Leveraging Server and Client Alignment for Unsupervised Federated Source-Free Domain Adaptation
Poster Session 3
M Yashwanth ⋅ Sampath Koti ⋅ Arunabh Singh ⋅ Shyam Marjit ⋅ Anirban Chakraborty
|
Tucson Ballroom & Prefunction Space 114 | |
|
Human Pose Aggregation for Multi-View Temporal Video Alignment
Poster Session 1
Fabien Delattre ⋅ Tsung-Wei Huang ⋅ Guan-Ming Su ⋅ Erik Learned-Miller
|
Tucson Ballroom & Prefunction Space 61 | |
|
MEDAL: multi-modal MEta-space Distillation and ALignment for Visual Compatibility Learning
Poster Session 1
Dween Sanny ⋅ Vinay Verma ⋅ Prateek Sircar ⋅ Deepak Gupta
|
Tucson Ballroom & Prefunction Space 85 | |
|
FlowCLAS: Enhancing Normalizing Flow-Based Anomaly Segmentation Via Contrastive Learning
Poster Session 5
Chang Won (John) Lee ⋅ Selina Leveugle ⋅ Paul Grouchy ⋅ Chris Langley ⋅ Svetlana Stolpner ⋅ Jonathan Kelly ⋅ Steven Waslander
|
Tucson Ballroom & Prefunction Space 114 | |
|
Multimodal Graph Representation Learning over Arbitrary Sets of Modalities
Poster Session 5
Santosh Patapati ⋅ Trisanth Srinivasan
|
Tucson Ballroom & Prefunction Space 124 | |
|
RapidMV: Leveraging Spatio-Angular Latent Space for Efficient and Consistent Text-to-Multi-View Synthesis
Poster Session 2 + Refreshments
Seungwook Kim ⋅ Yichun Shi ⋅ Kejie Li ⋅ Minsu Cho ⋅ Peng Wang
|
Tucson Ballroom & Prefunction Space 25 | |
|
PiSA: A Self-Augmented Data Engine and Training Strategy for 3D Understanding with Large Models
Poster Session 1
Zilu Guo ⋅ Hongbin Lin ⋅ Zhihao Yuan ⋅ Chaoda Zheng ⋅ Pengshuo Qiu ⋅ Dongzhi Jiang ⋅ Renrui Zhang ⋅ Chun-Mei Feng ⋅ Zhen Li
|
Tucson Ballroom & Prefunction Space 122 | |
|
DreamAnywhere: Object-Centric Panoramic 3D Scene Generation
Poster Session 1
Edoardo Dominici ⋅ Jozef Hladký ⋅ Floor Verhoeven ⋅ Lukas Radl ⋅ Thomas Deixelberger ⋅ Stefan Ainetter ⋅ Philipp Drescher ⋅ Stefan Hauswiesner ⋅ Arno Coomans ⋅ Giacomo Nazzaro ⋅ Konstantinos Vardis ⋅ Markus Steinberger
|
Tucson Ballroom & Prefunction Space 1 | |
|
mEOL: Training-Free Instruction-Guided Multimodal Embedder for Vector Graphics and Image Retrieval
Poster Session 1
Kyeongseon Kim ⋅ Baek Seong-Eun ⋅ Lee Jung-Mok ⋅ Tae-Hyun Oh
|
Tucson Ballroom & Prefunction Space 114 | |
|
TS-PCI: Point Cloud Frame Interpolation with Time-Aware Point Cloud Sampling and Self-Supervised Learning Strategy
Poster Session 1
Kohei Matsuzaki ⋅ Keisuke Nonaka
|
Tucson Ballroom & Prefunction Space 6 | |
|
Referring Change Detection in Remote Sensing Imagery
Poster Session 1
Yilmaz Korkmaz ⋅ Jay Paranjape ⋅ Celso de Melo ⋅ Vishal Patel
|
Tucson Ballroom & Prefunction Space 11 | |
|
GenHSI: Controllable Generation of Human-Scene Interaction Videos
Poster Session 1
Zekun Li ⋅ Rui Zhou ⋅ Rahul Sajnani ⋅ Xiaoyan Cong ⋅ Daniel Ritchie ⋅ Srinath Sridhar
|
Tucson Ballroom & Prefunction Space 14 | |
|
SAVeD: Learning to Denoise Low-SNR Video for Improved Downstream Performance
Poster Session 5
Suzanne Stathatos ⋅ Michael Hobley ⋅ Pietro Perona ⋅ Markus Marks
|
Tucson Ballroom & Prefunction Space 100 | |
|
Forget Less by Learning Together through Concept Consolidation
Poster Session 1
Arjun Kaushik Kaushik ⋅ Naresh Kumar Devulapally ⋅ Vishnu Lokhande ⋅ Nalini Ratha ⋅ Venu Govindaraju
|
Tucson Ballroom & Prefunction Space 26 | |
|
Training-free Multi-view 4D Human Motion Reconstruction Virtual Reality System
Poster Session 1
Yijie Li ⋅ Ce Zheng ⋅ Yijie He ⋅ Joel Julin ⋅ Ryosuke Ichikari ⋅ Satoki Ogiso ⋅ Satoshi Nakae ⋅ Akihiro Sato ⋅ Takeshi Kurata ⋅ Laszlo Jeni
|
Tucson Ballroom & Prefunction Space 31 | |
|
Cluster-Guided Adversarial Perturbations for Robust Contrastive Learning
Poster Session 1
Seongyun Seo ⋅ Sungmin Han ⋅ Jeonghyun Lee ⋅ Sangkyun Lee
|
Tucson Ballroom & Prefunction Space 34 | |
|
Eff-GRot: Efficient and Generalizable Rotation Estimation with Transformers
Poster Session 1
Fanis Mathioulakis ⋅ Gorjan Radevski ⋅ Tinne Tuytelaars
|
Tucson Ballroom & Prefunction Space 40 | |
|
Interleaved Vision-and-Language Generation via Generative Voken
Poster Session 1
Kaizhi Zheng ⋅ Xuehai He ⋅ Xin Wang
|
Tucson Ballroom & Prefunction Space 46 | |
|
CraftSVG: Multi-Object Text-to-SVG Synthesis via Layout Guided Diffusion
Poster Session 2 + Refreshments
Ayan Banerjee ⋅ Nityanand Mathur ⋅ Josep Llados ⋅ Umapada Pal ⋅ Anjan Dutta
|
Tucson Ballroom & Prefunction Space 109 | |
|
Network-agnostic distortion-robust projections for wide-angle image understanding
Poster Session 1
Akshaya Athwale ⋅ Ola Ahmad ⋅ Jean-Francois Lalonde
|
Tucson Ballroom & Prefunction Space 57 | |
|
PS3: Part level instance segmentation in 3D
Poster Session 1
HONG-XUAN YEN ⋅ Chiamin Chen ⋅ Yanqing Wang ⋅ Yu-Lun Liu ⋅ Min Sun
|
Tucson Ballroom & Prefunction Space 86 | |
|
Root Completion from Intraoral Scans of Tooth Crowns using Diffusion with Patch Perturbation
Poster Session 1
Yohan Jang ⋅ In-Seok Song ⋅ Seung Baek
|
Tucson Ballroom & Prefunction Space 47 | |
|
ZonUI-3B: Competitive GUI Grounding with a 3B VLM Trained on a Single Consumer GPU
Poster Session 1
ZongHan Hsieh ⋅ SHENGJING YANG ⋅ TZER-JEN WEI
|
Tucson Ballroom & Prefunction Space 92 | |
|
HyperPose: Hyper-pose Embeddings for 3D-Aware Generative Models with Self-Supervised Disentangling of Pose and Scene
Poster Session 1
Mijeong Kim ⋅ Namgi Kim ⋅ Bohyung Han
|
Tucson Ballroom & Prefunction Space 97 | |
|
Diverse Sketch Colorization with Content-Enhanced Style Representation and Recolorization Distillation
Poster Session 1
Shuangming Mao ⋅ HaiXiang Zhu
|
Tucson Ballroom & Prefunction Space 102 | |
|
BanglaProtha: Evaluating Vision Language Models in Underrepresented Long-tail Cultural Contexts
Poster Session 1
Md Fahim ⋅ Md Sakib Ul Rahman Sourove ⋅ Akm Mazumder ⋅ Md Ishmam ⋅ Md Adib ⋅ Fariha Tanjim Shifat ⋅ Fabiha Haider ⋅ Md Bhuiyan
|
Tucson Ballroom & Prefunction Space 111 | |
|
ProSkill: Segment-Level Skill Assessment in Procedural Videos
Poster Session 4 + Reception
Michele Mazzamuto ⋅ Daniele Di Mauro ⋅ Gianpiero Francesca ⋅ Giovanni Farinella ⋅ Antonino Furnari
|
Tucson Ballroom & Prefunction Space 54 | |
|
Towards Fast and Scalable Normal Integration using Continuous Components
Poster Session 1
Francesco Milano ⋅ Jen Jen Chung ⋅ Lionel Ott ⋅ Roland Siegwart
|
Tucson Ballroom & Prefunction Space 23 | |
|
GHOST: Getting to the Bottom of Hallucinations with A Multi-round Consistency Benchmark
Poster Session 5
Vibashan VS ⋅ Nadine Chang ⋅ Jenny Schmalfuss ⋅ Vishal Patel ⋅ Zhiding Yu ⋅ Jose M. Alvarez
|
Tucson Ballroom & Prefunction Space 35 | |
|
QuadraNet V2: Efficient and Sustainable Training of High-Order Neural Networks with Quadratic Adaptation
Poster Session 1
Chenhui Xu ⋅ Fuxun Yu ⋅ Jinjun Xiong ⋅ Xiang Chen
|
Tucson Ballroom & Prefunction Space 131 | |
|
Identity Verification from Human Scent using Channel Representation of 2D Gas Chromatography-Mass Spectrometry Data
Poster Session 2 + Refreshments
Radim Spetlik ⋅ Jan Hlavsa ⋅ Jana Čechová ⋅ Petra Pojmanová ⋅ Jiri Matas ⋅ Štěpán Urban
|
Tucson Ballroom & Prefunction Space 6 | |
|
BrightRate: Quality Assessment for User-Generated HDR Videos
Poster Session 2 + Refreshments
Shreshth Saini ⋅ Bowen Chen ⋅ Yilin Wang ⋅ Neil Birkbeck ⋅ Balu Adsumilli ⋅ Alan Bovik
|
Tucson Ballroom & Prefunction Space 11 | |
|
Timestamp Query Transformer for Temporal Action Segmentation
Poster Session 4 + Reception
Tieqiao Wang ⋅ Sinisa Todorovic
|
Tucson Ballroom & Prefunction Space 70 | |
|
Inpaint360GS: Efficient Object-Aware 3D Inpainting via Gaussian Splatting for 360° Scenes
Poster Session 1
Shaoxiang Wang ⋅ Shihong Zhang ⋅ Christen Millerdurai ⋅ Rüdiger Westermann ⋅ Didier Stricker ⋅ Alain Pagani
|
Tucson Ballroom & Prefunction Space 12 | |
|
QCFace: Image Quality Control for boosting Face Representation & Recognition
Poster Session 2 + Refreshments
Duc-Phuong Doan-Ngo ⋅ Thanh-Dang Diep ⋅ Thanh Nguyen-Duc ⋅ Thanh-Sach LE ⋅ Nam Thoai
|
Tucson Ballroom & Prefunction Space 9 | |
|
Test-Time Adaptation for Video Highlight Detection Using Meta-Auxiliary Learning and Cross-Modality Hallucinations
Poster Session 5
Zahidul Islam ⋅ Sujoy Paul ⋅ Mrigank Rochan
|
Tucson Ballroom & Prefunction Space 104 | |
|
CycleSL: Server-Client Cyclical Update Driven Scalable Split Learning
Poster Session 2 + Refreshments
Mengdi Wang ⋅ Efe Bozkir ⋅ Enkelejda Kasneci
|
Tucson Ballroom & Prefunction Space 41 | |
|
Roadside Monocular 3D Detection Prompted by 2D Detection
Poster Session 2 + Refreshments
Yechi Ma ⋅ Wei Hua ⋅ Yanan Li ⋅ Shu Kong
|
Tucson Ballroom & Prefunction Space 46 | |
|
ASC: Learning Augmentation Severity-Consistent Representations Improves Generalization via Augmentation Search
Poster Session 2 + Refreshments
Amirhossein Alamdar ⋅ Hossein Jafarinia ⋅ Mahdi Nouri ⋅ Mohammad Rohban
|
Tucson Ballroom & Prefunction Space 49 | |
|
Semi-Supervised Hierarchical Open-Set Classification
Poster Session 2 + Refreshments
Erik Wallin ⋅ Fredrik Kahl ⋅ Lars Hammarstrand
|
Tucson Ballroom & Prefunction Space 55 | |
|
DoTA: Latent Distribution Conditioned Data Attribution for Diffusion Models
Poster Session 2 + Refreshments
Ninad Joshi ⋅ Vivek Srivastava ⋅ Shirish Karande
|
Tucson Ballroom & Prefunction Space 58 | |
|
Narrating For You: Prompt-guided Audio-visual Narrating Face Generation Employing Multi-entangled Latent Space
Poster Session 1
Aashish Chandra ⋅ Aashutosh A V ⋅ Abhijit Das
|
Tucson Ballroom & Prefunction Space 126 | |
|
LightGazeNet: A Lightweight GNN-based Architecture for Gaze Estimation
Poster Session 3
Heena Patel ⋅ Anirban Chowdhury ⋅ Pooja Choksy ⋅ Samiksha Pachade ⋅ Ajinkya Puar
|
Tucson Ballroom & Prefunction Space 76 | |
|
Zero-Shot Coreset Selection via Iterative Subspace Sampling
Poster Session 2 + Refreshments
Brent Griffin ⋅ Jacob Marks ⋅ Jason Corso
|
Tucson Ballroom & Prefunction Space 67 | |
|
BAFIS: Dataset + Framework to assess occupational Bias and Human Preference in modern Text-to-image Models
Poster Session 2 + Refreshments
Thomas Klassert ⋅ Adrian Ulges ⋅ Biying Fu
|
Tucson Ballroom & Prefunction Space 72 | |
|
High-Rate Mixout: Revisiting Mixout for Robust Domain Generalization
Poster Session 3
Masih Aminbeidokhti ⋅ Heitor Medeiros ⋅ Srikanth Muralidharan ⋅ Eric Granger ⋅ Marco Pedersoli
|
Tucson Ballroom & Prefunction Space 85 | |
|
CVP: Central-Peripheral Vision-Inspired Multimodal Model for Spatial Reasoning
Poster Session 2 + Refreshments
Zeyuan Chen ⋅ Xiang Zhang ⋅ Haiyang Xu ⋅ Jianwen Xie ⋅ Zhuowen Tu
|
Tucson Ballroom & Prefunction Space 84 | |
|
Discrete Facial Encoding: A Framework for Data-driven Facial Display Discovery
Poster Session 2 + Refreshments
Minh Tran ⋅ Maksim Siniukov ⋅ Zhangyu Jin ⋅ Mohammad Soleymani
|
Tucson Ballroom & Prefunction Space 89 | |
|
ScoliGaitX: A Deep Multi-Modal Fusion Network for Scoliosis Assessment via Gait Video Analysis
Poster Session 2 + Refreshments
Kaushik Vishwakarma ⋅ Aditya Nigam
|
Tucson Ballroom & Prefunction Space 94 | |
|
FlowMorph: Revealing an Optimizable Flow Latent Space for Controlled Image Morphing
Poster Session 2 + Refreshments
Yan Zheng ⋅ Yi Yang ⋅ Lanqing Guo ⋅ Zhangyang ”Atlas” Wang
|
Tucson Ballroom & Prefunction Space 99 | |
|
Zero-shot Hierarchical Plant Segmentation via Foundation Segmentation Models and Text-to-image Attention
Poster Session 2 + Refreshments
Junhao Xing ⋅ Ryohei Miyakawa ⋅ Yang Yang ⋅ Xinpeng Liu ⋅ Risa Shinoda ⋅ Hiroaki Santo ⋅ Yosuke Toda ⋅ Fumio Okura
|
Tucson Ballroom & Prefunction Space 104 | |
|
Moiré Zero: An Efficient and High-Performance Neural Architecture for Moiré Removal
Poster Session 2 + Refreshments
Seungryong Lee ⋅ Woojeong Baek ⋅ Younghyun Kim ⋅ Eunwoo Kim ⋅ Haru Moon ⋅ Donggon Yoo ⋅ Eunbyung Park
|
Tucson Ballroom & Prefunction Space 105 | |
|
A-V Representation Learning via Audio Shift Prediction for Multimodal Deepfake Detection and Temporal Localization
Poster Session 2 + Refreshments
Ashutosh Anshul ⋅ Eng Chng ⋅ Deepu Rajan
|
Tucson Ballroom & Prefunction Space 108 | |
|
MVAT: Multi-View Aware Teacher for Weakly Supervised 3D Object Detection
Poster Session 5
Saad Lahlali ⋅ Alexandre Montgieux ⋅ Nicolas Granger ⋅ Hervé Le Borgne ⋅ Quoc Cuong PHAM
|
Tucson Ballroom & Prefunction Space 29 | |
|
Evaluating Text-to-Image and Text-to-Video Synthesis with a Conditional Frechet Distance
Poster Session 2 + Refreshments
Jaywon Koo ⋅ Jefferson Hernandez ⋅ Moayed Haji-Ali ⋅ Ziyan Yang ⋅ Vicente Ordonez
|
Tucson Ballroom & Prefunction Space 61 | |
|
CineVerse: Consistent Keyframe Synthesis for Cinematic Scene Composition
Poster Session 2 + Refreshments
Quynh Phunh ⋅ Long Mai ⋅ Fabian Caba Heilbron ⋅ Feng Liu ⋅ Jia-Bin Huang ⋅ Cusuh Ham
|
Tucson Ballroom & Prefunction Space 115 | |
|
ConsensusXAI: A framework to examine class-wise agreement in medical imaging
Poster Session 2 + Refreshments
Abbas Haider ⋅ David Wright ⋅ Ruth Hogg ⋅ Hui Wang ⋅ Tunde Peto ⋅ Richard Gault
|
Tucson Ballroom & Prefunction Space 118 | |
|
Matching Semantically Similar Non-Identical Objects
Poster Session 2 + Refreshments
Yusuke Marumo ⋅ Kazuhiko Kawamoto ⋅ Satomi Tanaka ⋅ Shigenobu Hirano ⋅ Hiroshi Kera
|
Tucson Ballroom & Prefunction Space 127 | |
|
What Happens When: Learning Temporal Orders of Events in Videos
Poster Session 2 + Refreshments
Daechul Ahn ⋅ Yura Choi ⋅ Hyeonbeom Choi ⋅ Seongwon Cho ⋅ San Kim ⋅ Jonghyun Choi
|
Tucson Ballroom & Prefunction Space 130 | |
|
DiRe: Diversity-promoting Regularization for Dataset Condensation
Poster Session 2 + Refreshments
Saumyaranjan Mohanty ⋅ Aravind Reddy ⋅ Konda Reddy Mopuri
|
Tucson Ballroom & Prefunction Space 133 | |
|
Improved Wildfire Spread Prediction with Time-Series Data and the WSTS+ Benchmark
Poster Session 2 + Refreshments
Saad Lahrichi ⋅ Jake Bova ⋅ Jesse Johnson ⋅ Jordan Malof
|
Tucson Ballroom & Prefunction Space 140 | |
|
RAVU: Retrieval Augmented Video Understanding with Compositional Reasoning over Graph
Poster Session 2 + Refreshments
Sameer Malik ⋅ Ayush Singh ⋅ Moyuru Yamada ⋅ Dishank Aggarwal
|
Tucson Ballroom & Prefunction Space 138 | |
|
StreetView-Waste: A Multi-Task Dataset for Urban Waste Management
Poster Session 3
Diogo J. Paulo ⋅ João Martins ⋅ Hugo Proenca ⋅ João Neves
|
Tucson Ballroom & Prefunction Space 9 | |
|
Evaluating the Capability of Video Question Generation for Expert Knowledge Elicitation
Poster Session 3
Huaying Zhang ⋅ Atsushi Hashimoto ⋅ Tosho Hirasawa
|
Tucson Ballroom & Prefunction Space 12 | |
|
GASP: Unifying Geometric and Semantic Self-Supervised Pre-training for Autonomous Driving
Poster Session 3
William Ljungbergh ⋅ Adam Lilja ⋅ Adam Tonderski ⋅ Arvid Ling ⋅ Carl Lindström ⋅ Willem Verbeke ⋅ Junsheng Fu ⋅ Christoffer Petersson ⋅ Lars Hammarstrand ⋅ Michael Felsberg
|
Tucson Ballroom & Prefunction Space 15 | |
|
Gradient-Free Classifier Guidance for Diffusion Model Sampling
Poster Session 3
Rahul Shenoy ⋅ Zhihong Pan ⋅ Kaushik Balakrishnan ⋅ Qisen Cheng ⋅ Yongmoon Jeon ⋅ Heejune Yang ⋅ Jaewon Kim
|
Tucson Ballroom & Prefunction Space 23 | |
|
PointNet4D: A lightweight 4D Point Cloud Video Backbone for Online and Offline Perception in Robotic Applications
Poster Session 3
Yunze Liu ⋅ Zifan Wang ⋅ Peiran Wu ⋅ Jiayang Ao
|
Tucson Ballroom & Prefunction Space 27 | |
|
Show Me: Unifying Instructional Image and Video Generation with Diffusion Models
Poster Session 3
Yujiang Pu ⋅ Zhanbo Huang ⋅ Vishnu Boddeti ⋅ Yu Kong
|
Tucson Ballroom & Prefunction Space 35 | |
|
HEART-PFL: Stable Personalized Federated Learning under Heterogeneity with Hierarchical Directional Alignment and Adversarial Knowledge Transfer
Poster Session 3
Minjun Kim ⋅ Minje Kim
|
Tucson Ballroom & Prefunction Space 43 | |
|
Detecting Social Engagement of Elderly From Lifelog Image-streams to Identify Effective Cues for Autobiographic Recall
Poster Session 3
Vengateswaran Subramaniam ⋅ Vigneshwaran Subbaraju ⋅ Debaditya Roy ⋅ Pramath Krishna ⋅ Thivya Kandappu ⋅ Qianli Xu
|
Tucson Ballroom & Prefunction Space 44 | |
|
Data-Driven Loss Functions for Inference-Time Optimization in Text-to-Image
Poster Session 3
Sapir Esther Yiflach ⋅ Yuval Atzmon ⋅ Gal Chechik
|
Tucson Ballroom & Prefunction Space 58 | |
|
DOTGraph: CLIP-Driven Feature Disentanglement and Optimal Transport based Graph Learning for Few-Shot Segmentation
Poster Session 3
Shreya Biswas ⋅ Zhaozheng Yin
|
Tucson Ballroom & Prefunction Space 69 | |
|
ScoreNet: Netting Lightweight Quality Scores for Better Visual Assessment with Large Multi-Modality Models
Poster Session 5
Bahador Rashidi ⋅ Kiarash Aghakasiri ⋅ Shupei Zhang ⋅ Amirmohsen Sattarifard ⋅ Yue zhang ⋅ Chao Gao
|
Tucson Ballroom & Prefunction Space 115 | |
|
A Little More Like This: Text-to-Image Retrieval with Vision-Language Models Using Relevance Feedback
Poster Session 3
Bulat Khaertdinov ⋅ Mirela Popa ⋅ Nava Tintarev
|
Tucson Ballroom & Prefunction Space 87 | |
|
Online Episodic Memory Visual Query Localization with Egocentric Streaming Object Memory
Poster Session 3
Zaira Manigrasso ⋅ Matteo Dunnhofer ⋅ Antonino Furnari ⋅ Moritz Nottebaum ⋅ Antonio Finocchiaro ⋅ Marana Davide ⋅ Rosario Forte ⋅ Giovanni Farinella ⋅ Christian Micheloni
|
Tucson Ballroom & Prefunction Space 99 | |
|
LVM-Lite: Training Large Vision Models with Efficient Sequential Modeling
Poster Session 4 + Reception
Xianhang Li ⋅ Hongru Zhu ⋅ Sucheng Ren ⋅ Linjie Yang ⋅ Peng Wang ⋅ Heng Wang ⋅ Xiaohui Shen ⋅ Qing Liu ⋅ Cihang Xie
|
Tucson Ballroom & Prefunction Space 27 | |
|
Domain Generalizing DINO for Visual Regression via Latent Distractor Subspace Consistency
Poster Session 3
Nikhil Kumar Jangamreddy ⋅ Chetan Arora ⋅ Mahsa Baktashmotlagh
|
Tucson Ballroom & Prefunction Space 109 | |
|
TalkingHeadBench: A Multi-Modal Benchmark & Analysis of Talking-Head DeepFake Detection
Poster Session 3
Xinqi Xiong ⋅ Prakrut Patel ⋅ Qingyuan Fan ⋅ Amisha Wadhwa ⋅ Sarathy Selvam ⋅ Xiao Guo ⋅ Luchao Qi ⋅ Xiaoming Liu ⋅ Roni Sengupta
|
Tucson Ballroom & Prefunction Space 117 | |
|
Guided Texture Segmentation via Coordinate-Aware Class-Ratio Mapping
Poster Session 3
Bishal Swain ⋅ Kyung Cheoi ⋅ Jaepil Ko
|
Tucson Ballroom & Prefunction Space 128 | |
|
OMeGa: Joint Optimization of Explicit Meshes and Gaussian Splats for Robust Scene-Level Surface Reconstruction
Poster Session 4 + Reception
Yuhang Cao ⋅ Haojun Yan ⋅ Danya Yao
|
Tucson Ballroom & Prefunction Space 10 | |
|
Similarity-aware Probabilistic Embeddings Modeling for Video-Text Retrieval
Poster Session 4 + Reception
Yuliang Huang ⋅ Pengxu Wei ⋅ Zhicheng Dong ⋅ Liang Lin
|
Tucson Ballroom & Prefunction Space 16 | |
|
SIAM: Synchronous Interaction Attention for Human Mesh Recovery
Poster Session 4 + Reception
Niaz Ahmad ⋅ Saif Ullah ⋅ Youngmoon Lee ⋅ Guanghui Wang
|
Tucson Ballroom & Prefunction Space 24 | |
|
Transformer-Based Inpainting for Real-Time 3D Streaming in Sparse Multi-Camera Setups
Poster Session 4 + Reception
Leif V Holland ⋅ Domenic Zingsheim ⋅ Mana Takhsha ⋅ Hannah Dröge ⋅ Patrick Stotko ⋅ Markus Plack ⋅ Reinhard Klein
|
Tucson Ballroom & Prefunction Space 29 | |
|
LiDAR-DHMT: LiDAR-Adaptive Dual Hierarchical Mask Transformer for Robust Freespace Detection and Semantic Segmentation
Poster Session 1
Siyu Chen ⋅ Ting Han ⋅ Changshe Zhang ⋅ Xin Luo ⋅ Huan Chen ⋅ Meiliu Wu ⋅ Guorong Cai ⋅ jinhe su
|
Tucson Ballroom & Prefunction Space 120 | |
|
LASER: Lip Landmark Assisted Speaker Detection for Robustness
Poster Session 6 + Refreshments
Le Thien Phuc Nguyen ⋅ Zhuoran Yu ⋅ Yong Jae Lee
|
Tucson Ballroom & Prefunction Space 9 | |
|
Generalization of Real World Video Deblurring By Image-to-Image Translation
Poster Session 4 + Reception
Kassymzhomart Aitbek ⋅ Seungjoon Yang
|
Tucson Ballroom & Prefunction Space 40 | |
|
More Than Memory Savings: Zeroth-Order Optimization Mitigates Forgetting in Continual Learning
Poster Session 4 + Reception
Wanhao Yu ⋅ Zheng Wang ⋅ Shuteng Niu ⋅ Sen Lin ⋅ Li Yang
|
Tucson Ballroom & Prefunction Space 46 | |
|
CoL2A: Convolution-free Local Linear Attention for SpatioTemporal Event Processing
Poster Session 4 + Reception
Yusuke Sekikawa ⋅ Itsumi Araki ⋅ Jun Nagata ⋅ Andreu Girbau
|
Tucson Ballroom & Prefunction Space 56 | |
|
Patch-wise Retrieval: A Bag of Practical Techniques for Instance-level Matching
Poster Session 4 + Reception
Wonseok Choi ⋅ Sohwi Lim ⋅ Nam Hyeon-Woo ⋅ Moon Ye-Bin ⋅ Dong-ju Jeong ⋅ Jinyoung Hwang ⋅ Tae-Hyun Oh
|
Tucson Ballroom & Prefunction Space 61 | |
|
Geo3DVQA: Evaluating Vision-Language Models for 3D Geospatial Reasoning from Aerial Imagery
Poster Session 4 + Reception
Mai Tsujimoto ⋅ Junjue Wang ⋅ Weihao Xuan ⋅ Naoto Yokoya
|
Tucson Ballroom & Prefunction Space 68 | |
|
GrowTAS: Progressive Expansion from Small to Large Subnets for Efficient ViT Architecture Search
Poster Session 4 + Reception
Hyunju Lee ⋅ Youngmin Oh ⋅ Jeimin Jeon ⋅ Donghyeon Baek ⋅ Bumsub Ham
|
Tucson Ballroom & Prefunction Space 73 | |
|
Curve Skeletonization in Continuous domain for Meshes and Point Clouds
Poster Session 4 + Reception
Jai Bardhan ⋅ Ramya Hebbalaguppe ⋅ Aravind Udupa
|
Tucson Ballroom & Prefunction Space 76 | |
|
ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models
Poster Session 4 + Reception
Danae Sanchez Villegas ⋅ Ingo Ziegler ⋅ Desmond Elliott
|
Tucson Ballroom & Prefunction Space 81 | |
|
ST-Think: How Multimodal Large Language Models Reason About 4D Worlds from Ego-Centric Videos
Poster Session 4 + Reception
Peiran Wu ⋅ Yunze Liu ⋅ Miao Liu ⋅ Junxiao Shen
|
Tucson Ballroom & Prefunction Space 85 | |
|
Improving Out-of-Distribution Detection Using Segmented Images and Cross-View Attention Fusion
Poster Session 4 + Reception
Alexander Politowicz ⋅ Sahisnu Mazumder ⋅ Bing Liu
|
Tucson Ballroom & Prefunction Space 94 | |
|
Revisiting Vision–Language Foundations for No-Reference Image Quality Assessment
Poster Session 4 + Reception
ANKIT YADAV ⋅ Ta Duc Huy ⋅ Lingqiao Liu
|
Tucson Ballroom & Prefunction Space 108 | |
|
Learning Subglacial Bed Topography from Sparse Radar with Physics-Guided Residuals
Poster Session 4 + Reception
Bayu Tama ⋅ Jianwu Wang ⋅ Vandana Janeja ⋅ Mostafa Cham
|
Tucson Ballroom & Prefunction Space 111 | |
|
DPBridge: Latent Diffusion Bridge for Dense Prediction
Poster Session 4 + Reception
Haorui Ji ⋅ Tao Jun Lin ⋅ Hongdong Li
|
Tucson Ballroom & Prefunction Space 118 | |
|
CRISP: Cylindrical Rendering for In-Stream Point Clouds
Poster Session 4 + Reception
Hyungwoo Kang ⋅ Seonyoung Jang ⋅ YeoJun Yoon ⋅ Byungtae Oh
|
Tucson Ballroom & Prefunction Space 121 | |
|
KFS-Bench: Comprehensive Evaluation of Key Frame Sampling in Long Video Understanding
Poster Session 4 + Reception
Zongyao Li ⋅ Kengo Ishida ⋅ Satoshi Yamazaki ⋅ XIAOTONG JI ⋅ Jianquan Liu
|
Tucson Ballroom & Prefunction Space 130 | |
|
Style-Friendly SNR Sampler for Style-Driven Generation
Poster Session 4 + Reception
Jooyoung Choi ⋅ Chaehun Shin ⋅ Yeongtak Oh ⋅ Heeseung Kim ⋅ Jungbeom Lee ⋅ Sungroh Yoon
|
Tucson Ballroom & Prefunction Space 136 | |
|
ControlVP: Interactive Geometric Refinement of AI-Generated Images with Consistent Vanishing Points
Poster Session 4 + Reception
Ryota Okumura ⋅ Kaede Shiohara ⋅ Toshihiko Yamasaki
|
Tucson Ballroom & Prefunction Space 140 | |
|
Towards Egocentric 3D Hand Pose Estimation in Unseen Domains
Poster Session 4 + Reception
Wiktor Mucha ⋅ Michael Wray ⋅ Martin Kampel
|
Tucson Ballroom & Prefunction Space 143 | |
|
Motion-Aware Graph Fusion NetWork for 3D Human Pose Estimation
Poster Session 5
Yen Pham ⋅ Xiaohui Yuan ⋅ Chengyuan Zhuang
|
Tucson Ballroom & Prefunction Space 1 | |
|
SynchroRaMa : Lip-Synchronized and Emotion-Aware Talking Face Generation via Multi-Modal Emotion Embedding
Poster Session 4 + Reception
Phyo Thet Yee ⋅ Dimitrios Kollias ⋅ Sudeepta Mishra ⋅ Abhinav Dhall
|
Tucson Ballroom & Prefunction Space 25 | |
|
Conjuring Positive Pairs for Efficient Unification of Representation Learning and Image Synthesis
Poster Session 1
Imanol Estepa ⋅ Jesús Rodríguez-de-Vera ⋅ Ignacio Sarasua ⋅ Bhalaji Nagarajan ⋅ Petia Radeva
|
Tucson Ballroom & Prefunction Space 72 | |
|
IMKD: Intensity-Aware Multi-Level Knowledge Distillation for Camera-Radar Fusion
Poster Session 5
Shashank Mishra ⋅ Karan Patil ⋅ Didier Stricker ⋅ Jason Rambach
|
Tucson Ballroom & Prefunction Space 22 | |
|
MUSE: Model-based Uncertainty-aware Similarity Estimation for zero-shot 2D Object Detection and Segmentation
Poster Session 5
Sungmin Cho ⋅ Sungbum Park ⋅ Insoo Oh
|
Tucson Ballroom & Prefunction Space 28 | |
|
TM-Adapter: Temporal Merge Adapter for Efficient Global Temporal Modeling
Poster Session 5
WooJoo Hahm ⋅ Seungwoo Jang ⋅ Hyeon Kim ⋅ Daeun Lee ⋅ Kwangsu Kim
|
Tucson Ballroom & Prefunction Space 31 | |
|
SceneProp: Combining Neural Network and Markov Random Field for Scene-Graph Grounding
Poster Session 5
Keita Otani ⋅ Tatsuya Harada
|
Tucson Ballroom & Prefunction Space 34 | |
|
Reinforcement Learning-based Adaptive Control of Classifier-Free Guidance and Timestep Embeddings in Diffusion Models
Poster Session 1
Haochen You ⋅ Baojing Liu ⋅ Hongyang He
|
Tucson Ballroom & Prefunction Space 5 | |
|
Zero‑Shot Domain Generalisation via Prompt-Driven Feature Refinement
Poster Session 5
Tingrui Qiao ⋅ Di Zhao ⋅ Caroline Walker ⋅ Chris Cunningham ⋅ Yun Sing Koh
|
Tucson Ballroom & Prefunction Space 37 | |
|
GFT-GCN: Privacy-Preserving 3D Face Mesh Recognition with Spectral Diffusion
Poster Session 5
Hichem Felouat ⋅ Hanrui Wang ⋅ Isao Echizen
|
Tucson Ballroom & Prefunction Space 42 | |
|
Video and Language Alignment in 2D Systems for 3D Multi-object Scenes with Multi-Information Derivative-Free Control
Poster Session 5
Jason Armitage ⋅ Rico Sennrich
|
Tucson Ballroom & Prefunction Space 45 | |
|
ViSTA: Visual Storytelling using Multi-modal Adapters for Text-to-Image Diffusion Models
Poster Session 1
Sibo Dong ⋅ Ismail Shaheen ⋅ Maggie Shen ⋅ Rupayan Mallick ⋅ Sarah Bargal
|
Tucson Ballroom & Prefunction Space 2 | |
|
PSDiffusion: Harmonized Multi-Layer Image Generation via Layout and Appearance Alignment
Poster Session 3
Dingbang Huang ⋅ Wenbo Li ⋅ Yifei Zhao ⋅ Xinyu Pan ⋅ Yanhong Zeng ⋅ Bo Dai
|
Tucson Ballroom & Prefunction Space 30 | |
|
Mean-Shift Distillation for Diffusion Mode Seeking
Poster Session 5
Vikas Thamizharasan ⋅ Nikitas Chatzis ⋅ Iliyan Georgiev ⋅ Matthew Fisher ⋅ Evangelos Kalogerakis ⋅ Difan Liu ⋅ Nanxuan Zhao ⋅ Michal Lukáč
|
Tucson Ballroom & Prefunction Space 71 | |
|
FAIR-SIGHT: Fairness Assurance in Image Recognition via Simultaneous Conformal Thresholding and Dynamic Output Repair
Poster Session 5
Arya Fayyazi ⋅ Mehdi Kamal ⋅ Massoud Pedram
|
Tucson Ballroom & Prefunction Space 79 | |
|
Guiding What Not to Generate: Automated Negative Prompting for Text-Image Alignment
Poster Session 5
Sangha Park ⋅ Eunji Kim ⋅ Yeongtak Oh ⋅ Jooyoung Choi ⋅ Sungroh Yoon
|
Tucson Ballroom & Prefunction Space 82 | |
|
Correcting and Quantifying Systematic Errors in 3D Box Annotations for Autonomous Driving
Poster Session 5
Alexandre Justo Miro ⋅ Ludvig af Klinteberg ⋅ Bogdan Timus ⋅ Aron Asefaw ⋅ Ajinkya Khoche ⋅ Thomas Gustafsson ⋅ Sina Mansouri ⋅ Masoud DANESHTALAB
|
Tucson Ballroom & Prefunction Space 88 | |
|
S2O: Static to Openable Enhancement for Articulated 3D Objects
Poster Session 5
Hanxiao Jiang ⋅ Hanxiao Jiang ⋅ Yiming Zhang ⋅ Manolis Savva ⋅ Angel Chang
|
Tucson Ballroom & Prefunction Space 94 | |
|
PoseAdapt: Sustainable Human Pose Estimation via Continual Learning Benchmarks and Toolkit
Poster Session 5
Muhammad Saif Ullah Khan ⋅ Didier Stricker
|
Tucson Ballroom & Prefunction Space 99 | |
|
Multimodal Adversarial Defense for Vision-Language Models by Leveraging One-To-Many Relationships
Poster Session 5
Futa Waseda ⋅ Antonio Tejero-de-Pablos ⋅ Isao Echizen
|
Tucson Ballroom & Prefunction Space 111 | |
|
ForestSplats: Deformable transient field for Gaussian Splatting in the Wild
Poster Session 5
Wongi Park ⋅ Myeongseok Nam ⋅ Siwon Kim ⋅ Sangwoo Jo ⋅ Soomok Lee
|
Tucson Ballroom & Prefunction Space 112 | |
|
SGD-Mix: Enhancing Domain-Specific Image Classification with Label-Preserving Data Augmentation
Poster Session 5
Yixuan Dong ⋅ Fang-Yi Su ⋅ Jung-Hsien Chiang
|
Tucson Ballroom & Prefunction Space 119 | |
|
PALMS+: Modular Image-Based Floor Plan Localization Leveraging Depth Foundation Model
Poster Session 5
Yunqian Cheng ⋅ Benjamin Princen ⋅ Roberto Manduchi
|
Tucson Ballroom & Prefunction Space 122 | |
|
Knowledge to Sight: Reasoning over Visual Attributes via Knowledge Decomposition for Abnormality Grounding
Poster Session 2 + Refreshments
Jun Li ⋅ Che Liu ⋅ Wenjia Bai ⋅ Mingxuan Liu ⋅ Rossella Arcucci ⋅ Cosmin Bercea ⋅ Julia Schnabel
|
Tucson Ballroom & Prefunction Space 90 | |
|
AuViRe: Audio-visual Speech Representation Reconstruction for Deepfake Temporal Localization
Poster Session 5
Christos Koutlis ⋅ Symeon Papadopoulos
|
Tucson Ballroom & Prefunction Space 130 | |
|
T2VWorldBench: A Benchmark for Evaluating World Knowledge in Text-to-Video Generation
Poster Session 5
Yubin Chen ⋅ Xuyang Guo ⋅ Zhenmei Shi ⋅ Zhao Song ⋅ Jiahao Zhang
|
Tucson Ballroom & Prefunction Space 65 | |
|
IPTQ-ViT: Post-Training Quantization of Non-linear Functions for Integer-only Vision Transformers
Poster Session 6 + Refreshments
Gihwan Kim ⋅ Jemin Lee ⋅ Hyungshin Kim
|
Tucson Ballroom & Prefunction Space 16 | |
|
Locally Explaining Prediction Behavior via Gradual Interventions and Measuring Property Gradients
Poster Session 6 + Refreshments
Niklas Penzel ⋅ Joachim Denzler
|
Tucson Ballroom & Prefunction Space 19 | |
|
DM3Net: Dual-Camera Super-Resolution via Domain Modulation and Multi-scale Matching
Poster Session 6 + Refreshments
CONG GUAN ⋅ Jiacheng Ying ⋅ Osamu Yoshie ⋅ Yuya Ieiri
|
Tucson Ballroom & Prefunction Space 26 | |
|
3D Cell Oversegmentation Correction via Geo-Wasserstein Divergence
Poster Session 6 + Refreshments
Peter Chen ⋅ Bryan Chang ⋅ Olivia Creasey ⋅ Julie Sneddon ⋅ Zev Gartner ⋅ Yining Liu
|
Tucson Ballroom & Prefunction Space 32 | |
|
brat: Aligned Multi-View Embeddings for Brain MRI Analysis
Poster Session 5
Maxime Kayser ⋅ Maksim Gridnev ⋅ Wanting Wang ⋅ Max Bain ⋅ Aneesh Rangnekar ⋅ Avijit Chatterjee ⋅ Aleksandr Petrov ⋅ Harini Veeraraghavan ⋅ Nathaniel Swinburne
|
Tucson Ballroom & Prefunction Space 7 | |
|
MMHOI: Modeling Complex 3D Multi-Human Multi-Object Interactions
Poster Session 2 + Refreshments
Kaen Kazawa (Kogashi) ⋅ Anoop Cherian ⋅ Meng-Yu Jennifer Kuo
|
Tucson Ballroom & Prefunction Space 10 | |
|
Guided Model Merging for Hybrid Data Learning: Leveraging Centralized Data to Refine Decentralized Models
Poster Session 3
Junyi Zhu ⋅ Ruicong Yao ⋅ Taha Ceritli ⋅ Savas Ozkan ⋅ Matthew Blaschko ⋅ Eunchung Noh ⋅ Jeongwon Min ⋅ Cho Min ⋅ Mete Ozay
|
Tucson Ballroom & Prefunction Space 25 | |
|
MedPEFT-CL: Dual-Phase Parameter-Efficient Continual Learning with Medical Semantic Adapter and Bidirectional Memory Consolidation
Poster Session 6 + Refreshments
ZIYUAN GAO ⋅ Philippe Morel
|
Tucson Ballroom & Prefunction Space 48 | |
|
Test-Time Consistency in Vision Language Models
Poster Session 6 + Refreshments
Shih-Han Chou ⋅ Shivam Chandhok ⋅ James Little ⋅ Leonid Sigal
|
Tucson Ballroom & Prefunction Space 56 | |
|
DualRes: Production-ready Dynamic Object Detection
Poster Session 6 + Refreshments
Jibril hassani ⋅ Thomas Verelst
|
Tucson Ballroom & Prefunction Space 61 | |
|
FastPose-ViT: A Vision Transformer for Real-Time Spacecraft Pose Estimation
Poster Session 6 + Refreshments
Pierre Ancey ⋅ Andrew Price ⋅ Saqib Javed ⋅ Mathieu Salzmann
|
Tucson Ballroom & Prefunction Space 64 | |
|
SAVE: Sparse Autoencoder‑Driven Visual Information Enhancement for Mitigating Object Hallucination
Poster Session 6 + Refreshments
Sangha Park ⋅ Seungryong Yoo ⋅ Jisoo Mok ⋅ Sungroh Yoon
|
Tucson Ballroom & Prefunction Space 70 | |
|
Generalizing Sports Feedback Generation by Watching Competitions and Reading Books: A Rock Climbing Case Study
Poster Session 6 + Refreshments
Arushi Rai ⋅ Adriana Kovashka
|
Tucson Ballroom & Prefunction Space 89 | |
|
TriaGS: Differentiable Triangulation-Guided Geometric Consistency for 3D Gaussian Splatting
Poster Session 6 + Refreshments
Quan Hong ⋅ Tuan Dang
|
Tucson Ballroom & Prefunction Space 113 | |
|
Any Detector Can Detect Anything
Poster Session 6 + Refreshments
Thomas Huang ⋅ Siyuan Li ⋅ Martin Danelljan ⋅ Henghui Ding ⋅ Luc Van Gool ⋅ Fisher Yu
|
Tucson Ballroom & Prefunction Space 117 | |
|
SafeguardGS: 3D Gaussian Primitive Pruning While Avoiding Catastrophic Scene Destruction
Poster Session 6 + Refreshments
Yongjae Lee ⋅ Zhaoliang Zhang ⋅ Deliang Fan
|
Tucson Ballroom & Prefunction Space 121 | |
|
Scalpel: Fine-Grained Alignment of Attention Activation Manifolds via Mixture Gaussian Bridges to Mitigate Multimodal Hallucination
Poster Session 3
Ziqiang Shi ⋅ Rujie Liu ⋅ Shanshan Yu ⋅ Satoshi Munakata ⋅ Koichi Shirahata
|
Tucson Ballroom & Prefunction Space 5 | |
|
UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models
Poster Session 5
Lan Chen ⋅ Yuchao Gu ⋅ Qi Mao
|
Tucson Ballroom & Prefunction Space 91 | |
|
ENCORE : A Neural Collapse Perspective on Out-of-Distribution Detection in Deep Neural Networks
Poster Session 3
A. Q. M. Sazzad Sayyed ⋅ Nathaniel Bastian ⋅ Francesco Restuccia
|
Tucson Ballroom & Prefunction Space 3 | |
|
FlyPose: Towards Robust Human Pose Estimation From Aerial Views
Poster Session 6 + Refreshments
Hassaan Farooq ⋅ Marvin Brenner ⋅ Peter Stütz
|
Tucson Ballroom & Prefunction Space 134 | |
|
SVS-GAN for Semantic Synthesis of Traffic Videos for Autonomous Driving
Poster Session 6 + Refreshments
Khaled Seyam ⋅ Julian Wiederer ⋅ Markus Braun ⋅ Bin Yang
|
Tucson Ballroom & Prefunction Space 137 | |
|
FairScene: Learning Class-Disentangled 2D/3D Representations for Semantic Scene Completion
Poster Session 3
Dian Jia ⋅ Pei Yu ⋅ Wei Tang
|
Tucson Ballroom & Prefunction Space 81 | |
|
Towards Fine-Grained Adaptation of CLIP via a Self-Trained Alignment Score
Poster Session 5
Eman Ali ⋅ Sathira Silva ⋅ Chetan Arora ⋅ Muhammad Haris Khan
|
Tucson Ballroom & Prefunction Space 8 | |
|
Rethinking Latent Variable in Learned Image Compression
Poster Session 6 + Refreshments
Fangzhou Yi ⋅ Zhicheng Gong ⋅ Hui Zeng
|
Tucson Ballroom & Prefunction Space 126 | |
|
One-Cycle Structured Pruning via Stability-Driven Subnetwork Search
Poster Session 4 + Reception
Deepak Ghimire ⋅ Dayoung Kil ⋅ Sunghwan Jeong ⋅ Jaesik Park ⋅ Seong-heum Kim
|
Tucson Ballroom & Prefunction Space 113 | |
|
Frequency Is What You Need: Considering Word Frequency When Text Masking Benefits Vision-Language Model Pre-training
Poster Session 3
Mingliang Liang ⋅ Martha Larson
|
Tucson Ballroom & Prefunction Space 82 | |
|
SSMRadNet : A Sample-wise State-Space Framework for Efficient and Ultra-Light Radar Segmentation and Object Detection
Poster Session 4 + Reception
Anuvab Sen ⋅ Mir Sayeed Mohammad ⋅ Saibal Mukhopadhyay
|
Tucson Ballroom & Prefunction Space 8 | |
|
HOLO: Holistic Lightweight Optimization for Scene Understanding with Auto-Annotation and Multimodal Learning
Poster Session 6 + Refreshments
Xiaoyun Hu ⋅ Xiaohan Yan ⋅ Nan Wang ⋅ Gang Wei ⋅ Zhicheng Wang
|
Tucson Ballroom & Prefunction Space 50 | |
|
AEON: Adaptive Embedding Optimized Noise for Robust Watermarking in Diffusion Models
Poster Session 4 + Reception
Muhammad Muneer ⋅ Simon Woo
|
Tucson Ballroom & Prefunction Space 107 | |
|
Memory-Augmented Representation for Efficient Event-based Visuomotor Policy Learning with Adaptive Perception and Control
Poster Session 2 + Refreshments
Uday Kamal ⋅ Saibal Mukhopadhyay
|
Tucson Ballroom & Prefunction Space 112 | |
|
Hierarchical Instance Tracking to Balance Privacy Preservation with Accessible Information
Poster Session 5
Neelima Prasad ⋅ Jarek Reynolds ⋅ Neel Karsanbhai ⋅ Tanusree Sharma ⋅ Lotus Zhang ⋅ Abigale Stangl ⋅ Yang Wang ⋅ Leah Findlater ⋅ Danna Gurari
|
Tucson Ballroom & Prefunction Space 14 | |
|
FairVLM: Enhancing Fairness and Prompt Sensitivity in Vision Language Models for Medical Image Segmentation
Poster Session 6 + Refreshments
Md Motiur Rahman ⋅ Saeka Rahman ⋅ Smriti Bhatt ⋅ Miad Faezipour
|
Tucson Ballroom & Prefunction Space 24 | |
|
A Dataset and Framework for Learning State-invariant Object Representations
Poster Session 4 + Reception
Rohan Sarkar ⋅ Avinash Kak
|
Tucson Ballroom & Prefunction Space 41 | |
|
SuperRivolution: Fine-Scale Rivers from Coarse Temporal Satellite Imagery
Poster Session 6 + Refreshments
Rangel Daroya ⋅ Subhransu Maji
|
Tucson Ballroom & Prefunction Space 27 | |
|
SegMo: Segment-aligned Text to 3D Human Motion Generation
Poster Session 5
Bowen Dang ⋅ Lin Wu ⋅ Xiaohang Yang ⋅ Zheng Yuan ⋅ Zhixiang Chen
|
Tucson Ballroom & Prefunction Space 109 | |
|
TRACE: Confounder-free Adversarial Fine-tuning for Robust Object Detection
Poster Session 5
Wonho Lee ⋅ Jisu Lee ⋅ Hyunsik Na ⋅ Sohee Park ⋅ Daeseon Choi
|
Tucson Ballroom & Prefunction Space 86 | |
|
GeneVA: A Dataset of Human Annotations for Generative Text to Video Artifacts
Poster Session 5
Jenna Kang ⋅ Maria Silva ⋅ Patsorn Sangkloy ⋅ Kenneth Chen ⋅ Niall Williams ⋅ Qi Sun
|
Tucson Ballroom & Prefunction Space 36 | |
|
UCDSC: Open Set UnCertainty aware Deep Simplex Classifier for Medical Image Datasets
Poster Session 4 + Reception
Arnav Aditya ⋅ Nitin Kumar ⋅ Saurabh Shigwan
|
Tucson Ballroom & Prefunction Space 48 | |
|
ART-ASyn: Anatomy-aware Realistic Texture-based Anomaly Synthesis Framework for Chest X-Rays
Poster Session 3
Qinyi Cao ⋅ Jianan Fan ⋅ Weidong Cai
|
Tucson Ballroom & Prefunction Space 84 | |
|
Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models
Poster Session 3
Prin Phunyaphibarn ⋅ Phillip Lee ⋅ Jaihoon Kim ⋅ Minhyuk Sung
|
Tucson Ballroom & Prefunction Space 102 | |
|
Temporal Object Captioning for Street Scene Videos from LiDAR Tracks
Poster Session 2 + Refreshments
Vignesh Gopinathan ⋅ Urs Zimmermann ⋅ Michael Arnold ⋅ Matthias Rottmann
|
Tucson Ballroom & Prefunction Space 136 | |
|
Hybrid State Representation for Video Procedure Planning
Poster Session 3
Woo Suk Choi ⋅ Youwon Jang ⋅ Minsu Lee ⋅ Byoung-Tak Zhang
|
Tucson Ballroom & Prefunction Space 135 |
Successful Page Load