VisionDocs: 3rd Workshop on Computer Vision Systems for Document Analysis and Recognition
Abstract
The rapid digitization of textual and visual information has made automated document analysis increasinglycritical across industrial, scientific, and cultural domains. Despite advances in computer vision, research largely focuses on limited document types and tasks, leaving challenges such as heterogeneous formats,low-resource languages, non-standard layouts, and historical documents largely unaddressed. This workshop aims to advance document understanding by exploring cutting-edge approaches, including generativemodeling, self-supervised learning, multimodal fusion, and few-shot adaptation. By fostering collaborationbetween computer vision researchers and domain experts, it seeks to promote solutions for generalizationacross diverse document types and low-data scenarios. Currently underrepresented at major computer visionvenues, document analysis will benefit from a dedicated forum connecting the WACV and ICDAR communities. VisionDocs will showcase state-of-the-art methods, stimulate cross-disciplinary exchange, and define new research directions, advancing both scientific understanding and practical AI applications in documentanalysis.