Document Understanding Dataset and Evaluation (DUDE)

Van Landeghem, Jordy; Tito, Ruben; Borchmann, Łukasz; Pietruszka, Michał; Joziak, Paweł; Powalski, Rafał; Jurkiewicz, Dawid; Coustaty, Mickael; Ackaert, Bertrand; Valveny, Ernest; Blaschko, Matthew; Moens, Sien; Stanisławek, Tomasz

doi:10.1109/ICCV51070.2023.01789

Proceedings IEEE/CVF international conference on computer vision - ICCV 2023

Keywords:

Science & Technology ; Technology ; Computer Science, Artificial Intelligence ; Computer Science, Theory & Methods ; Imaging Science & Photographic Technology ; Computer Science ; PSI_4802 ; PSI_MBL

Abstract:

We call on the Document AI (DocAI) community to reevaluate current methodologies and embrace the challenge of creating more practically oriented benchmarks. Document Understanding Dataset and Evaluation (DUDE) seeks to remediate the halted research progress in understanding visually-rich documents (VRDs). We present a new dataset with novelties related to types of questions, answers, and document layouts based on multi-industry, multi-domain, and multi-page VRDs of various origins and dates. Moreover, we are pushing the boundaries of current methods by creating multi-task and multi-domain evaluation setups that more accurately simulate real-world situations where powerful generalization and adaptation under low-resource settings are desired. DUDE aims to set a new standard as a more practical, long-standing benchmark for the community, and we hope that it will lead to future extensions and contributions that address real-world challenges. Finally, our work illustrates the importance of finding more efficient ways to model language, images, and layout in DocAI.
