Document Understanding Dataset and Evaluation (DUDE)
Abstract
We call on the Document AI (DocAI) community to re-evaluate current methodologies and embrace the challenge of creating more practically-oriented benchmarks. Document Understanding Dataset and Evaluation (DUDE) seeks to remediate the halted research progress in understanding visually-rich documents (VRDs). We present a new dataset with novelties related to types of questions, answers, and document layouts based on multi-industry, multi-domain, and multi-page VRDs of various origins and dates. Moreover, we are pushing the boundaries of current methods by creating multi-task and multi-domain evaluation setups that more accurately simulate real-world situations where powerful generalization and adaptation under low-resource settings are desired. DUDE aims to set a new standard as a more practical, long-standing benchmark for the community, and we hope that it will lead to future extensions and contributions that address real-world challenges. Finally, our work illustrates the importance of finding more efficient ways to model language, images, and layout in DocAI.
Related Material
[pdf] [supp] [arXiv] [bibtex]
@InProceedings{Van_Landeghem_2023_ICCV,
    author    = {Van Landeghem, Jordy and Tito, Rub\`en and Borchmann, {\L}ukasz and Pietruszka, Micha{\l} and Joziak, Pawel and Powalski, Rafal and Jurkiewicz, Dawid and Coustaty, Mickael and Anckaert, Bertrand and Valveny, Ernest and Blaschko, Matthew and Moens, Sien and Stanislawek, Tomasz},
    title     = {Document Understanding Dataset and Evaluation (DUDE)},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2023},
    pages     = {19528-19540}
}