轻触 发表于 2025-3-25 04:32:51

http://reply.papertrans.cn/29/2849/284817/284817_21.png

完成才会征服 发表于 2025-3-25 09:08:37

http://reply.papertrans.cn/29/2849/284817/284817_22.png

娴熟 发表于 2025-3-25 13:58:51

TrOCR Meets Language Models: An End-to-End Post-correction Approachances visual and linguistic information, preserving the authenticity of the original texts. Furthermore, the model is able to adapt to historical data even when the recogniser is trained solely on contemporary data, mitigating the need for a large number of annotated historical handwritten images.

Costume 发表于 2025-3-25 17:50:01

: Domain Adaptive Document Restoration with a Layer Separation Approach qualitatively and quantitatively using a new real-world dataset, ., developed for this study. Initially trained on a synthetically generated dataset, our model demonstrates strong generalization capabilities for the DIR task, offering a promising solution for handling variability in real-world data. Our code is accessible on this GitHub(.).

prosthesis 发表于 2025-3-25 20:12:16

Investigating Neural Networks and Transformer Models for Enhanced Comic Decoding (eBDtheque, DCM772, Manga109) and using different metrics (Precision, Recall, Average Precision), we conclude that pre-trained self-supervised transformer models can competently outperform state of the art approaches, which often require further fine-tuning to achieve comparable results.

即席 发表于 2025-3-26 01:09:21

http://reply.papertrans.cn/29/2849/284817/284817_26.png

不可接触 发表于 2025-3-26 04:28:17

Normalized vs Diplomatic Annotation: A Case Study of Automatic Information Extraction from Handwritte than 15 different writers) but with different annotation methods. Our findings indicate that normalized annotation is more effective for fields that can be standardized, such as dates and places of birth, whereas diplomatic annotation performs much better for fields containing names and surnames, which can not be standardized.

Itinerant 发表于 2025-3-26 09:31:51

http://reply.papertrans.cn/29/2849/284817/284817_28.png

Infinitesimal 发表于 2025-3-26 15:23:00

ion process is provided, in order to render the materials in question machine-readable, while in the second part the potential for linguistic research is highlighted, through a case-study exploring aspects of the ‘Greek language question’, as discussed in the parliamentary context, within the wider framework of language policy making.

碳水化合物 发表于 2025-3-26 17:50:37

Peter C. Maloney,E. R. Kashket,T. H. Wilsonon including character’s appearance, posture, mood, dialogues etc. We believe that such enriched content description can be easily used to produce audiobook and eBook with various voices for characters, captions and playing sound effects.
页: 1 2 [3] 4 5 6
查看完整版本: Titlebook: Document Analysis and Recognition – ICDAR 2024 Workshops; Athens, Greece, Augu Harold Mouchère,Anna Zhu Conference proceedings 2024 The Edi