Titlebook: Document Analysis and Recognition - ICDAR 2023; 17th International C Gernot A. Fink,Rajiv Jain,Richard Zanibbi Conference proceedings 2023

显示全部楼层 · 发表于 2025-3-29 00:22:43

E2TIMT: Efficient and Effective Modal Adapter for Text Image Machine Translation, both two-stage cascade and one-stage end-to-end architectures, suffer from different issues. The cascade models can benefit from the large-scale optical character recognition (OCR) and MT datasets but the two-stage architecture is redundant. The end-to-end models are efficient but suffer from trai

显示全部楼层 · 发表于 2025-3-29 03:09:06

Open-Set Text Recognition via Shape-Awareness Visual Reconstructionmpared to conventional counterparts, the OSTR task demands actively spotting and incrementally recognizing novel characters. Existing methods have demonstrated some success, yet confusion among similar characters remains to be a major challenge, potentially due to insufficient shape information pres

显示全部楼层 · 发表于 2025-3-29 10:58:21

Accelerating Transformer-Based Scene Text Detection and Recognition via Token Pruningnd all current state-of-the-art models and have achieved excellent performance. However, the computational requirements of the transformer architecture makes training these methods slow and resource heavy. In this paper, we introduce a new token pruning strategy that significantly decreases training

显示全部楼层 · 发表于 2025-3-29 15:21:34

Text Enhancement: Scene Text Recognition in Hazy Weatherer adverse weather conditions with poor visibility remains challenging. To address this problem, we propose a text image enhancement network that can be embedded into a scene text recognizer in a pluggable manner. This network comprises multiple sets of digital image processing (DIP) units, which ar

显示全部楼层 · 发表于 2025-3-29 18:40:03

Reading Between the Lanes: Text VideoQA on the Roadtion is a challenging problem, while textual cues typically appear for a short time span, and early detection at a distance is necessary. Systems that exploit such information to assist the driver should not only extract and incorporate visual and textual cues from the video stream but also reason o

		自动登录	找回密码
密码			To register

关于派博传思			派博传思旗下网站			友情链接
派博传思介绍	公司地理位置	论文服务流程	影响因子官网	吾爱论文网	大讲堂	北京大学	Oxford Uni.	Harvard Uni.
发展历史沿革	期刊点评	投稿经验总结	SCIENCEGARD	IMPACTFACTOR	派博系数	清华大学	Yale Uni.	Stanford Uni.
\|Archiver\|手机版\|小黑屋\| 派博传思国际 ( 京公网安备110108008328) GMT+8, 2026-6-19 17:30
Copyright © 2001-2015 派博传思京公网安备110108008328 版权所有 All rights reserved

Titlebook: Document Analysis and Recognition - ICDAR 2023; 17th International C Gernot A. Fink,Rajiv Jain,Richard Zanibbi Conference proceedings 2023

浏览过的版块