用户名  找回密码
 To register

QQ登录

只需一步,快速开始

扫一扫,访问微社区

Titlebook: Document Analysis and Recognition - ICDAR 2024; 18th International C Elisa H. Barney Smith,Marcus Liwicki,Liangrui Peng Conference proceedi

[复制链接]
楼主: 召唤
发表于 2025-3-28 14:35:00 | 显示全部楼层
发表于 2025-3-28 19:20:37 | 显示全部楼层
Font Impression Estimation in the Wildssions and a convolutional neural network (CNN) framework for this task. However, impressions attached to individual fonts are often missing and noisy because of the subjective characteristic of font impression annotation. To realize stable impression estimation even with such a dataset, we propose
发表于 2025-3-29 00:00:42 | 显示全部楼层
Typographic Text Generation with Off-the-Shelf Diffusion Modelted texts render them insufficient in the realm of typographic design. This paper proposes a typographic text generation system to add and modify text on typographic designs while specifying font styles, colors, and text effects. The proposed system is a novel combination of two off-the-shelf method
发表于 2025-3-29 05:59:31 | 显示全部楼层
Impression-CLIP: Contrastive Shape-Impression Embedding for Fontsression is weak and unstable because impressions are subjective. To capture such weak and unstable cross-modal correlation between font shapes and their impressions, we propose Impression-CLIP, which is a novel machine-learning model based on CLIP (Contrastive Language-Image Pre-training). By using
发表于 2025-3-29 08:17:07 | 显示全部楼层
发表于 2025-3-29 14:52:55 | 显示全部楼层
Script Identification in the Wild with FFT-Multi-grained Mix Attention Transformerfferent scripts. Specifically, scene text-based script identification is challenged by inter-language similarities, complex backgrounds, and diverse text styles. To address the above problem, we use FFT Block to map the token to the frequency domain and decompose it into multiple frequency component
发表于 2025-3-29 15:47:57 | 显示全部楼层
SAGHOG: Self-supervised Autoencoder for Generating HOG Features for Writer Retrievalg involves the application of the Segment Anything technique to extract handwriting from various datasets, ending up with about 24k documents, followed by training a vision transformer on reconstructing masked patches of the handwriting. . is then finetuned by appending NetRVLAD as an encoding layer
发表于 2025-3-29 22:09:03 | 显示全部楼层
Analysis of the Calibration of Handwriting Text Recognition Modelsable when facing new data. In this context, it is essential to correctly estimate an approximate error of the target predictions. To achieve this, the model must be well calibrated, meaning that the confidence values are sufficiently representative of the expected accuracy. Calibration is a crucial
发表于 2025-3-30 02:15:13 | 显示全部楼层
发表于 2025-3-30 07:52:25 | 显示全部楼层
 关于派博传思  派博传思旗下网站  友情链接
派博传思介绍 公司地理位置 论文服务流程 影响因子官网 SITEMAP 大讲堂 北京大学 Oxford Uni. Harvard Uni.
发展历史沿革 期刊点评 投稿经验总结 SCIENCEGARD IMPACTFACTOR 派博系数 清华大学 Yale Uni. Stanford Uni.
|Archiver|手机版|小黑屋| 派博传思国际 ( 京公网安备110108008328) GMT+8, 2025-5-25 18:14
Copyright © 2001-2015 派博传思   京公网安备110108008328 版权所有 All rights reserved
快速回复 返回顶部 返回列表