找回密码
 To register

QQ登录

只需一步,快速开始

扫一扫,访问微社区

Titlebook: Computer Vision – ECCV 2024; 18th European Confer Aleš Leonardis,Elisa Ricci,Gül Varol Conference proceedings 2025 The Editor(s) (if applic

[复制链接]
楼主: 拿着锡
发表于 2025-3-30 12:01:13 | 显示全部楼层
发表于 2025-3-30 12:29:11 | 显示全部楼层
,Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation, state-of-the-art performance with the efficient computation compared to the existing transformer-based semantic segmentation models in three public benchmarks, including ADE20K, Cityscapes and COCO-Stuff. Furthermore, our ISR method reduces the computational cost by up to 61% with minimal mIoU perf
发表于 2025-3-30 19:10:38 | 显示全部楼层
,VeCLIP: Improving CLIP Training via Visual-Enriched Captions,ive pipeline, we effortlessly scale our dataset up to 300 million samples named VeCap dataset. Our results show significant advantages in image-text alignment and overall model performance. For example, VeCLIP achieves up to . gain in COCO and Flickr30k retrieval tasks under the 12M setting. For dat
发表于 2025-3-30 23:07:33 | 显示全部楼层
发表于 2025-3-31 03:42:08 | 显示全部楼层
,Learning Representations from Foundation Models for Domain Generalized Stereo Matching,opose a cosine-constrained concatenation cost (C4) space to construct cost volumes. We integrate FormerStereo with state-of-the-art (SOTA) stereo matching networks and evaluate its effectiveness on multiple benchmark datasets. Experiments show that the FormerStereo framework effectively improves the
发表于 2025-3-31 08:13:04 | 显示全部楼层
,Spike-Temporal Latent Representation for Energy-Efficient Event-to-Video Reconstruction,esholding Algorithm. Then, the U-shape SNN decoder reconstructs the video based on the encoded spikes. Experimental results demonstrate that the STLR achieves performance comparable to popular SNNs on IJRR, HQF, and MVSEC datasets while significantly enhancing energy efficiency.
发表于 2025-3-31 10:44:04 | 显示全部楼层
发表于 2025-3-31 17:02:54 | 显示全部楼层
,Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts,rmore, we design a scheme utilizing Hash-Atlas to represent 3D scene views, which transfers the editing of 3D scenes onto 2D atlas images. This design achieves complete decoupling between the 2D editing and 3D reconstruction processes, enabling . to flexibly integrate a wide range of existing 2D or
发表于 2025-3-31 21:12:09 | 显示全部楼层
发表于 2025-4-1 01:15:28 | 显示全部楼层
,Look Hear: Gaze Prediction for Speech-Directed Human Attention,rs, from 220 participants performing our referral task. In our quantitative and qualitative analyses, ART not only outperforms existing methods in scanpath prediction, but also appears to capture several human attention patterns, such as waiting, scanning, and verification. Code and dataset are avai
 关于派博传思  派博传思旗下网站  友情链接
派博传思介绍 公司地理位置 论文服务流程 影响因子官网 SITEMAP 大讲堂 北京大学 Oxford Uni. Harvard Uni.
发展历史沿革 期刊点评 投稿经验总结 SCIENCEGARD IMPACTFACTOR 派博系数 清华大学 Yale Uni. Stanford Uni.
|Archiver|手机版|小黑屋| 派博传思国际 ( 京公网安备110108008328) GMT+8, 2025-6-27 17:16
Copyright © 2001-2015 派博传思   京公网安备110108008328 版权所有 All rights reserved
快速回复 返回顶部 返回列表