找回密码
 To register

QQ登录

只需一步,快速开始

扫一扫,访问微社区

Titlebook: Computer Vision – ECCV 2024; 18th European Confer Aleš Leonardis,Elisa Ricci,Gül Varol Conference proceedings 2025 The Editor(s) (if applic

[复制链接]
楼主: introspective
发表于 2025-3-23 13:33:56 | 显示全部楼层
Innere Welten und äußere Realitätene solution for explainable human visual scanpath prediction. Extensive experiments on diverse eye-tracking datasets demonstrate the effectiveness of GazeXplain in both scanpath prediction and explanation, offering valuable insights into human visual attention and cognitive processes.
发表于 2025-3-23 17:00:38 | 显示全部楼层
Vom Älterwerden des Psychoanalytikers to counterfactual scenarios. This enables LVLMs to explicitly reason step-by-step rather than relying on biased knowledge, leading to more generalizable solutions. Our extensive evaluation demonstrates that CoCT outperforms existing approaches on tasks requiring reasoning under knowledge bias. Our
发表于 2025-3-23 21:56:06 | 显示全部楼层
,Walker: Self-supervised Multiple Object Tracking by Walking on Temporal Appearance Graphs, first self-supervised tracker to achieve competitive performance on MOT17, DanceTrack, and BDD100K. Remarkably, our proposal outperforms the previous self-supervised trackers even when drastically reducing the annotation requirements by up to 400..
发表于 2025-3-24 00:18:55 | 显示全部楼层
,Spatio-Temporal Proximity-Aware Dual-Path Model for Panoramic Activity Recognition,e comprises individual-to-global and individual-to-social paths, mutually reinforcing each other’s task with global-local context through multiple layers. Through extensive experiments, we validate the effectiveness of the spatio-temporal proximity among individuals and the dual-path architecture in
发表于 2025-3-24 02:21:34 | 显示全部楼层
发表于 2025-3-24 10:06:46 | 显示全部楼层
,FSD-BEV: Foreground Self-distillation for Multi-view 3D Object Detection,ome distillation strategies. Additionally, we design two Point Cloud Intensification (PCI) strategies to compensate for the sparsity of point clouds by frame combination and pseudo point assignment. Finally, we develop a Multi-Scale Foreground Enhancement (MSFE) module to extract and fuse multi-scal
发表于 2025-3-24 12:08:22 | 显示全部楼层
,MATHVERSE: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?, addition, we propose a Chain-of-Thought (CoT) evaluation strategy for a fine-grained assessment of the output answers. Rather than naively judging true or false, we employ GPT-4(V) to adaptively assess each step with error analysis to derive a total score, which can reveal the inner CoT reasoning q
发表于 2025-3-24 16:44:09 | 显示全部楼层
See and Think: Embodied Agent in Virtual Environment,wledge question-answering pairs, and 200+ skill-code pairs. We conduct continuous block search, knowledge question and answering, and tech tree mastery to evaluate the performance. Extensive experiments show that STEVE achieves at most 1.5. faster unlocking key tech trees and 2.5. quicker in block s
发表于 2025-3-24 19:57:38 | 显示全部楼层
发表于 2025-3-24 23:25:32 | 显示全部楼层
,VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding,ure enhancements with a novel pre-training task, using language masking on a snippet of the document text fed to the visual encoder in place of the prompt, to empower the model with focusing capabilities. Consequently, VisFocus learns to allocate its attention to text patches pertinent to the provid
 关于派博传思  派博传思旗下网站  友情链接
派博传思介绍 公司地理位置 论文服务流程 影响因子官网 SITEMAP 大讲堂 北京大学 Oxford Uni. Harvard Uni.
发展历史沿革 期刊点评 投稿经验总结 SCIENCEGARD IMPACTFACTOR 派博系数 清华大学 Yale Uni. Stanford Uni.
|Archiver|手机版|小黑屋| 派博传思国际 ( 京公网安备110108008328) GMT+8, 2025-6-28 08:08
Copyright © 2001-2015 派博传思   京公网安备110108008328 版权所有 All rights reserved
快速回复 返回顶部 返回列表