Control-Group 发表于 2025-3-28 15:09:44

http://reply.papertrans.cn/25/2424/242357/242357_41.png

acrobat 发表于 2025-3-28 22:03:28

0302-9743 ce on Computer Vision, ECCV 2024, held in Milan, Italy, during September 29–October 4, 2024...The 2387 papers presented in these proceedings were carefully reviewed and selected from a total of 8585 submissions. They deal with topics such as computer vision; machine learning; deep neural networks; r

Intruder 发表于 2025-3-29 00:16:55

,Depth-Guided NeRF Training via Earth Mover’s Distance,e enough information to disambiguate between different possible geometries yielding the same image. Previous work has thus incorporated depth supervision during NeRF training, leveraging dense predictions from pre-trained depth networks as pseudo-ground truth. While these depth priors are assumed to

Trypsin 发表于 2025-3-29 06:55:43

http://reply.papertrans.cn/25/2424/242357/242357_44.png

寒冷 发表于 2025-3-29 08:43:18

http://reply.papertrans.cn/25/2424/242357/242357_45.png

EXTOL 发表于 2025-3-29 12:01:08

http://reply.papertrans.cn/25/2424/242357/242357_46.png

组装 发表于 2025-3-29 18:31:49

,Diagnosing and Re-learning for Balanced Multimodal Learning,he training of uni-modal encoders from different perspectives, taking the inter-modal performance discrepancy as the basis. However, the intrinsic limitation of modality capacity is ignored. The scarcely informative modalities can be recognized as “worse-learnt” ones, which could force the model to

altruism 发表于 2025-3-29 22:16:25

http://reply.papertrans.cn/25/2424/242357/242357_48.png

富饶 发表于 2025-3-30 01:10:48

,Elucidating the Hierarchical Nature of Behavior with Masked Autoencoders,ehavioral benchmarks, we create a novel synthetic basketball playing benchmark (Shot7M2). Beyond synthetic data, we extend BABEL into a hierarchical action segmentation benchmark (hBABEL). Then, we develop a masked autoencoder framework (hBehaveMAE) to elucidate the hierarchical nature of motion cap

aquatic 发表于 2025-3-30 06:10:54

BeyondScene: Higher-Resolution Human-Centric Scene Generation with Pretrained Diffusion,lenge stems from limited training image size, text encoder capacity (limited tokens), and the inherent difficulty of generating complex scenes involving multiple humans. While current methods attempted to address training size limit only, they often yielded human-centric scenes with severe artifacts
页: 1 2 3 4 [5] 6 7
查看完整版本: Titlebook: Computer Vision – ECCV 2024; 18th European Confer Aleš Leonardis,Elisa Ricci,Gül Varol Conference proceedings 2025 The Editor(s) (if applic