eczema 发表于 2025-3-23 12:02:09
,Overcome Modal Bias in Multi-modal Federated Learning via Balanced Modality Selection,isting client selection methods simply consider the mining of distributed uni-modal data, yet, their effectiveness may diminish in multi-modal FL (MFL) as the modality imbalance problem not only impedes the collaborative local training but also leads to a severe global modality-level bias. We empiriNIP 发表于 2025-3-23 14:49:23
,Comprehensive Attribution: Inherently Explainable Vision Model with Feature Detector,ribution method aims to enhance the understanding of model behavior by identifying the important regions in images that significantly contribute to predictions. It is achieved by cooperatively training a selector (generating an attribution map to identify important features) and a predictor (making强制性 发表于 2025-3-23 21:05:09
http://reply.papertrans.cn/25/2424/242338/242338_13.png展览 发表于 2025-3-23 23:24:14
http://reply.papertrans.cn/25/2424/242338/242338_14.pngacrophobia 发表于 2025-3-24 05:23:08
,Pre-trained Visual Dynamics Representations for Efficient Policy Learning,ilable and inhere a vast amount of prior world knowledge, the absence of action annotations and the common domain gap with downstream tasks hinder utilizing videos for RL pre-training. To address the challenge of ., we propose .re-trained .isual .ynamics .epresentations (PVDR) to bridge the domain gANN 发表于 2025-3-24 10:18:18
http://reply.papertrans.cn/25/2424/242338/242338_16.png枕垫 发表于 2025-3-24 13:29:05
http://reply.papertrans.cn/25/2424/242338/242338_17.png名义上 发表于 2025-3-24 17:53:05
,Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Models,little rationale behind detection, hindering public trust in real-world deployments. In this paper, we approach VAD with a reasoning framework. Although Large Language Models (LLMs) have shown revolutionary reasoning ability, we find that their direct use falls short of VAD. Specifically, the implicTOXIC 发表于 2025-3-24 19:39:53
http://reply.papertrans.cn/25/2424/242338/242338_19.png旅行路线 发表于 2025-3-25 00:21:14
0302-9743 ce on Computer Vision, ECCV 2024, held in Milan, Italy, during September 29–October 4, 2024...The 2387 papers presented in these proceedings were carefully reviewed and selected from a total of 8585 submissions. They deal with topics such as computer vision; machine learning; deep neural networks; r