铁塔等 发表于 2025-3-23 12:50:50
,Completely Self-supervised Crowd Counting via Distribution Matching,ould learn good representations, they require some labeled data to map these features to the end task of density estimation. We mitigate this issue with the proposed paradigm of complete self-supervision, which does not need even a single labeled image. The only input required to train, apart from aRejuvenate 发表于 2025-3-23 17:11:41
http://reply.papertrans.cn/24/2343/234247/234247_12.png伪造者 发表于 2025-3-23 18:12:24
http://reply.papertrans.cn/24/2343/234247/234247_13.pngsurmount 发表于 2025-3-23 22:44:03
,Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition,pace of possible augmented data points either at random, without knowing which augmented points will be better, or through heuristics. We propose to learn what makes a “good” video for action recognition and select only high-quality samples for augmentation. In particular, we choose video compositinoverture 发表于 2025-3-24 04:27:32
http://reply.papertrans.cn/24/2343/234247/234247_15.png占线 发表于 2025-3-24 07:26:40
http://reply.papertrans.cn/24/2343/234247/234247_16.pngingestion 发表于 2025-3-24 12:41:13
,Improving Self-supervised Lightweight Model Learning via Hard-Aware Metric Distillation,ghtweight models, which is important for many mobile devices. To address this problem, we propose a method to improve the lightweight network (as student) via distilling the metric knowledge in a larger SSL model (as teacher). We exploit the relation between teacher and student to mine the positiveObligatory 发表于 2025-3-24 18:55:45
http://reply.papertrans.cn/24/2343/234247/234247_18.pngBother 发表于 2025-3-24 20:53:56
http://reply.papertrans.cn/24/2343/234247/234247_19.pngVentilator 发表于 2025-3-24 23:56:22
0302-9743 ruction; stereo vision; computational photography; neural networks; image coding; image reconstruction; object recognition; motion estimation..978-3-031-19820-5978-3-031-19821-2Series ISSN 0302-9743 Series E-ISSN 1611-3349