改良 posted on 2025-3-30 08:36:48
Kinematic 3D Object Detection in Monocular Video, ...or detection, tracking, and depth perception, such features have not been thoroughly utilized in modern 3D object detectors. In this work, we propose a novel method for monocular video-based 3D object detection which leverages kinematic motion to extract scene dynamics and improve localization accur...
显而易见 posted on 2025-3-30 15:48:29
Describing Unseen Videos via Multi-modal Cooperative Dialog Agents, ...AI with implicit information sources. To this end, in this paper, we introduce a new task called video description via two multi-modal cooperative dialog agents, whose ultimate goal is for one conversational agent to describe an unseen video based on the dialog and two static frames. Specifically, o...
GEN posted on 2025-3-30 19:20:30
http://reply.papertrans.cn/24/2343/234230/234230_53.png
宇宙你 posted on 2025-3-30 21:44:39
http://reply.papertrans.cn/24/2343/234230/234230_54.png
senile-dementia posted on 2025-3-31 01:00:19
Know Your Surroundings: Exploiting Scene Information for Object Tracking, ...one to fail in case of e.g. fast appearance changes or presence of distractor objects, where a target appearance model alone is insufficient for robust tracking. Having the knowledge about the presence and locations of other objects in the surrounding scene can be highly beneficial in such cases. Th...
暂时休息 posted on 2025-3-31 05:15:51
Practical Detection of Trojan Neural Networks: Data-Limited and Data-Free Cases, ...the Trojan attack (or poisoning backdoor attack). The lack of robustness of DNNs against Trojan attacks could significantly harm real-life machine learning (ML) systems in downstream applications, therefore posing widespread concern to their trustworthiness. In this paper, we study the problem of t...
侵略 posted on 2025-3-31 09:42:49
Anatomy-Aware Siamese Network: Exploiting Semantic Asymmetry for Accurate Pelvic Fracture Detection, ...from medical images. So far, inadequate research attention has been received on effectively emulating this practice in computer-aided diagnosis (CAD) methods. In this work, we exploit semantic anatomical symmetry or asymmetry analysis in a complex CAD scenario, i.e., anterior pelvic fracture detecti...
调整校对 posted on 2025-3-31 16:36:38
DeepLandscape: Adversarial Modeling of Landscape Videos, ...extends StyleGAN model by augmenting it with parts that allow to model dynamic changes in a scene. Once trained, our model can be used to generate realistic time-lapse landscape videos with moving objects and time-of-the-day changes. Furthermore, by fitting the learned models to a static landscape...
施舍 posted on 2025-3-31 21:13:45
GANwriting: Content-Conditioned Generation of Styled Handwritten Word Images, ...dwritten words. On the contrary, when writing by hand, a great variability is observed across different writers, and even when analyzing words scribbled by the same individual, involuntary variations are conspicuous. In this work, we take a step closer to producing realistic and varied artificially...
laparoscopy posted on 2025-3-31 22:23:08
http://reply.papertrans.cn/24/2343/234230/234230_60.png