改良
发表于 2025-3-30 08:36:48
,Kinematic 3D Object Detection in Monocular Video,or detection, tracking, and depth perception, such features have not been thoroughly utilized in modern 3D object detectors. In this work, we propose a novel method for monocular video-based 3D object detection which leverages kinematic motion to extract scene dynamics and improve localization accur
显而易见
发表于 2025-3-30 15:48:29
Describing Unseen Videos via Multi-modal Cooperative Dialog Agents,AI with implicit information sources. To this end, in this paper, we introduce a new task called video description via two multi-modal cooperative dialog agents, whose ultimate goal is for one conversational agent to describe an unseen video based on the dialog and two static frames. Specifically, o
GEN
发表于 2025-3-30 19:20:30
http://reply.papertrans.cn/24/2343/234230/234230_53.png
宇宙你
发表于 2025-3-30 21:44:39
http://reply.papertrans.cn/24/2343/234230/234230_54.png
senile-dementia
发表于 2025-3-31 01:00:19
Know Your Surroundings: Exploiting Scene Information for Object Tracking,one to fail in case of e.g. fast appearance changes or presence of distractor objects, where a target appearance model alone is insufficient for robust tracking. Having the knowledge about the presence and locations of other objects in the surrounding scene can be highly beneficial in such cases. Th
暂时休息
发表于 2025-3-31 05:15:51
Practical Detection of Trojan Neural Networks: Data-Limited and Data-Free Cases, the Trojan attack (or poisoning backdoor attack). The lack of robustness of DNNs against Trojan attacks could significantly harm real-life machine learning (ML) systems in downstream applications, therefore posing widespread concern to their trustworthiness. In this paper, we study the problem of t
侵略
发表于 2025-3-31 09:42:49
Anatomy-Aware Siamese Network: Exploiting Semantic Asymmetry for Accurate Pelvic Fracture Detectionfrom medical images. So far, inadequate research attention has been received on effectively emulating this practice in computer-aided diagnosis (CAD) methods. In this work, we exploit semantic anatomical symmetry or asymmetry analysis in a complex CAD scenario, i.e., anterior pelvic fracture detecti
调整校对
发表于 2025-3-31 16:36:38
DeepLandscape: Adversarial Modeling of Landscape Videos, extends StyleGAN model by augmenting it with parts that allow to model dynamic changes in a scene. Once trained, our model can be used to generate realistic time-lapse landscape videos with moving objects and time-of-the-day changes. Furthermore, by fitting the learned models to a static landscape
施舍
发表于 2025-3-31 21:13:45
GANwriting: Content-Conditioned Generation of Styled Handwritten Word Images,dwritten words. On the contrary, when writing by hand, a great variability is observed across different writers, and even when analyzing words scribbled by the same individual, involuntary variations are conspicuous. In this work, we take a step closer to producing realistic and varied artificially
laparoscopy
发表于 2025-3-31 22:23:08
http://reply.papertrans.cn/24/2343/234230/234230_60.png