default search action
18th ECCV 2024: Milan, Italy - Part IV
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part IV. Lecture Notes in Computer Science 15062, Springer 2025, ISBN 978-3-031-73234-8 - Jiaxiang Tang, Zhaoxi Chen, Xiaokang Chen, Tengfei Wang, Gang Zeng, Ziwei Liu:
LGM: Large Multi-view Gaussian Model for High-Resolution 3D Content Creation. 1-18 - Qi Zhang, Kaiyi Zhang, Antoni B. Chan, Hui Huang:
Mahalanobis Distance-Based Multi-view Optimal Transport for Multi-view Crowd Localization. 19-36 - Ziteng Cui, Tatsuya Harada:
RAW-Adapter: Adapting Pre-trained Visual Model to Camera RAW Images. 37-56 - Kashyap Chitta, Daniel Dauner, Andreas Geiger:
SLEDGE: Synthesizing Driving Environments with Generative Models and Rule-Based Traffic. 57-74 - Adriano C. D'Alessandro, Ali Mahdavi-Amiri, Ghassan Hamarneh:
AFreeCA: Annotation-Free Counting for All. 75-91 - Junhao Dong, Piotr Koniusz, Junxi Chen, Yew-Soon Ong:
Adversarially Robust Distillation by Reducing the Student-Teacher Variance Gap. 92-111 - Yushi Lan, Fangzhou Hong, Shuai Yang, Shangchen Zhou, Xuyi Meng, Bo Dai, Xingang Pan, Chen Change Loy:
LN3Diff: Scalable Latent Neural Fields Diffusion for Speedy 3D Generation. 112-130 - Bohan Li, Jiajun Deng, Wenyao Zhang, Zhujin Liang, Dalong Du, Xin Jin, Wenjun Zeng:
Hierarchical Temporal Context Learning for Camera-Based Semantic Scene Completion. 131-148 - Xueyang Kang, Zhaoliang Luan, Kourosh Khoshelham, Bing Wang:
Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration. 149-167 - Chenxin Li, Xinyu Liu, Cheng Wang, Yifan Liu, Weihao Yu, Jing Shao, Yixuan Yuan:
GTP-4o: Modality-Prompted Heterogeneous Graph Learning for Omni-Modal Biomedical Representation. 168-187 - Fernando Julio Cendra, Bingchen Zhao, Kai Han:
PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery. 188-205 - Rawal Khirodkar, Timur M. Bagautdinov, Julieta Martinez, Su Zhaoen, Austin James, Peter Selednik, Stuart Anderson, Shunsuke Saito:
Sapiens: Foundation for Human Vision Models. 206-228 - Sehyung Lee, Mijung Kim, Yeongnam Chae, Björn Stenger:
Linearly Controllable GAN: Unsupervised Feature Categorization and Decomposition for Image Generation and Manipulation. 229-245 - Hongwei Yi, Justus Thies, Michael J. Black, Xue Bin Peng, Davis Rempe:
Generating Human Interaction Motions in Scenes with Text Control. 246-263 - Artur Jesslen, Guofeng Zhang, Angtian Wang, Wufei Ma, Alan L. Yuille, Adam Kortylewski:
NOVUM: Neural Object Volumes for Robust Object Classification. 264-281 - Kun Yang, Dingkang Yang, Ke Li, Dongling Xiao, Zedian Shao, Peng Sun, Liang Song:
Align Before Collaborate: Mitigating Feature Misalignment for Robust Multi-agent Perception. 282-299 - Xintao Lv, Liang Xu, Yichao Yan, Xin Jin, Congsheng Xu, Shuwen Wu, Yifan Liu, Lincheng Li, Mengxiao Bi, Wenjun Zeng, Xiaokang Yang:
HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects. 300-318 - Canyu Zhang, Xiaoguang Li, Qing Guo, Song Wang:
SAIR: Learning Semantic-Aware Implicit Representation. 319-335 - Yixin Yang, Jiangxin Dong, Jinhui Tang, Jinshan Pan:
ColorMNet: A Memory-Based Deep Spatial-Temporal Feature Propagation Network for Video Colorization. 336-352 - Mert Bülent Sariyildiz, Philippe Weinzaepfel, Thomas Lucas, Diane Larlus, Yannis Kalantidis:
UNIC: Universal Classification Models via Multi-teacher Distillation. 353-371 - Arpit Garg, Cuong Nguyen, Rafael Felix, Thanh-Toan Do, Gustavo Carneiro:
Instance-Dependent Noisy-Label Learning with Graphical Model Based Noise-Rate Estimation. 372-389 - Lang Nie, Chunyu Lin, Kang Liao, Yun Zhang, Shuaicheng Liu, Rui Ai, Yao Zhao:
Eliminating Warping Shakes for Unsupervised Online Video Stitching. 390-407 - Haoran Wei, Lingyu Kong, Jinyue Chen, Liang Zhao, Zheng Ge, Jinrong Yang, Jianjian Sun, Chunrui Han, Xiangyu Zhang:
Vary: Scaling up the Vision Vocabulary for Large Vision-Language Model. 408-424 - En Yu, Liang Zhao, Yana Wei, Jinrong Yang, Dongming Wu, Lingyu Kong, Haoran Wei, Tiancai Wang, Zheng Ge, Xiangyu Zhang, Wenbing Tao:
Merlin: Empowering Multimodal LLMs with Foresight Minds. 425-443 - Jefferson Hernandez, Ruben Villegas, Vicente Ordonez:
ViC-MAE: Self-supervised Representation Learning from Images and Video with Contrastive Masked Autoencoders. 444-463 - Robin Courant, Nicolas Dufour, Xi Wang, Marc Christie, Vicky Kalogeiton:
E.T. the Exceptional Trajectories: Text-to-Camera-Trajectory Generation with Character Awareness. 464-480 - Ming Hu, Peng Xia, Lin Wang, Siyuan Yan, Feilong Tang, Zhongxing Xu, Yimin Luo, Kaimin Song, Jürgen Leitner, Xuelian Cheng, Jun Cheng, Chi Liu, Kaijing Zhou, Zongyuan Ge:
OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding. 481-500
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.