


18th ECCV 2024: Milan, Italy - Part LXXXI
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
  Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LXXXI. Lecture Notes in Computer Science 15139, Springer 2025, ISBN 978-3-031-73003-0
- Chenxi Liu, Zhenyi Wang, Tianyi Xiong, Ruibo Chen, Yihan Wu, Junfeng Guo, Heng Huang:
  Few-Shot Class Incremental Learning with Attention-Aware Self-adaptive Prompt. 1-18
- Liang Chen, Haozhe Zhao, Tianyu Liu, Shuai Bai, Junyang Lin, Chang Zhou, Baobao Chang:
  An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models. 19-35
- Xiaotian Song, Peng Zeng, Yanan Sun, Andy Song:
  Generalizable Symbolic Optimizer Learning. 36-52
- Keon-Hee Park, Hakyung Lee, Kyungwoo Song, Gyeong-Moon Park:
  Online Continuous Generalized Category Discovery. 53-69
- Shihao Zhao, Shaozhe Hao, Bojia Zi, Huaizhe Xu, Kwan-Yee K. Wong:
  Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation. 70-86
- Seunghoi Kim, Chen Jin, Tom Diethe, Matteo Figini, Henry F. J. Tregidgo, Asher Mullokandov, Philip Teare, Daniel C. Alexander:
  Tackling Structural Hallucination in Image Translation with Local Diffusion. 87-103
- Ping Wang, Yulun Zhang, Lishun Wang, Xin Yuan:
  Hierarchical Separable Video Transformer for Snapshot Compressive Imaging. 104-122
- Xiaoxuan He, Yifan Yang, Xinyang Jiang, Xufang Luo, Haoji Hu, Siyun Zhao, Dongsheng Li, Yuqing Yang, Lili Qiu:
  Unified Medical Image Pre-training in Language-Guided Common Semantic Space. 123-139
- Koh Jun Hao, Sy-Tuyen Ho, Ngoc-Bao Nguyen, Ngai-Man Cheung:
  On the Vulnerability of Skip Connections to Model Inversion Attacks. 140-157
- Daewon Choi, Jongheon Jeong, Huiwon Jang, Jinwoo Shin:
  Adversarial Robustification via Text-to-Image Diffusion Models. 158-177
- Yunfeng Fan, Wenchao Xu, Haozhao Wang, Fushuo Huo, Jinyu Chen, Song Guo:
  Overcome Modal Bias in Multi-modal Federated Learning via Balanced Modality Selection. 178-195
- Xianren Zhang, Dongwon Lee, Suhang Wang:
  Comprehensive Attribution: Inherently Explainable Vision Model with Feature Detector. 196-213
- Abhinav Narayan Harish, Larry Heck, Josiah P. Hanna, Zsolt Kira, Andrew Szot:
  Reinforcement Learning via Auxiliary Task Distillation. 214-230
- Sanghyun Jo, Fei Pan, In-Jae Yu, Kyungsu Kim:
  DHR: Dual Features-Driven Hierarchical Rebalancing in Inter- and Intra-Class Regions for Weakly-Supervised Semantic Segmentation. 231-248
- Hao Luo, Bohan Zhou, Zongqing Lu:
  Pre-trained Visual Dynamics Representations for Efficient Policy Learning. 249-267
- Haodi He, Colton Stearns, Adam W. Harley, Leonidas J. Guibas:
  View-Consistent Hierarchical 3D Segmentation Using Ultrametric Feature Fields. 268-286
- Tianyou Luo, Quan Yuan, Guiyang Luo, Yuchen Xia, Yujia Yang, Jinglin Li:
  Plug and Play: A Representation Enhanced Domain Adapter for Collaborative Perception. 287-303
- Yuchen Yang, Kwonjoon Lee, Behzad Dariush, Yinzhi Cao, Shao-Yuan Lo:
  Follow the Rules: Reasoning for Video Anomaly Detection with Large Language Models. 304-322
- Yi-Chia Chen, Wei-Hua Li, Cheng Sun, Yu-Chiang Frank Wang, Chu-Song Chen:
  SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation. 323-340
- Sanghyun Jo, Soohyun Ryu, Sungyub Kim, Eunho Yang, Kyungsu Kim:
  TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias. 341-357
- Yuchen Liang, Yuchan Tian, Lei Yu, Huaao Tang, Jie Hu, Xiangzhong Fang, Hanting Chen:
  Learning Quantized Adaptive Conditions for Diffusion Models. 358-374
- Yongcan Yu, Lijun Sheng, Ran He, Jian Liang:
  STAMP: Outlier-Aware Test-Time Adaptation with Stable Memory Replay. 375-392
- Shengjie Zhu, Girish Chandar Ganesan, Abhinav Kumar, Xiaoming Liu:
  RePLAy: Remove Projective LiDAR Depthmap Artifacts via Exploiting Epipolar Geometry. 393-411
- Xunjiang Gu, Guanyu Song, Igor Gilitschenski, Marco Pavone, Boris Ivanovic:
  Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature Attention. 412-428
- Jingfan Guo, Jae Shin Yoon, Shunsuke Saito, Takaaki Shiratori, Hyun Soo Park:
  High-Fidelity Modeling of Generalizable Wrinkle Deformation. 429-445
- Dongsheng Wang, Jiequan Cui, Miaoge Li, Wang Lin, Bo Chen, Hanwang Zhang:
  Instruction Tuning-Free Visual Token Complement for Multimodal LLMs. 446-462
