


18th ECCV 2024: Milan, Italy - Part XXII
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XXII. Lecture Notes in Computer Science 15080, Springer 2025, ISBN 978-3-031-72669-9
- Kai Zhang, Sai Bi, Hao Tan, Yuanbo Xiangli, Nanxuan Zhao, Kalyan Sunkavalli, Zexiang Xu:
GS-LRM: Large Reconstruction Model for 3D Gaussian Splatting. 1-19
- Runyi Hu, Jie Zhang, Ting Xu, Jiwei Li, Tianwei Zhang:
Robust-Wide: Robust Watermarking Against Instruction-Driven Image Editing. 20-37
- Qiao Mo, Yukang Ding, Jinhua Hao, Qiang Zhu, Ming Sun, Chao Zhou, Feiyu Chen, Shuyuan Zhu:
OAPT: Offset-Aware Partition Transformer for Double JPEG Artifacts Removal. 38-56
- Ryosuke Yamada, Kensho Hara, Hirokatsu Kataoka, Koshi Makihara, Nakamasa Inoue, Rio Yokota, Yutaka Satoh:
Formula-Supervised Visual-Geometric Pre-training. 57-74
- Yue Fan, Xiaojian Ma, Rujie Wu, Yuntao Du, Jiaqi Li, Zhi Gao, Qing Li:
VideoAgent: A Memory-Augmented Multimodal Agent for Video Understanding. 75-92
- Guanghao Zheng, Yuchen Liu, Wenrui Dai, Chenglin Li, Junni Zou, Hongkai Xiong:
Towards Unified Representation of Invariant-Specific Features in Missing Modality Face Anti-spoofing. 93-110
- Shangquan Sun, Wenqi Ren, Xinwei Gao, Rui Wang, Xiaochun Cao:
Restoring Images in Adverse Weather Conditions via Histogram Transformer. 111-129
- Tongkun Guan, Chengyu Lin, Wei Shen, Xiaokang Yang:
PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer. 130-147
- Yubin Hu, Xiaoyang Guo, Yang Xiao, Jingwei Huang, Yong-Jin Liu:
NGP-RT: Fusing Multi-level Hash Features with Lightweight Attention for Real-Time Novel View Synthesis. 148-165
- Han Wang, Yongjie Ye, Yanjie Wang, Yuxiang Nie, Can Huang:
Elysium: Exploring Object-Level Perception in Videos via MLLM. 166-185
- Shuxiang Xie, Shuyi Zhou, Ken Sakurada, Ryoichi Ishikawa, Masaki Onishi, Takeshi Oishi:
G2fR: Frequency Regularization in Grid-Based Feature Encoding Neural Radiance Fields. 186-203
- Agneet Chatterjee, Gabriela Ben Melech Stan, Estelle Aflalo, Sayak Paul, Dhruba Ghosh, Tejas Gokhale, Ludwig Schmidt, Hannaneh Hajishirzi, Vasudev Lal, Chitta Baral, Yezhou Yang:
Getting it Right: Improving Spatial Consistency in Text-to-Image Models. 204-222
- Xueqi Ma, Yilin Liu, Wenjun Zhou, Ruowei Wang, Hui Huang:
Generating 3D House Wireframes with Semantics. 223-240
- Xiao Fu, Wei Yin, Mu Hu, Kaixuan Wang, Yuexin Ma, Ping Tan, Shaojie Shen, Dahua Lin, Xiaoxiao Long:
GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image. 241-258
- Yiyao Ma, Kai Chen, Hon-Sing Tong, Ruofeng Wei, Yui-Lun Ng, Ka-Wai Kwok, Qi Dou:
Shape-Guided Configuration-Aware Learning for Endoscopic-Image-Based Pose Estimation of Flexible Robotic Instruments. 259-276
- Jianan Wei, Tianfei Zhou, Yi Yang, Wenguan Wang:
Nonverbal Interaction Detection. 277-295
- Jian Zou, Tianyu Huang, Guanglei Yang, Zhenhua Guo, Tao Luo, Chun-Mei Feng, Wangmeng Zuo:
UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving. 296-313
- Minheng Ni, Yeli Shen, Lei Zhang, Wangmeng Zuo:
Responsible Visual Editing. 314-330
- Weijia Wu, Zhuang Li, Yuchao Gu, Rui Zhao, Yefei He, David Junhao Zhang, Mike Zheng Shou, Yan Li, Tingting Gao, Di Zhang:
DragAnything: Motion Control for Anything Using Entity Representation. 331-348
- Shuting He, Henghui Ding, Xudong Jiang, Bihan Wen:
SegPoint: Segment Any Point Cloud via Large Language Model. 349-367
- Sheng Fan, Rui Liu, Wenguan Wang, Yi Yang:
Navigation Instruction Generation with BEV Perception and Large Language Models. 368-387
- Taemin Park, Hyuck Lee, Heeyoung Kim:
Rebalancing Using Estimated Class Distribution for Imbalanced Semi-supervised Learning Under Class Distribution Mismatch. 388-404
- Qiuhong Shen, Xingyi Yang, Michael Bi Mi, Xinchao Wang:
Vista3D: Unravel the 3D Darkside of a Single Image. 405-421
- Yi Yao, Chan-Feng Hsu, Jhe-Hao Lin, Hongxia Xie, Terence Lin, Yi-Ning Huang, Hong-Han Shuai, Wen-Huang Cheng:
The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation. 422-438
- Junjie Huang, Yun Ye, Zhujin Liang, Yi Shan, Dalong Du:
Detecting as Labeling: Rethinking LiDAR-Camera Fusion in 3D Object Detection. 439-455
- Qiuhong Shen, Xingyi Yang, Xinchao Wang:
FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally. 456-472
- Guanting Dong, Yueyi Zhang, Xiaoyan Sun, Zhiwei Xiong:
Exploiting Dual-Correlation for Multi-frame Time-of-Flight Denoising. 473-489
