


default search action
18th ECCV 2024: Milan, Italy - Part LXXIV
- Ales Leonardis

, Elisa Ricci
, Stefan Roth
, Olga Russakovsky
, Torsten Sattler
, Gül Varol
:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LXXIV. Lecture Notes in Computer Science 15132, Springer 2025, ISBN 978-3-031-72903-4 - Kangqi Ma, Hao Dong, Yadong Mu:

Local Occupancy-Enhanced Object Grasping with Multiple Triplanar Projection. 1-18 - Meng Wang, Yuyao Huang

, Henghui Ding
, Xinlong Wang
, Tiejun Huang, Yao Zhao
, Yunchao Wei
, Shuicheng Yan
:
Region-Native Visual Tokenization. 19-36 - Mae Younes

, Amine Ouasfi, Adnane Boukhayma:
SparseCraft: Few-Shot Neural Reconstruction Through Stereopsis Guided Geometric Linearization. 37-56 - Fei Wang:

Sketch2Vox: Learning 3D Reconstruction from a Single Monocular Sketch. 57-73 - Minghao Chen, Iro Laina, Andrea Vedaldi:

DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing. 74-92 - Jiafeng Mao

, Xueting Wang
, Kiyoharu Aizawa
:
The Lottery Ticket Hypothesis in Denoising: Towards Semantic-Driven Initialization. 93-109 - Silvio Galesso

, Philipp Schröppel
, Hssan Driss, Thomas Brox:
Diffusion for Out-of-Distribution Detection on Road Scenes and Beyond. 110-126 - Zijie Jiang

, Tianhan Xu
, Hiroharu Kato
:
Rethinking Directional Parameterization in Neural Implicit Surface Reconstruction. 127-142 - Tianhe Wu

, Kede Ma
, Jie Liang
, Yujiu Yang
, Lei Zhang
:
A Comprehensive Study of Multimodal Large Language Models for Image Quality Assessment. 143-160 - Wulian Yun

, Mengshi Qi
, Fei Peng
, Huadong Ma
:
Semi-supervised Teacher-Reference-Student Architecture for Action Quality Assessment. 161-178 - Seungjun Shin

, Suji Kim
, Dokwan Oh
:
Efficient Neural Video Representation with Temporally Coherent Modulation. 179-195 - Yaoting Wang

, Peiwen Sun
, Dongzhan Zhou
, Guangyao Li
, Honggang Zhang
, Di Hu
:
Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes. 196-213 - Haoran Li

, Haolin Shi
, Wenli Zhang
, Wenjun Wu
, Yong Liao
, Lin Wang
, Lik-Hang Lee
, Peng Yuan Zhou
:
DreamScene: 3D Gaussian-Based Text-to-3D Scene Generation via Formation Pattern Sampling. 214-230 - Haoliang Meng, Xiaopeng Hong, Chenhao Wang, Miao Shang, Wangmeng Zuo:

Multi-modal Crowd Counting via a Broker Modality. 231-250 - Tianyu Zhang, Guocheng Qian

, Jin Xie, Jian Yang:
FastPCI: Motion-Structure Guided Fast Point Cloud Frame Interpolation. 251-267 - Charig Yang

, Weidi Xie
, Andrew Zisserman
:
Made to Order: Discovering Monotonic Temporal Changes via Self-supervised Video Ordering. 268-286 - Runzhao Yao

, Shaoyi Du
, Wenting Cui
, Canhui Tang
, Chengwu Yang
:
PARE-Net: Position-Aware Rotation-Equivariant Networks for Robust Point Cloud Registration. 287-303 - Guoqiang Zhao, Junjie Huang, Xiaoyun Yan, Zhaojing Wang, Junwei Tang, Yangjun Ou, Xinrong Hu, Tao Peng:

Open-Vocabulary RGB-Thermal Semantic Segmentation. 304-320 - Gabriele Moreno Berton, Lorenz Junglas, Riccardo Zaccone, Thomas Pollok, Barbara Caputo, Carlo Masone

:
MeshVPR: Citywide Visual Place Recognition Using 3D Meshes. 321-339 - Yaoting Wang

, Peiwen Sun
, Yuanchao Li
, Honggang Zhang
, Di Hu
:
Can Textual Semantics Mitigate Sounding Object Segmentation Preference? 340-356 - Raphael Sulzer, Florent Lafarge:

Concise Plane Arrangements for Low-Poly Surface and Volume Modelling. 357-373 - Hairong Jin

, Yuefan Shen
, Jianwen Lou
, Kun Zhou
, Youyi Zheng
:
KeypointDETR: An End-to-End 3D Keypoint Detector. 374-390 - Sogand Salehi

, Mahdi Shafiei
, Teresa Yeo
, Roman Bachmann
, Amir Zamir
:
ViPer: Visual Personalization of Generative Models via Individual Preference Learning. 391-406 - Jian Yang

, Jiakun Li
, Guoming Li
, Huai-Yu Wu
, Zhen Shen
, Zhaoxin Fan
:
MLPHand: Real Time Multi-view 3D Hand Reconstruction via MLP Modeling. 407-424 - A. Tuan Nguyen, Kai Sheng Tai, Bor-Chun Chen, Satya Narayan Shukla, Hanchao Yu, Philip Torr, Tai-Peng Tian, Ser-Nam Lim:

uCAP: An Unsupervised Prompting Method for Vision-Language Models. 425-439 - Dilxat Muhtar, Zhenshi Li, Feng Gu, Xueliang Zhang, Pengfeng Xiao:

LHRS-Bot: Empowering Remote Sensing with VGI-Enhanced Large Multimodal Language Model. 440-457 - Andrei Atanov, Jiawei Fu, Rishubh Singh, Isabella Yu, Andrew Spielberg, Amir Zamir:

How Far Can a 1-Pixel Camera Go? Solving Vision Tasks Using Photoreceptors and Computationally Designed Visual Morphology. 458-476

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














