


default search action
18th ECCV 2024: Milan, Italy - Part XXXVIII
- Ales Leonardis

, Elisa Ricci
, Stefan Roth
, Olga Russakovsky
, Torsten Sattler
, Gül Varol
:
Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XXXVIII. Lecture Notes in Computer Science 15096, Springer 2025, ISBN 978-3-031-72919-5 - Luchuan Song, Pinxin Liu, Lele Chen, Guojun Yin, Chenliang Xu:

Tri2-plane: Thinking Head Avatar via Feature Pyramid. 1-20 - Yuzhong Zhao, Yue Liu, Zonghao Guo, Weijia Wu, Chen Gong, Qixiang Ye, Fang Wan:

ControlCap: Controllable Region-Level Captioning. 21-38 - Jilong Wang

, Saihui Hou
, Yan Huang
, Chunshui Cao
, Xu Liu
, Yongzhen Huang
, Tianzhu Zhang
, Liang Wang
:
Free Lunch for Gait Recognition: A Novel Relation Descriptor. 39-56 - Weitai Kang

, Gaowen Liu, Mubarak Shah, Yan Yan:
SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding. 57-75 - Xiaoran Zhang

, John C. Stendahl
, Lawrence H. Staib
, Albert J. Sinusas
, Alex Wong
, James S. Duncan
:
Adaptive Correspondence Scoring for Unsupervised Medical Image Registration. 76-92 - Nithin Gopalakrishnan Nair, Jeya Maria Jose Valanarasu

, Vishal M. Patel
:
MaxFusion: Plug&Play Multi-modal Generation in Text-to-Image Diffusion Models. 93-110 - Ashkan Mirzaei, Tristan Aumentado-Armstrong, Marcus A. Brubaker, Jonathan Kelly, Alex Levinshtein, Konstantinos G. Derpanis, Igor Gilitschenski:

Watch Your Steps: Local Image and Scene Editing by Text Instructions. 111-129 - Hritam Basak, Zhaozheng Yin:

Forget More to Learn More: Domain-Specific Feature Unlearning for Semi-supervised and Unsupervised Domain Adaptation. 130-148 - Anh Thai, Weiyao Wang, Hao Tang, Stefan Stojanov, James M. Rehg, Matt Feiszli:

3˟ 2: 3D Object Part Segmentation by 2D Semantic Correspondences. 149-166 - Zhengyuan Yang

, Jianfeng Wang
, Linjie Li, Kevin Lin, Chung-Ching Lin
, Zicheng Liu
, Lijuan Wang
:
Idea2Img: Iterative Self-refinement with GPT-4V for Automatic Image Design and Generation. 167-184 - Gustavo Pérez

, Daniel Sheldon, Grant Van Horn, Subhransu Maji
:
Human-in-the-Loop Visual Re-ID for Population Size Estimation. 185-202 - Lingchen Meng, Shiyi Lan, Hengduo Li, José M. Álvarez, Zuxuan Wu, Yu-Gang Jiang:

SegIC: Unleashing the Emergent Correspondence for In-Context Segmentation. 203-220 - Weiwei Sun

, Eduard Trulls
, Yang-Che Tseng, Sneha Sambandam, Gopal Sharma, Andrea Tagliasacchi
, Kwang Moo Yi
:
PointNeRF++: A Multi-scale, Point-Based Neural Radiance Field. 221-238 - Junfei Xiao, Ziqi Zhou, Wenxuan Li, Shiyi Lan, Jieru Mei, Zhiding Yu, Bingchen Zhao, Alan L. Yuille, Yuyin Zhou, Cihang Xie:

A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties. 239-258 - Bowen Shi, Peisen Zhao, Zichen Wang, Yuhang Zhang, Yaoming Wang, Jin Li, Wenrui Dai, Junni Zou, Hongkai Xiong, Qi Tian, Xiaopeng Zhang:

UMG-CLIP: A Unified Multi-granularity Vision Generalist for Open-World Understanding. 259-277 - Yao-Chih Lee, Zhoutong Zhang, Kevin Blackburn-Matzen, Simon Niklaus, Jianming Zhang, Jia-Bin Huang, Feng Liu:

Fast View Synthesis of Casual Videos with Soup-of-Planes. 278-296 - Neerja Thakkar, Karttikeya Mangalam, Andrea Bajcsy, Jitendra Malik:

Adaptive Human Trajectory Prediction via Latent Corridors. 297-314 - Rohan Choudhury

, Koichiro Niinuma
, Kris M. Kitani
, László A. Jeni
:
Video Question Answering with Procedural Programs. 315-332 - Wenhui Zhu, Xiwen Chen, Peijie Qiu, Aristeidis Sotiras, Abolfazl Razi, Yalin Wang:

DGR-MIL: Exploring Diverse Global Representation in Multiple Instance Learning for Whole Slide Image Classification. 333-351 - Dong Huo

, Zixin Guo, Xinxin Zuo
, Zhihao Shi, Juwei Lu, Peng Dai, Songcen Xu
, Li Cheng
, Yee-Hong Yang
:
TexGen: Text-Guided 3D Texture Generation with Multi-view Sampling and Resampling. 352-368 - Rongchang Li

, Zhenhua Feng
, Tianyang Xu
, Linze Li
, Xiaojun Wu
, Muhammad Awais
, Sara Atito Ali Ahmed
, Josef Kittler
:
C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition. 369-388 - Bin Xia, Shiyin Wang, Yingfan Tao, Yitong Wang, Jiaya Jia

:
LLMGA: Multimodal Large Language Model Based Generation Assistant. 389-406 - Mi Luo, Zihui Xue, Alex Dimakis, Kristen Grauman:

Put Myself in Your Shoes: Lifting the Egocentric Perspective from Exocentric Videos. 407-425 - Sriram Narayanan, Mani Ramanagopal, Mark Sheinin, Aswin C. Sankaranarayanan, Srinivasa G. Narasimhan:

Shape from Heat Conduction. 426-444 - Moritz Heep

, Eduard Zell
:
An Adaptive Screen-Space Meshing Approach for Normal Integration. 445-461 - Seung Hyun Lee

, Yinxiao Li
, Junjie Ke
, Innfarn Yoo
, Han Zhang
, Jiahui Yu
, Qifei Wang
, Fei Deng
, Glenn Entis
, Junfeng He
, Gang Li
, Sangpil Kim
, Irfan Essa
, Feng Yang
:
Parrot: Pareto-Optimal Multi-reward Reinforcement Learning Framework for Text-to-Image Generation. 462-478 - Eugene Valassakis, Guillermo Garcia-Hernando:

HandDGP: Camera-Space Hand Mesh Prediction with Differentiable Global Positioning. 479-496

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














