


20th VISIGRAPP 2025: Porto, Portugal - Volume 2: VISAPP
- Thomas Bashford-Rogers, Daniel Meneveaux, Mehdi Ammi, Mounia Ziat, Stefan Jänicke, Helen C. Purchase, Petia Radeva, Antonino Furnari, Kadi Bouatouch, A. Augusto de Sousa:
Proceedings of the 20th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, VISIGRAPP 2025 - Volume 2: VISAPP, Porto, Portugal, February 26-28, 2025. SCITEPRESS 2025, ISBN 978-989-758-728-3
Invited Speakers
- Julien Pettré:
Crowds and Graphics: Beyond Animation and Visual Effects. 5
- Daniel Archambault:
On the Importance of Visualisation in a Data Driven Society. 7-10
- Katherine J. Kuchenbecker:
Haptic Intelligence. 11
- Diane Larlus:
Lifelong Visual Representation Learning. 13
Image and Video Understanding
- Kohei Fukuda, Hiroaki Aizawa:
Adaptive Out-of-Distribution Detection with Coarse-to-Fine Grained Representation. 19-26
- Pham Phuc, Son Vuong, Khang Nguyen, Tuan Dang:
Distortion-Aware Adversarial Attacks on Bounding Boxes of Object Detectors. 27-38
- Oliver Hixon-Fisher, Jarek Francik, Dimitrios Makris:
Pose-Centric Motion Synthesis Through Adaptive Instance Normalization. 39-47
- Achref Ouni, Chafik Samir, Yousef Bouaziz, Anis Fradi:
ConvKAN: Towards Robust, High-Performance and Interpretable Image Classification. 48-58
- Anika Shrivastava, Renu Rameshan, Samar Agnihotri:
Latent Space Characterization of Autoencoder Variants. 59-67
- Felix Stillger, Frederik Hasecke, Lukas Hahn, Tobias Meisen:
Beyond Labels: Self-Attention-Driven Semantic Separation Using Principal Component Clustering in Latent Diffusion Models. 68-80
- Gusseppe Bravo Rocca, Peini Liu, Jordi Guitart, Ajay Dholakia, David Ellison, Rodrigo M. Carrillo-Larco:
Experience Replay and Zero-Shot Clustering for Continual Learning in Diabetic Retinopathy Detection. 81-92
- Ryoga Takahashi, Yota Yamamoto, Ryosuke Furuta, Yukinobu Taniguchi:
Detection of Door-Closing Defects by Learning from Physics-Based Simulations. 93-98
- Jose Alejandro Avellaneda Gonzalez, Tetsu Matsukawa, Einoshin Suzuki:
Leveraging Vision Language Models for Understanding and Detecting Violence in Videos. 99-113
- Eric Brouwer, Jan Erik van Woerden, Gertjan J. Burghouts, Matias Valdenegro-Toro, Marco Zullich:
Adaptive Prompt Tuning: Vision Guided Prompt Tuning with Cross-Attention for Fine-Grained Few-Shot Learning. 114-125
- Laura Fieback, Jakob Spiegelberg, Hanno Gottschalk:
MetaToken: Detecting Hallucination in Image Descriptions by Meta Classification. 126-137
- Fahad Majeed, Khaled Ahmed Lutf Al Thelaya, Nauman Ullah Gilal, Kamilla Swart-Arries, Marco Agus, Jens Schneider:
ReST: High-Precision Soccer Player Tracking via Motion Vector Segmentation. 138-149
- Zejian Zhang, Cristina Palmero, Sergio Escalera:
Transformer or Mamba for Temporal Action Localization? Insights from a Comprehensive Experimental Comparison Study. 150-162
- Ayush Roy, Sk Mohiuddin, Maxim V. Minenko, Dmitrii I. Kaplun, Ram Sarkar:
DeepSpace: Navigating the Frontier of Deepfake Identification Using Attention-Driven Xception and a Task-Specific Subspace. 163-172
- Muhammad Aqeel, Shakiba Sharifi, Marco Cristani, Francesco Setti:
Self-Supervised Iterative Refinement for Anomaly Detection in Industrial Quality Control. 173-183
- Ivan Jarsky, Maxim Kuzin, Valeria Efimova, Viacheslav Shalamov, Andrey Filchenkov:
VectorWeaver: Transformers-Based Diffusion Model for Vector Graphics Generation. 184-195
- Lukas Meiner, Jens Mehnert, Alexandru Paul Condurache:
Data-Free Dynamic Compression of CNNs for Tractable Efficiency. 196-208
- Elham Iravani, Frederik Hasecke, Lukas Hahn, Tobias Meisen:
Enhancing 3D Human Pose Estimation: A Novel Post-Processing Method. 209-220
- Marc Peral, Guillem Capellera, Antonio Rubio, Luis Ferraz, Francesc Moreno-Noguer, Antonio Agudo:
Temporally Accurate Events Detection Through Ball Possessor Recognition in Soccer. 221-231
- Artur A. M. Oliveira, Mateus Espadoto, Roberto Hirata Jr., Roberto M. Cesar Jr.:
Improving Image Classification Tasks Using Fused Embeddings and Multimodal Models. 232-241
- Andrea P. Gómez-Jaime, Luke Meyers, Josué A. Rodríguez-Cordero, José L. Agosto-Rivera, Tugrul Giray, Rémi Mégret:
Paint Blob Detection and Decoding for Identification of Honey Bees. 242-250
- Shiryu Ueno, Yoshikazu Hayashi, Shunsuke Nakatsuka, Yusei Yamada, Hiroaki Aizawa, Kunihito Kato:
Vision-Language In-Context Learning Driven Few-Shot Visual Inspection Model. 253-260
- Kazuki Omi, Jion Oshima, Toru Tamaki:
Action Tube Generation by Person Query Matching for Spatio-Temporal Action Detection. 261-268
- Alexander Naumann, Felix Hertlein, Jacqueline Höllig, Lucas Cazzonelli, Steffen Thoma:
CodeSCAN: ScreenCast ANalysis for Video Programming Tutorials. 269-277
- Håkan Ardö, Mikael G. Nilsson, Anthony Cioppa, Floriane Magera, Silvio Giancola, Haochen Liu, Bernard Ghanem, Marc Van Droogenbroeck:
Spiideo SoccerNet SynLoc: Single Frame World Coordinate Athlete Detection and Localization with Synthetic Data. 278-285
- Kim Bjerge, Paul Bodesheim, Henrik Karstoft:
Deep Image Clustering with Model-Agnostic Meta-Learning. 286-297
- Masakazu Fujio, Yosuke Kaga, Kenta Takahashi:
Improving Periocular Recognition Accuracy: Opposite Side Learning Suppression and Vertical Image Inversion. 298-305
- Youssef Shoeb, Nazir Nayal, Azarm Nowzad, Fatma Güney, Hanno Gottschalk:
Segment-Level Road Obstacle Detection Using Visual Foundation Model Priors and Likelihood Ratios. 306-315
- Michael Schulze, Nikolas Ebert, Laurenz Reichardt, Oliver Wasenmüller:
Classifier Ensemble for Efficient Uncertainty Calibration of Deep Neural Networks for Image Classification. 316-323
- Mathijs Lens, Aaron Van Campenhout, Toon Goedemé:
Conditioned Generative AI for Synthetic Training of 6D Object Pose Detection. 324-331
- Afshin Dini, Esa Rahtu:
Deep Local Feature Matching Image Anomaly Detection with Patch Adaptive Average Pooling Technique. 332-339
- Takuya Okano, Yohei Minekawa, Miki Hayakawa:
CTypiClust: Confidence-Aware Typical Clustering for Budget-Agnostic Active Learning with Confidence Calibration. 340-347
- Jurica Runtas, Tomislav Petkovic:
Neural Network Meta Classifier: Improving the Reliability of Anomaly Segmentation. 348-355
- Lucas Wojcik, Luiz Coelho, Roger Granada, David Menotti:
New Paths in Document Data Augmentation Using Templates and Language Models. 356-366
- Maniraj Sai Adapa, Marco Zullich, Matias Valdenegro-Toro:
Uncertainty Estimation for Super-Resolution Using ESRGAN. 367-374
- Majedaldein Almahasneh, Baihua Li, Haibin Cai, Nasir Rajabi, Laura Davies, Qinggang Meng:
Herbicide Efficacy Prediction Based on Object Segmentation of Glasshouse Imagery. 375-382
- Deryk Willyan Biotto, Guilherme Henrique Jardim, Vinicius Atsushi Sato Kawai, Bionda Rozin, Denis Henrique Pinheiro Salvadeo, Daniel Carlos Guimarães Pedronette:
Inductive Self-Supervised Dimensionality Reduction for Image Retrieval. 383-391
- Rikuto Konishi, Toru Abe, Takuo Suganuma:
A Method for Detecting Hands Moving Objects from Videos. 392-399
- Qin Wang, Kai Krajsek, Hanno Scharr:
Rescuing Easy Samples in Self-Supervised Pretraining. 400-409
- Tristan Cladière, Olivier Alata, Christophe Ducottet, Hubert Konik, Anne-Claire Legrand:
Knowledge Amalgamation for Single-Shot Context-Aware Emotion Recognition. 410-419
- Xuban Barberena, Fátima A. Saiz, Iñigo Barandiaran:
Handling Drift in Industrial Defect Detection Through MMD-Based Domain Adaptation. 420-429
- Muhammad Ahsan, Guy Ben-Yosef, Gemma Roig:
Beyond Data Augmentations: Generalization Abilities of Few-Shot Segmentation Models. 430-438
- Lauritz Christian Holme, Anton Mosquera Storgaard, Siavash Arjomand Bigdeli:
Membership Inference Attacks for Face Images Against Fine-Tuned Latent Diffusion Models. 439-446
- Carter Ung, Pranav Mantini, Shishir K. Shah:
Minimizing Number of Distinct Poses for Pose-Invariant Face Recognition. 447-455
- Masaki Nambata, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi, Takehito Teraguchi, Shota Okubo, Takuya Nanri:
VLLM Guided Human-Like Guidance Navigation Generation. 456-463
- Shonosuke Gonda, Fumihiko Sakaue, Jun Sato:
CLIP-MDGAN: Multi-Discriminator GAN Using CLIP Task Allocation. 464-470
- Junya Isogawa, Fumihiko Sakaue, Jun Sato:
Simultaneous Estimation of Driving Intentions for Multiple Vehicles Using Video Transformer. 471-477
- Yasutomo Kawanishi, Hitoshi Nishimura, Hiroshi Murase:
Human Pose Estimation from an Extremely Low-Resolution Image Sequence by Pose Transition Embedding Network. 478-485
- Samuel Marschall, Kira Maag:
Multi-Scale Foreground-Background Confidence for Out-of-Distribution Segmentation. 486-496
- Takahiro Sannomiya, Kazuhiro Hotta:
Accuracy Improvement of Neuron Concept Discovery Using CLIP with Grad-CAM-Based Attention Regions. 497-502
- Dominik Schraml, Gunther Notni:
Expanding Domain Coverage in Injection Molding Quality Inspection with Physically-Based Synthetic Data. 503-510
- Gustavo Rosseto Leticio, Vinicius Atsushi Sato Kawai, Lucas Pascotti Valem, Daniel Carlos Guimarães Pedronette:
Neighbor Embedding Projection and Graph Convolutional Networks for Image Classification. 511-518
- Gustavo Rosseto Leticio, Matheus Henrique Jacob dos Santos, Lucas Pascotti Valem, Vinicius Atsushi Sato Kawai, Fabricio Aparecido Breve, Daniel Carlos Guimarães Pedronette:
Graph Convolutional Networks and Particle Competition and Cooperation for Semi-Supervised Learning. 519-526
- Simon Fischer, Benedikt Kottler, Eva Strauß, Dimitri Bulatov:
Exploration and Validation of Specialized Loss Functions for Generative Visual-Thermal Image Domain Transfer. 527-534
- Alina Burgert, Babette Dellen, Uwe Jaekel, Dietrich Paulus:
Semi-Supervised Anomaly Detection in Skin Lesion Images. 535-541
- Artur Urzedowski, Kazimierz Choros:
Automatic Detection of the Driver Distractions Based on the Analysis of Face Videos. 542-549
Motion, Tracking, and 3D Vision
- Muhammad Asad Ali, Nadia Robertini, Didier Stricker:
HandMvNet: Real-Time 3D Hand Pose Estimation Using Multi-View Cross-Attention Fusion. 555-562
- Sudarshan Raghavan Iyengar, Subash Sharma, Patrick Vandewalle:
MuSt-NeRF: A Multi-Stage NeRF Pipeline to Enhance Novel View Synthesis. 563-573
- William A. Ramirez, César A. Sierra Franco, Thiago Motta, Alberto Raposo:
Urban Re-Identification: Fusing Local and Global Features with Residual Masked Maps for Enhanced Vehicle Monitoring in Small Datasets. 574-581
- Ryota Inoue, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi:
2D Motion Generation Using Joint Spatial Information with 2CM-GPT. 582-590
- Yizhou Li, Yusuke Monno, Masatoshi Okutomi, Yuuichi Tanaka, Seiichi Kataoka, Teruaki Kosiba:
Segmentation-Guided Neural Radiance Fields for Novel Street View Synthesis. 591-597
- Akash Malhotra, Nacéra Seghouani, Gilbert Badaro, Christophe Blaya:
ConMax3D: Frame Selection for 3D Reconstruction Through Concept Maximization. 598-609
- Glenn Grubert, Florian Barthel, Anna Hilsmann, Peter Eisert:
Improving Adaptive Density Control for 3D Gaussian Splatting. 610-621
- Valentino Behret, Regina Kushtanova, Islam Fadl, Simon Weber, Thomas Helmer, Frank Palme:
Sensor Calibration and Data Analysis of the MuFoRa Dataset. 622-631
- Reena, John H. Doonan, Kevin Williams, Fiona M. K. Corke, Huaizhong Zhang, Yonghuai Liu:
Uncertainty and Feature-Based Weighted Loss for 3D Wheat Part Segmentation. 632-641
- Maik Steinhauser, Laurenz Reichardt, Nikolas Ebert, Oliver Wasenmüller:
D-PLS: Decoupled Semantic Segmentation for 4D-Panoptic-LiDAR-Segmentation. 645-650
- Marko Pavlic, Darius Burschka:
Adaptable Distributed Vision System for Robot Manipulation Tasks. 651-658
- Muhannad Ismael, Maël Cornil:
Real-Time Kinematic Positioning and Optical See-Through Head-Mounted Display for Outdoor Tracking: Hybrid System and Preliminary Assessment. 659-666
- Vladimir Mashurov, Vasilii Latonov, Anastasia Martynova, Natalia Semenova:
Noisemaker 3D: Comprehensive Framework for Mesh Noise Generation. 667-674
- Shunpei Aou, Yota Yamamoto, Kazuaki Nakamura, Yukinobu Taniguchi:
Evaluating Homography Error for Accurate Multi-Camera Multi-Object Tracking of Dairy Cows. 675-682
- Murilo Santos Regio, Isabel H. Manssour:
FiDaSS: A Novel Dataset for Firearm Threat Detection in Real-World Scenes. 683-690
- Elton Alencar, Larissa Pessoa, Fernanda Costa, Guilherme Souza, Rosiane de Freitas:
Comparative Analysis of Deep Learning-Based Multi-Object Tracking Approaches Applied to Sports User-Generated Videos. 691-698
- Sota Ito, Yoshikazu Hayashi, Hiroaki Aizawa, Kunihito Kato:
Learning Neural Velocity Fields from Dynamic 3D Scenes via Edge-Aware Ray Sampling. 699-706
- Maxime Mérizette, Nicolas Audebert, Pierre Kervella, Jérôme Verdun:
3DSES: An Indoor Lidar Point Cloud Segmentation Dataset with Real and Pseudo-Labels from a 3D Model. 707-716
- Remi Lhoste, Antoine Vacavant, Damien Delhay:
MAESTRO: A Full Point Cloud Approach for 3D Anomaly Detection Based on Reconstruction. 717-724
- Iryna Repinetska, Anna Hilsmann, Peter Eisert:
Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios. 725-734
- Yushan Wang, Shuhei Tarashima, Norio Tagawa:
Efficient 3D Human Pose and Shape Estimation Using Group-Mix Attention in Transformer Models. 735-742
- Mingyang Zhang, Kristof Van Beeck, Toon Goedemé:
Leveraging Unreal Engine for UAV Object Tracking: The AirTrackSynth Synthetic Dataset. 743-750
- Yumi Ando, Fumihiko Sakaue, Jun Sato:
Recovery of Detailed Posture and Shape from Motion Video Images by Deforming SMPL. 751-757
- Marziyeh Bamdad, Hans-Peter Hutter, Alireza Darvishy:
Deep Learning-Powered Visual SLAM Aimed at Assisting Visually Impaired Navigation. 758-765
- Salvatore Mario Carota, Alessandro Privitera, Daniele Di Mauro, Antonino Furnari, Giovanni Maria Farinella, Francesco Ragusa:
Benchmarking Neural Rendering Approaches for 3D Reconstruction of Underwater Environments. 766-773
- Diego Hernández Rodríguez, Motoharu Sonogashira, Kazuya Kitano, Yuki Fujimura, Takuya Funatomi, Yasuhiro Mukaigawa, Yasutomo Kawanishi:
An Event Camera Simulator for Arbitrary Viewpoints Based on Neural Radiance Fields. 774-780
- Masanori Nishiguchi, Hitoshi Habe, Koji Abe, Masayuki Otani, Nobukazu Iguchi:
A Computer Vision Approach to Counting Farmed Fish in Flowing Water. 781-789
- Shunsuke Nakagawa, Takahiro Okabe, Ryo Kawahara:
Shape from Mirrored Polarimetric Light Field. 790-796
Mobile, Egocentric, and Robotic Vision
- Oguz Kedilioglu, Tasnim Tabassum Nova, Martin Landesberger, Lijiu Wang, Michael Hofmann, Jörg Franke, Sebastian Reitelshöfer:
PrIcosa: High-Precision 3D Camera Calibration with Non-Overlapping Field of Views. 801-809
- Daiki Iwata, Kanji Tanaka, Mitsuki Yoshida, Ryogo Yamamoto, Morishita Yuudai, Hiroki Tomoe:
Fine-Grained Self-Localization from Coarse Egocentric Topological Maps. 810-819
- Ahmed N. Ahmed, Siegfried Mercelis, Ali Anwar:
GIFF: Graph Iterative Attention Based Feature Fusion for Collaborative Perception. 820-829
- Kenta Tsukahara, Ryogo Yamamoto, Kanji Tanaka, Hiroki Tomoe:
SSGA: Synthetic Scene Graph Augmentation via Multiple Pipeline Variants. 833-840
- Bingyu Huang, Gianni Allebosch, Peter Veelaert, Tim Willems, Wilfried Philips, Jan Aelterman:
Low Latency Pedestrian Detection Based on Dynamic Vision Sensor and RGB Camera Fusion. 841-850
- Christian Hofmann, Christopher May, Patrick Ziegler, Iliya Ghotbiravandi, Jörg Franke, Sebastian Reitelshöfer:
Automated Individualization of Object Detectors for the Semantic Environment Perception of Mobile Robots. 851-862
- Alessandro Sebastiano Catinello, Giovanni Maria Farinella, Antonino Furnari:
Online Detection of End of Take and Release Actions from Egocentric Videos. 863-870
- Diego Renan Bruno, William D'Abruzzo Martins, Rafael Alceste Berri, Fernando Santos Osório:
Robotic Visual Attention Architecture for ADAS in Critical Embedded Systems for Smart Vehicles. 871-878
Applications and Services
- Yuta Kotsuji, Kazuaki Nakamura:
Defense Against Model Inversion Attacks Using a Dummy Recognition Model Trained with Synthetic Samples. 883-892
- Giovani Candido, Luis Henrique Morelli, Danilo Samuel Jodas, Giuliana Del Nero Velasco, Reinaldo Araújo de Lima, Kelton Augusto Pontara da Costa, João Paulo Papa:
Optimum-Path Forest Ensembles to Estimate the Internal Decay in Urban Trees. 895-902
- Pin-Yuan Yang, Yu-Shan Deng, Chieh-Shan Lin, An-Chun Luo, Shih-Chieh Chang:
Coloring 3D Avatars with Single-Image. 903-910
- Ayaka Asaeda, Noriko Takemura:
Internal State Estimation Based on Facial Images with Individual Feature Separation and Mixup Augmentation. 911-918
- Shiori Furukawa, Noriko Takemura:
Disease Estimation Using Gait Videos by Separating Individual Features Based on Disentangled Representation Learning. 919-925
- Laura Frank, Germaine Götzelmann, Danah Tonne:
Towards a Dataset for Paleographic Details in Historical Torah Scrolls. 926-933
- Hojin Yoo, Dhanyapriya Somasundaram, Hyunju Oh:
Efficient CNN-Based System for Automated Beetle Elytra Coordinates Prediction. 934-941
- Ömer Muhammet Soysal, Iphy Emeka Kelvin, Muhammed Esad Oztemel:
Effectiveness of Cross-Model Learning Through View-Model Ensemble on Detection of Spatiotemporal EEG Patterns. 942-949
- Pranav Bookanakere, Syeda Saniya, Syed Munzer Nouman, S. Pramath, Jayashree Rangareddy:
A Multimodal Approach to Research Paper Summarization. 950-957
- Nighat Bibi, Kathleen M. Curran, Jane Courtney:
Differential Diagnosis of Brain Diseases Using Ensemble Learning and Explainable AI. 958-964
- Leina Yoshida, Gustavo Camargo Domingues, Fabiana F. F. Peres, Claudio R. M. Mauricio, João Marcelo X. N. Teixeira:
Leveraging Affordable Solutions for Stereo Video Capture in Virtual Reality Applications. 965-971
- Eldiane Borges dos Santos Durães, João Batista Florindo:
Sleep-Stage Efficient Classification Using a Lightweight Self-Supervised Model. 972-979
- Sara Tesfamariam, Isah Abdullahi Lawal, Arda Durmaz, Jacob G. Scott:
DeepCellCount: Cell Counting Using Two-Step Deep Learning. 980-985
- Aleenah Khan, Hassan Foroosh:
Towards Safe Self-Stimulatory Behaviors in Autistic Children: HarmAlert4AutisticChildren (HA4AC). 986-994
