default search action
30th EUSIPCO 2022: Belgrade, Serbia
- 30th European Signal Processing Conference, EUSIPCO 2022, Belgrade, Serbia, August 29 - Sept. 2, 2022. IEEE 2022, ISBN 978-90-827970-9-1
- Duygu Dogan, Huang Xie, Toni Heittola, Tuomas Virtanen:
Zero-Shot Audio Classification using Image Embeddings. 1-5 - Meng-Han Lin, Jeng-Lin Li, Chi-Chun Lee:
Improving Multimodal Movie Scene Segmentation Using Mixture of Acoustic Experts. 6-10 - Ellen Riemens, Pablo Martínez-Nuevo, Jorge Martínez, Martin Bo Møller, Richard C. Hendriks:
On the Integration of Acoustics and LiDAR: a Multi-Modal Approach to Acoustic Reflector Estimation. 11-15 - Shreya G. Upadhyay, Bo-Hao Su, Chi-Chun Lee:
Improving Induced Valence Recognition by Integrating Acoustic Sound Semantics in Movies. 16-20 - Quoc-Huy Nguyen, Masashi Unoki:
Bone-conducted Speech Enhancement Using Vector-quantized Variational Autoencoder and Gammachirp Filterbank Cepstral Coefficients. 21-25 - Mathieu Fontaine, Diego Di Carlo, Kouhei Sekiguchi, Aditya Arie Nugraha, Yoshiaki Bando, Kazuyoshi Yoshii:
Elliptically Contoured Alpha-Stable Representation for MUSIC-Based Sound Source Localization. 26-30 - Natsuki Ueno, Hirokazu Kameoka:
Multiple Sound Source Localization Based on Stochastic Modeling of Spatial Gradient Spectra. 31-35 - Guillermo García-Barrios, Daniel Aleksander Krause, Archontis Politis, Annamaria Mesaros, Juana M. Gutiérrez-Arriola, Rubén Fraile:
Binaural source localization using deep learning and head rotation information. 36-40 - Julian Wechsler, Wolfgang Mack, Emanuël A. P. Habets:
End-to-End Signal-Aware Direction-of-Arrival Estimation Using Weighted Steered-Response Power. 41-45 - Ofer Schwartz:
Memory-Reduced DOA Estimators While Using Cyclic-Symmetrical Array. 46-49 - Ariel Frank, Assaf Ben-Kish, Israel Cohen:
Constant-Beamwidth Linearly Constrained Minimum Variance Beamformer. 50-54 - Emilie D'Olne, Vincent W. Neo, Patrick A. Naylor:
Speech Enhancement in Distributed Microphone Arrays Using Polynomial Eigenvalue Decomposition. 55-59 - Amos Schreibman, Elior Hadad, Anna Barnov, Eli Tzirkel-Hancock:
Dual MVDR Architecture for Adaptive Cancellation of Dynamic Interference. 60-64 - Rintaro Ikeshita, Tomohiro Nakatani:
ISS2: An Extension of Iterative Source Steering Algorithm for Majorization-Minimization-Based Independent Vector Analysis. 65-69 - Stefan Thaleiser, Gerald Enzner:
Binaural Wind-Noise Tracking with Steering Preset. 70-74 - Srikanth Burra, Asutosh Kar, Mads Græsbøll Christensen:
An Improved Functional Link Architecture for Nonlinear AEC. 75-79 - William Ravenscroft, Stefan Goetze, Thomas Hain:
Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation. 80-84 - Fran Pastor-Naranjo, Rocío del Amor, Julio Silva-Rodríguez, Miguel Ferrer, Gema Piñero, Valery Naranjo:
Conditional Generative Adversarial Networks for Acoustic Echo Cancellation. 85-89 - Tobias Kabzinski, Peter Jax:
A Unified Perspective on Time-Domain and Frequency-Domain Kalman Filters for Acoustic System Identification. 90-94 - Henri Gode, Simon Doclo:
Adaptive Dereverberation, Noise and Interferer Reduction Using Sparse Weighted Linearly Constrained Minimum Power Beamforming. 95-99 - Priyanka Gupta, Piyushkumar K. Chodingala, Hemant A. Patil:
Morlet Wavelet-Based Voice Liveness Detection using Convolutional Neural Network. 100-104 - Berkay Köprü, Engin Erzin:
Affective Burst Detection from Speech using Kernel-fusion Dilated Convolutional Neural Networks. 105-109 - Ankur T. Patil, Kuldeep Khoria, Hemant A. Patil:
Voice Liveness Detection using Constant-Q Transform-Based Features. 110-114 - Wei-Cheng Lin, Dimitra Emmanouilidou:
Toxic Speech and Speech Emotions: Investigations of Audio-based Modeling and Intercorrelations. 115-119 - Linh Vu, Raphaël C.-W. Phan, Lim Wern Han, Dinh Phung:
Improved speech emotion recognition based on music-related audio features. 120-124 - Sangeeta Srivastava, Ho-Hsiang Wu, João Rulff, Magdalena Fuentes, Mark Cartwright, Cláudio T. Silva, Anish Arora, Juan Pablo Bello:
A Study on Robustness to Perturbations for Representations of Environmental Sound. 125-129 - Ge Li, Dushyant Sharma, Patrick A. Naylor:
Non-Intrusive Signal Analysis for Room Adaptation of ASR Models. 130-134 - Erez Shalev, Israel Cohen:
Multiroom Speech Emotion Recognition. 135-139 - Ville-Veikko Eklund, Aleksandr Diment, Tuomas Virtanen:
Noise, Device and Room Robustness Methods for Pronunciation Error Detection. 140-144 - Aditya Raikar, Meet H. Soni, Ashish Panda, Sunil Kumar Kopparapu:
Acoustic Model Adaptation In Reverberant Conditions Using Multi-task Learned Embeddings. 145-149 - Shrishail Baligar, Shawn D. Newsam:
CoSSD - An end-to-end framework for multi-instance source separation and detection. 150-154 - Zicheng Feng, Yu Tsao, Fei Chen:
Recurrent Neural Network-based Estimation and Correction of Relative Transfer Function for Preserving Spatial Cues in Speech Separation. 155-159 - Jouni Paulus, Matteo Torcoli:
Sampling Frequency Independent Dialogue Separation. 160-164 - George Close, Thomas Hain, Stefan Goetze:
MetricGAN+/-: Increasing Robustness of Noise Reduction on Unseen Data. 165-169 - Runze Wang, Iman Moazzen, Wei-Ping Zhu:
A Computation-Efficient Neural Network for VAD using Multi-Channel Feature. 170-174 - Christos Garoufis, Athanasia Zlatintsi, Panayiotis Paraskevas Filntisis, Niki Efthymiou, Emmanouil Kalisperakis, Thomas Karantinos, Vasiliki Garyfalli, Marina Lazaridi, Nikolaos Smyrnis, Petros Maragos:
Towards Unsupervised Subject-Independent Speech-Based Relapse Detection in Patients with Psychosis using Variational Autoencoders. 175-179 - Ajinkya Kulkarni, Vincent Colotte, Denis Jouvet:
Multi-stage attention for fine-grained expressivity transfer in multispeaker text-to-speech system. 180-184 - Madhurananda Pahar, Marisa Klopper, Byron Reeve, Rob Warren, Grant Theron, Andreas H. Diacon, Thomas Niesler:
Wake-Cough: cough spotting and cougher identification for personalised long-term cough monitoring. 185-189 - Shakeel A. Sheikh, Md. Sahidullah, Fabrice Hirsch, Slim Ouni:
Robust Stuttering Detection via Multi-task and Adversarial Learning. 190-194 - Changhong Wang, Emmanouil Benetos, Shuge Wang, Elisabetta Versace:
Joint Scattering for Automatic Chick Call Recognition. 195-199 - Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino:
Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model. 200-204 - Jan Schlüter, Gerald Gutenbrunner:
EfficientLEAF: A Faster LEarnable Audio Frontend of Questionable Use. 205-208 - Nara Hahn, Frank Schultz, Sascha Spors:
Band Limited Impulse Invariance Method. 209-213 - Valeria Bruschi, Stefano Nobili, Alessandro Terenzi, Stefania Cecchi:
Using Interpolated FIR Technique for Digital Crossover Filters Design. 214-218 - Ruchi Pandey, Santosh Nannuru, Peter Gerstoft:
Experimental Validation of Wideband SBL Models for DOA Estimation. 219-223 - Tian Cheng, Masataka Goto:
An Analysis of Using Fuzzy Annotations in CRNN-Based Joint Beat and Downbeat Tracking. 224-228 - Sehun Kim, Tomoki Hayashi, Tomoki Toda:
Note-level Automatic Guitar Transcription Using Attention Mechanism. 229-233 - Antoine Lavault, Axel Roebel, Matthieu Voiry:
StyleWaveGAN: Style-based synthesis of drum sounds using generative adversarial networks for higher audio quality. 234-238 - Yudong Zhao, György Fazekas, Mark Sandler:
Transfer Learning for Violinist Identification. 239-243 - Jeff Miller, Ken O'Hanlon, Mark B. Sandler:
Improving Balance in Automatic Chord Recognition with Random Forests. 244-248 - Panagiotis Papantonakis, Christos Garoufis, Petros Maragos:
Multi-band Masking for Waveform-based Singing Voice Separation. 249-253 - Yigitcan Özer, Jonathan Hansen, Tim Zunner, Meinard Müller:
Investigating Nonnegative Autoencoders for Efficient Audio Decomposition. 254-258 - Zhaoyi Liu, Haoyu Tang, Sam Michiels, Wouter Joosen, Danny Hughes:
Unsupervised Acoustic Anomaly Detection Systems Based on Gaussian Mixture Density Neural Network. 259-263 - Nikolaos Stefanakis, Konstantinos Psaroulakis, Nikonas Simou, Christos Astaras:
An Open-Access System for Long-Range Chainsaw Sound Detection. 264-268 - Tomoya Nishida, Kota Dohi, Takashi Endo, Masaaki Yamamoto, Yohei Kawaguchi:
Anomalous Sound Detection Based on Machine Activity Detection. 269-273 - Harsh Purohit, Takashi Endo, Masaaki Yamamoto, Yohei Kawaguchi:
Hierarchical Conditional Variational Autoencoder Based Acoustic Anomaly Detection. 274-278 - Kota Dohi, Takashi Endo, Yohei Kawaguchi:
Disentangling physical parameters for anomalous sound detection under domain shifts. 279-283 - Stavros Ntalampiras:
Adversarial Attacks Against Audio Surveillance Systems. 284-288 - Jens Heitkaemper, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
Neural Network Based Carrier Frequency Offset Estimation From Speech Transmitted Over High Frequency Channels. 289-293 - Ibuki Kuroyanagi, Tomoki Hayashi, Kazuya Takeda, Tomoki Toda:
Improvement of Serial Approach to Anomalous Sound Detection by Incorporating Two Binary Cross-Entropies for Outlier Exposure. 294-298 - Jouni Paulus, Matteo Torcoli:
Geometrically-Motivated Primary-Ambient Decomposition With Center-Channel Extraction. 299-303 - Gerald Enzner, Christoph Urbanietz, Rainer Martin:
Optimized Learning of Spatial-Fourier Representations from Fast HRIR Recordings. 304-308 - Amy Bastine, Lachlan Birnie, Thushara D. Abhayapala, Prasanga N. Samarasinghe, Vladimir Tourbabin:
Ambisonics Capture using Microphones on Head-worn Device of Arbitrary Geometry. 309-313 - Leo McCormack, Archontis Politis:
Estimating and Reproducing Ambience in Ambisonic Recordings. 314-318 - Xiaoli Tang, Jihui Zhang, David Lou Alon, Zamir Ben-Hur, Prasanga N. Samarasinghe, Thushara D. Abhayapala:
Wave Domain Sound Field Interpolation Using Two Spherical Microphone Arrays. 319-323 - Daniel T. Jones, Dushyant Sharma, Stanislav Yu. Kruchinin, Patrick A. Naylor:
Microphone Array Coding Preserving Spatial Information for Cloud-based Multichannel Speech Recognition. 324-328 - Yonggang Hu, Sharon Gannot:
Comparison of Learning-Based DOA Estimation Between SH Domain Features. 329-333 - Shoken Kaneko, Hannes Gamper:
Towards all-purpose full-sphere binaural localization. 334-338 - Raimundo Gonzalez, Christoph Hold, Tapio Lokki, Archontis Politis:
Sector-Based Encoding and Data Compression of Virtual Acoustic Scattering. 339-343 - Shuming Luan, Yukoh Wakabayashi, Tomoki Toda:
Modified Sound Field Interpolation Method for Rotation-robust Beamforming with Unequally Spaced Circular Microphone Array. 344-348 - Priyanka Gupta, Hemant A. Patil:
Linear Frequency Residual Cepstral Features for Replay Spoof Detection on ASVSpoof 2019. 349-353 - Ahmad Aloradi, Wolfgang Mack, Mohamed Elminshawi, Emanuël A. P. Habets:
Speaker Verification in Multi-Speaker Environments Using Temporal Feature Fusion. 354-358 - Yang Xiang, Jesper Lisby Højvang, Morten Højfeldt Rasmussen, Mads Græsbøll Christensen:
A deep representation learning speech enhancement method using β-VAE. 359-363 - Mohammad MohammadAmini, Driss Matrouf, Jean-François Bonatsre, Sandipana Dowerah, Romain Serizel, Denis Jouvet:
A Comprehensive Exploration of Noise Robustness and Noise Compensation in ResNet and TDNN-based Speaker Recognition Systems. 364-368 - Priyanka Gupta, Piyushkumar K. Chodingala, Hemant A. Patil:
Energy Separation Based Instantaneous Frequency Estimation from Quadrature and In-Phase Components for Replay Spoof Detection. 369-373 - Hemant A. Patil, Rajul Acharya, Ankur T. Patil, Priyanka Gupta:
Non-Cepstral Uncertainty Vector for Replay Spoofed Speech Detection. 374-378 - Kai Li, Xugang Lu, Masato Akagi, Jianwu Dang, Sheng Li, Masashi Unoki:
Relationship Between Speakers' Physiological Structure and Acoustic Speech Signals: Data-Driven Study Based on Frequency-Wise Attentional Neural Network. 379-383 - Abraham Woubie, Tom Bäckström:
Voice Quality Features for Replay Attack Detection. 384-388 - Frederik Bous, Laurent Benaroya, Nicolas Obin, Axel Roebel:
Voice Reenactment with F0 and timing constraints and adversarial learning of conversions. 389-393 - Akansha Tyagi, Padmanabhan Rajan:
Location-invariant representations for acoustic scene classification. 394-398 - Daniel Aleksander Krause, Annamaria Mesaros:
Binaural Signal Representations for Joint Sound Event Detection and Acoustic Scene Classification. 399-403 - Siyuan Song, Brecht Desplanques, Kris Demuynck, Nilesh Madhu:
SoftVAD in iVector-Based Acoustic Scene Classification for Robustness to Foreground Speech. 404-408 - Paul Primus, Gerhard Widmer:
Improved Zero-Shot Audio Tagging & Classification with Patchout Spectrogram Transformers. 410-413 - Carlo Aironi, Samuele Cornell, Emanuele Principi, Stefano Squartini:
Graph Node Embeddings for ontology-aware Sound Event Classification: an evaluation study. 414-418 - Yun-Ning Hung, Alexander Lerch:
Feature-informed Embedding Space Regularization For Audio Classification. 419-423 - Jagmohan Chauhan, Young D. Kwon, Cecilia Mascolo:
Exploring On-Device Learning Using Few Shots for Audio Classification. 424-428 - Shubhr Singh, Huy Phan, Emmanouil Benetos:
Hypernetworks for Sound event Detection: a Proof-of-Concept. 429-433 - Anna Ollerenshaw, Md Asif Jalal, Thomas Hain:
Insights of Neural Representations in Multi-Banded and Multi-Channel Convolutional Transformers for End-to-End ASR. 434-438 - Leila Ben Letaifa, Jean-Luc Rouas:
Transformer Model Compression for End-to-End Speech Recognition on Mobile Devices. 439-443 - Dhanya Eledath, Narasimha Rao Thurlapati, V. Pavithra, Tirthankar Banerjee, V. Ramasubramanian:
Few-shot learning for E2E speech recognition: architectural variants for support set generation. 444-448 - Edvin Pakoci, Darko Pekar, Branislav M. Popovic, Milan Secujski, Vlado Delic:
Overcoming Data Sparsity in Automatic Transcription of Dictated Medical Findings. 454-458 - Steven Vander Eeckt, Hugo Van hamme:
Continual Learning for Monolingual End-to-End Automatic Speech Recognition. 459-463 - Mahmoud El-Hindi, Michael Muma, Abdelhak M. Zoubir:
Semi-Supervised Online Speaker Diarization using Vector Quantization with Alternative Codebooks. 464-468 - Chi-Mao Fan, Tsung-Jung Liu, Kuan-Hsien Liu, Ching-Hsiang Chiu:
Selective Residual M-Net for Real Image Denoising. 469-473 - R. Krishna Kanth, Andrew Gigie, Kriti Kumar, Achanna Anil Kumar, Angshul Majumdar, Balamuralidhar P:
Multi-modal Image Super-resolution with Joint Coupled Deep Transform Learning. 474-478 - Renke Wang, Jun-Jie Huang, Pier Luigi Dragotti:
FRISPEE: FRI-Based Single Image Super-Resolution with Deep Recursive Residual Network. 479-483 - Nour Aburaed, Mohammed Q. Alkhatib, Stephen Marshall, Jaime Zabalza, Hussain Al-Ahmad:
A Comparative Study of Loss Functions for Hyperspectral SISR. 484-487 - Gabriele Scrivanti, Emilie Chouzenoux, Jean-Christophe Pesquet:
A CNC approach for Directional Total Variation. 488-492 - Manoj Kumar Panda, Badri N. Subudhi, Thangaraj Veerakumar, Vinit Jakhetiya:
Integration of Bi-dimensional Empirical Mode Decomposition With Two Streams Deep Learning Network for Infrared and Visible Image Fusion. 493-497 - Takuma Aizu, Ryo Matsuoka:
Reflection Removal Using Multiple Polarized Images with Different Exposure Times. 498-502 - Prasenjit Mondal, Ankit Bal:
A Statistical Approach for Multi-frame Shadow Movement Detection and Shadow Removal for Document Capture. 508-512 - Pierre Le Jeune, Anissa Mokraoui:
Improving Few-Shot Object Detection through a Performance Analysis on Aerial and Natural Images. 513-517 - Axel Baldanza, Jean-François Aujol, Yann Traonmilin, François Alary:
Piecewise linear prediction model for action tracking in sports. 518-522 - Matthias Pollach, Felix Schiegg, Matthias Ludwig, Ann-Christin Bette, Alois C. Knoll:
Boundary Enhanced Semantic Segmentation for High Resolution Electron Microscope Images. 523-527 - Zhengning Zhang, Lin Zhang, Yue Wang, Pengming Feng, Shaobo Liu, Jian Wang:
Cross-Level Semantic Segmentation Guided Feature Space Decoupling And Augmentation for Fine-Grained Ship Detection. 528-532 - Andreas Specker, Jürgen Beyerer:
Toward Accurate Online Multi-target Multi-camera Tracking in Real-time. 533-537 - Nizar Bouhlel, David Rousseau:
Multi-Temporal SAR Change Detection using Wavelet Transforms. 538-542 - Zheng Qi, AprilPyone MaungMaung, Yuma Kinoshita, Hitoshi Kiya:
Privacy-Preserving Image Classification Using Vision Transformer. 543-547 - Jorge Bacca, Alejandra Hernandez-Rojas, Henry Arguello:
Deep Coding Patterns Design for Compressive Near-Infrared Spectral Classification. 548-552 - Taichi Ishiwatari, Makiko Azuma, Takuya Handa, Masaki Takahashi, Takahiro Mochizuki, Masanori Sano:
Audio Visual Graph Attention Networks for Event Detection in Sports Video. 553-557 - Dragos Nastasiu, Angela Digulescu, Cornel Ioana, Maxime Bernier, Frédéric Garet, Alexandru Serbanescu:
A novel machine learning approach in Image Pattern Recognition under invariance constraints. 558-562 - Tony Marteau, David Sodoyer, Sebastien Ambellouis, Sitou Afanou:
Level fusion analysis of recurrent audio and video neural network for violence detection in railway. 563-567 - Zohaib Amjad Khan, Giuseppe Valenzise, Aladine Chetouani, Frédéric Dufaux:
Towards an Image Utility Assessment Framework for Machine Perception. 568-572 - Ching-Yu Kao, Junhao Chen, Karla Markert, Konstantin Böttinger:
Rectifying adversarial inputs using XAI techniques. 573-577 - Kakeru Hara, Hiromitsu Isobe, Takao Jinno:
Intrinsic Image Decomposition Under Multiple Colored Lighting Conditions Using a Single Image. 578-582 - Karelia Pena-Pena, Daniel L. Lau, Gonzalo R. Arce:
Colored-QRNet: Fast QR Code Color Image Embedding. 583-587 - Ziyi Yu, Ganggang Dong:
Similarity Analysis of Simulated SAR Target Images. 588-592 - Mireille Fares, Catherine Pelachaud, Nicolas Obin:
Transformer Network for Semantically-Aware and Speech-Driven Upper-Face Generation. 593-597 - Zuzana Bílková, Michal Bartos, Adam Domínec, Simon Gresko, Adam Novozámský, Barbara Zitová, Markéta Paroubková:
ASSISLT: Computer-aided speech therapy tool. 598-602 - Yan Xu, Hongce Wang, Zhongping Dong, Yuexuan Li, Andrew Abel:
Gabor-based Audiovisual Fusion for Mandarin Chinese Speech Recognition. 603-607 - Marcele O. K. Mendonça, Javier Maroto, Pascal Frossard, Paulo S. R. Diniz:
Adversarial training with informed data selection. 608-612 - Benedikt Lorch, Nicole Scheler, Christian Riess:
Compliance Challenges in Forensic Image Analysis Under the Artificial Intelligence Act. 613-617 - Ruzica Jevtic, Pablo Pérez-Tirador, Carmen Cabezaolias, Pablo Carnero, Gabriel Caffarena:
Side-channel Attack Countermeasure Based on Power Supply Modulation. 618-622 - David Meltzer, David Luengo:
An efficient clustering-based non-fiducial approach for ECG biometric recognition. 623-627 - Rudolf Schraml, Georg Wimmer, Heinz Hofbauer, Ehsaneddin Jalilian, Dinara Bekkozhayeva, Petr Císar, Andreas Uhl:
CNN-based fish iris identification. 628-632 - Süleyman Özdel, Çagatay Ates, Pelin Damla Ates, Mutlu Koca, Emin Anarim:
Payload-Based Network Traffic Analysis for Application Classification and Intrusion Detection. 638-642 - Anis Trabelsi, Marc Michel Pic, Jean-Luc Dugelay:
Improving Deepfake Detection by Mixing Top Solutions of the DFDC. 643-647 - Jakub Stankowski, Marek Domanski, Tomasz Grajek:
Fast and Energy-Efficient Watermarking of HEVC-Compressed Video Bitstreams. 648-652 - Florian Euchner, Niklas Süppel, Marc Gauger, Sebastian Dörner, Stephan ten Brink:
Deep Learning for Uplink CSI-based Downlink Precoding in FDD massive MIMO Evaluated on Indoor Measurements. 653-657 - Kai Kang, Qiyu Hu, Yunlong Cai, Guanding Yu, Jakob Hoydis, Yonina C. Eldar:
Joint Channel Estimation and Hybrid Beamforming via Deep-Unfolding. 658-662 - Arzhang Shahbazi, Igor Donevski, Jimmy Jessen Nielsen, Marco Di Renzo:
Federated Reinforcement Learning UAV Trajectory Design for Fast Localization of Ground Users. 663-666