


22nd ISMIR 2021: Online
- Jin Ha Lee, Alexander Lerch, Zhiyao Duan, Juhan Nam, Preeti Rao, Peter van Kranenburg, Ajay Srinivasamurthy: 
 Proceedings of the 22nd International Society for Music Information Retrieval Conference, ISMIR 2021, Online, November 7-12, 2021. 2021, ISBN 978-1-7327299-0-2
Papers
- Rohit M. A., Amitrajit Bhattacharjee, Preeti Rao: 
 Four-way Classification of Tabla Strokes with Models Adapted from Automatic Drum Transcription. 19-26
- Taketo Akama: 
 A Contextual Latent Space Model: Subsequence Modulation in Melodic Sequence. 27-34
- María Alfaro-Contreras, David Rizo, José M. Iñesta, Jorge Calvo-Zaragoza: 
 OMR-assisted transcription: a case study with early prints. 35-41
- Stefan Andreas Baumann: 
 Deeper Convolutional Neural Networks and Broad Augmentation Policies Improve Performance in Musical Key Estimation. 42-49
- Axel Berndt: 
 The Music Performance Markup Format and Ecosystem. 50-57
- Louis Bigo, David Regnier, Nicolas Martin: 
 Identification of rhythm guitar sections in symbolic tablatures. 58-65
- Charles Brazier, Gerhard Widmer: 
 On-Line Audio-to-Lyrics Alignment Based on a Reference Performance. 66-73
- Aaron Carter-Enyi, Gilad Rabinovitch, Nathaniel Condit-Schultz: 
 Visualizing Intertextual Form with Arc Diagrams: Contour and Schema-based Methods. 74-80
- Francisco J. Castellanos, Antonio Javier Gallego, Jorge Calvo-Zaragoza: 
 Unsupervised Domain Adaptation for Document Analysis of Music Score Images. 81-87
- Rodrigo Castellon, Chris Donahue, Percy Liang: 
 Codified audio language modeling learns useful representations for music information retrieval. 88-96
- Chin-Jui Chang, Chun-Yi Lee, Yi-Hsuan Yang: 
 Variable-Length Music Score Infilling via XLNet and Musically Specialized Positional Encoding. 97-104
- Yi-Wei Chen, Hung-Shin Lee, Yen-Hsing Chen, Hsin-Min Wang: 
 SurpriseNet: Melody Harmonization Conditioning on User-controlled Surprise Contours. 105-112
- Vincent K. M. Cheung, Hsuan-Kai Kao, Li Su: 
 Semi-supervised violin fingering generation using variational autoencoders. 113-120
- Keunwoo Choi, Yuxuan Wang: 
 Listen, Read, and Identify: Multimodal Singing Language Identification of Music. 121-127
- Shreyan Chowdhury, Gerhard Widmer: 
 On Perceived Emotion in Expressive Piano Performance: Further Experimental Evidence for the Relevance of Mid-level Perceptual Features. 128-134
- Bas Cornelissen, Willem H. Zuidema, John Ashley Burgoyne: 
 Cosine Contours: a Multipurpose Representation for Melodies. 135-142
- Shuqi Dai, Zeyu Jin, Celso Gomes, Roger B. Dannenberg: 
 Controllable deep melody generation via hierarchical music structure representation. 143-150
- Emir Demirel, Sven Ahlbäck, Simon Dixon: 
 MSTRE-Net: Multistreaming Acoustic Modeling for Automatic Lyrics Transcription. 151-158
- Hao-Wen Dong, Chris Donahue, Taylor Berg-Kirkpatrick, Julian J. McAuley: 
 Towards Automatic Instrumentation by Learning to Separate Parts in Symbolic Multitrack Music. 159-166
- Sachinda Edirisooriya, Hao-Wen Dong, Julian J. McAuley, Taylor Berg-Kirkpatrick: 
 An Empirical Evaluation of End-to-End Polyphonic Optical Music Recognition. 167-173
- Anders Elowsson, Olivier Lartillot: 
 A Hardanger Fiddle Dataset with Performances Spanning Emotional Expressions and Annotations Aligned using Image Registration. 174-181
- Jeffrey Ens, Philippe Pasquier: 
 Building the MetaMIDI Dataset: Linking Symbolic and Audio Musical Data. 182-188
- Christoph Finkensiep, Martin Rohrmeier: 
 Modeling and Inferring Proto-Voice Structure in Free Polyphony. 189-196
- Francesco Foscarin, Nicolas Audebert, Raphaël Fournier-S'niehotta: 
 PKSpell: Data-Driven Pitch Spelling and Key Signature Estimation. 197-204
- Dave Foster, Simon Dixon: 
 Filosax: A Dataset of Annotated Jazz Saxophone Recordings. 205-212
- Giovanni Gabbolini, Derek Bridge: 
 An interpretable music similarity measure based on path interestingness. 213-219
- Hugo Flores García, Aldo Aguilar, Ethan Manilow, Bryan Pardo: 
 Leveraging Hierarchical Structures for Few-Shot Musical Instrument Recognition. 220-228
- Mark Gotham, Rainer Kleinertz, Christof Weiss, Meinard Müller, Stephanie Klauk: 
 What if the 'When' Implies the 'What'?: Human harmonic analysis datasets clarify the relative role of the separate steps in automatic tonal analysis. 229-236
- Juan Sebastián Gómez Cañón, Estefanía Cano, Yi-Hsuan Yang, Perfecto Herrera, Emilia Gómez: 
 Let's agree to disagree: Consensus Entropy Active Learning for Personalized Music Emotion Recognition. 237-245
- Curtis Hawthorne, Ian Simon, Rigel Swavely, Ethan Manilow, Jesse H. Engel: 
 Sequence-to-Sequence Piano Transcription with Transformers. 246-253
- Ben Hayes, Charalampos Saitis, György Fazekas: 
 Neural Waveshaping Synthesis. 254-261
- Johannes Hentschel, Fabian C. Moss, Markus Neuwirth, Martin Rohrmeier: 
 A semi-automated workflow paradigm for the distributed creation and curation of expert annotations. 262-269
- Mojtaba Heydari, Frank Cwitkowitz, Zhiyao Duan: 
 BeatNet: CRNN and Particle Filtering for Online Joint Beat, Downbeat and Meter Tracking. 270-277
- Yuki Hiramatsu, Eita Nakamura, Kazuyoshi Yoshii: 
 Joint Estimation of Note Values and Voices for Audio-to-Score Piano Transcription. 278-284
- Yo-Wei Hsiao, Li Su: 
 Learning note-to-note affinity for voice segregation and melody line identification of symbolic music data. 285-292
- Jui-Yang Hsu, Li Su: 
 VOCANO: A note transcription framework for singing voice in polyphonic music. 293-300
- Rujing Stacy Huang, Bob L. T. Sturm, Andre Holzapfel: 
 De-centering the West: East Asian Philosophies and the Ethics of Applying Artificial Intelligence to Music. 301-309
- Tun-Min Hung, Bo-Yu Chen, Yen-Tung Yeh, Yi-Hsuan Yang: 
 A Benchmarking Initiative for Audio-domain Music Generation using the FreeSound Loop Dataset. 310-317
- Hsiao-Tzu Hung, Joann Ching, Seungheon Doh, Nabin Kim, Juhan Nam, Yi-Hsuan Yang: 
 EMOPIA: A Multi-Modal Pop Piano Dataset For Emotion Recognition and Emotion-based Music Generation. 318-325
- Kevin Ji, Daniel Yang, Timothy Tsai: 
 Piano Sheet Music Identification Using Marketplace Fingerprinting. 326-333
- Keunhyoung Luke Kim, Jongpil Lee, Sangeun Kum, Juhan Nam: 
 Learning a cross-domain embedding space of vocal and mixed audio with a structure-preserving triplet loss. 334-341
- Qiuqiang Kong, Yin Cao, Haohe Liu, Keunwoo Choi, Yuxuan Wang: 
 Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation. 342-349
- Filip Korzeniowski, Sergio Oramas, Fabien Gouyon: 
 Artist Similarity Using Graph Neural Networks. 350-357
- Jin Ha Lee, Arpita Bhattacharya, Ria Antony, Nicole K. Santero, Anh Le: 
 "Finding Home": Understanding How Music Supports Listeners' Mental Health through a Case Study of BTS. 358-365
- Harin Lee, Frank Höger, Marc Schönwiesner, Minsu Park, Nori Jacoby: 
 Cross-cultural Mood Perception in Pop Songs and its Alignment with Mood Detection Algorithms. 366-373
- Jordan Lenchitz: 
 Reconsidering quantization in MIR. 374-380
- Liwei Lin, Gus Xia, Qiuqiang Kong, Junyan Jiang: 
 A unified model for zero-shot music source separation, transcription and synthesis. 381-388
- Carlos Lordelo, Emmanouil Benetos, Simon Dixon, Sven Ahlbäck: 
 Pitch-Informed Instrument Assignment using a Deep Convolutional Network with Multiple Kernel Shapes. 389-395
- Wei Tsung Lu, Ju-Chiang Wang, Minz Won, Keunwoo Choi, Xuchen Song: 
 SpecTNT: a Time-Frequency Transformer for Music Audio. 396-403
- Néstor Nápoles López, Mark Gotham, Ichiro Fujinaga: 
 AugmentedNet: A Roman Numeral Analysis Network with Synthetic Training Examples and Additional Tonal Tasks. 404-411
- Vincenzo Madaghiele, Pasquale Lisena, Raphaël Troncy: 
 MINGUS: Melodic Improvisation Neural Generator Using Seq2Seq. 412-419
- Ninon Lizé Masclef, Andrea Vaglio, Manuel Moussallam: 
 User-centered evaluation of lyrics-to-audio alignment. 420-427
- Naotake Masuda, Daisuke Saito: 
 Synthesizer Sound Matching with Differentiable DSP. 428-434
- Andrew McLeod, Martin Rohrmeier: 
 A Modular System for the Harmonic Analysis of Musical Scores using a Large Vocabulary. 435-442
- Gianluca Micchi, Katerina Kosta, Gabriele Medeot, Pierre Chanquion: 
 A deep learning method for enforcing coherence in Automatic Chord Recognition. 443-451
- Martin Miguel, Diego Fernández Slezak: 
 Modeling beat uncertainty as a 2D distribution of period and phase: a MIR task proposal. 452-459
- Olof Misgeld, Torbjörn Gulz, Jura Miniotaite, Andre Holzapfel: 
 A case study of deep enculturation and sensorimotor synchronization to real music. 460-467
- Gautam Mittal, Jesse H. Engel, Curtis Hawthorne, Ian Simon: 
 Symbolic Music Generation with Diffusion Models. 468-475
- Faraaz Nadeem: 
 Learning from Musical Feedback with Sonic the Hedgehog. 476-483
- Javier Nistal, Stefan Lattner, Gaël Richard: 
 DarkGAN: Exploiting Knowledge Distillation for Comprehensible Audio Synthesis With GANs. 484-492
- Takehisa Oyama, Ryoto Ishizuka, Kazuyoshi Yoshii: 
 Phase-Aware Joint Beat and Downbeat Estimation Based on Periodicity of Metrical Structure. 493-499
- Yuto Ozaki, John M. McBride, Emmanouil Benetos, Peter Q. Pfordresher, Joren Six, Adam Tierney, Polina Proutskova, Emi Sakai, Haruka Kondo, Haruno Fukatsu, Shinya Fujii, Patrick E. Savage: 
 Agreement Among Human and Automated Transcriptions of Global Songs. 500-508
- Emilia Parada-Cabaleiro, Maximilian Schmitt, Anton Batliner, Björn W. Schuller, Markus Schedl: 
 Automatic Recognition of Texture in Renaissance Music. 509-516
- Ashis Pati, Alexander Lerch: 
 Is Disentanglement enough? On Latent Representations for Controllable Music Generation. 517-524
- Nicolás Pironio, Diego Fernández Slezak, Martin Miguel: 
 Pulse clarity metrics developed from a deep learning beat tracking model. 525-530
- Verena Praher, Katharina Prinz, Arthur Flexer, Gerhard Widmer: 
 On the Veracity of Local, Model-agnostic Explanations in Audio Classification: Targeted Investigations with Adversarial Examples. 531-538
- Laure Prétet, Gaël Richard, Geoffroy Peeters: 
 Is there a "language of music-video clips" ? A qualitative and quantitative study. 539-546
- R. Gowriprasad, V. Venkatesh, Hema A. Murthy, R. Aravind, K. Sri Rama Murty: 
 Tabla Gharana Recognition from Audio music recordings of Tabla Solo performances. 547-554
- Lindsey Reymore, Emmanuelle Beauvais-Lacasse, Bennett Smith, Stephen McAdams: 
 Navigating noise: Modeling perceptual correlates of noise-related semantic timbre categories with audio features. 555-561
- Kyle Robinson, Dan Brown: 
 Quantitative User Perceptions of Music Recommendation List Diversity. 562-568
- Martin Rohrmeier, Fabian C. Moss: 
 A Formal Model of Extended Tonal Harmony. 569-578
- Simon Rouard, Gaëtan Hadjeres: 
 CRASH: Raw Audio Score-based Generative Modeling for Controllable High-resolution Drum Sound Synthesis. 579-585
- Luke O. Rowe, George Tzanetakis: 
 Curriculum Learning for Imbalanced Classification in Large Vocabulary Automatic Chord Recognition. 586-593
- Justin Salamon, Oriol Nieto, Nicholas J. Bryan: 
 Deep Embeddings and Section Fusion Improve Music Segmentation. 594-601
- Antonia Saravanou, Federico Tomasi, Rishabh Mehrotra, Mounia Lalmas: 
 Multi-Task Learning of Graph-based Inductive Representations of Music Content. 602-609
- Pedro Sarmento, Adarsh Kumar, CJ Carr, Zack Zukowski, Mathieu Barthet, Yi-Hsuan Yang: 
 DadaGP: A Dataset of Tokenized GuitarPro Songs for Sequence Models. 610-617
- Harald Victor Schweiger, Emilia Parada-Cabaleiro, Markus Schedl: 
 Does Track Sequence in User-generated Playlists Matter? 618-625
- Simon J. Schwär, Sebastian Rosenzweig, Meinard Müller: 
 A Differentiable Cost Measure for Intonation Processing in Polyphonic Music. 626-633
- Pavan Seshadri, Alexander Lerch: 
 Improving Music Performance Assessment With Contrastive Learning. 634-641
- Dougal Shakespeare, Camille Roth: 
 Tracing Affordance and Item Adoption on Music Streaming Platforms. 642-649
- Zhengshan Shi: 
 Computational analysis and modeling of expressive timing in Chopin's Mazurkas. 650-656
- Nithya Nadig Shikarpur, Asawari Keskar, Preeti Rao: 
 Computational analysis of melodic mode switching in raga performance. 657-664
- Qingwei Song, Qiwei Sun, Dongsheng Guo, Haiyong Zheng: 
 SinTra: Learning an inspiration model from a single multi-track music segment. 665-672
- Janne Spijkervet, John Ashley Burgoyne: 
 Contrastive Learning of Musical Representations. 673-681
- Xiaoheng Sun, Qiqi He, Yongwei Gao, Wei Li: 
 Musical Tempo Estimation Using a Multi-scale Network. 682-689
- Pau Torras, Arnau Baró, Lei Kang, Alicia Fornés: 
 On the Integration of Language Models into Sequence to Sequence Architectures for Handwritten Music Recognition. 690-696
- Kosetsu Tsukuda, Keisuke Ishida, Masahiro Hamasaki, Masataka Goto: 
 Kiite Cafe: A Web Service for Getting Together Virtually to Listen to Music. 697-704
- Kosetsu Tsukuda, Masahiro Hamasaki, Masataka Goto: 
 Toward an Understanding of Lyrics-viewing Behavior While Listening to Music on a Smartphone. 705-713
- Andrea Vaglio, Romain Hennequin, Manuel Moussallam, Gaël Richard: 
 The Words Remain the Same: Cover Detection with Lyrics Transcription. 714-721
- Ziyu Wang, Gus Xia: 
 MuseBERT: Pre-training Music Representation for Music Understanding and Controllable Generation. 722-729
- Ju-Chiang Wang, Jordan B. L. Smith, Wei Tsung Lu, Xuchen Song: 
 Supervised Metric Learning For Music Structure Features. 730-737
- Shiqi Wei, Gus Xia: 
 Learning long-term music representations via hierarchical contextual constraints. 738-745
- Christof Weiss, Johannes Zeitler, Tim Zunner, Florian Schuberth, Meinard Müller: 
 Learning Pitch-Class Representations from Score-Audio Pairs of Classical Music. 746-753
- Christof Weiss, Geoffroy Peeters: 
 Training Deep Pitch-Class Representations With a Multi-Label CTC Loss. 754-761
- Daniel Wolff, Rémi Mignot, Axel Roebel: 
 Audio Defect Detection in Music with Deep Networks. 762-768
- Minz Won, Keunwoo Choi, Xavier Serra: 
 Semi-supervised Music Tagging Transformer. 769-776
- Minz Won, Justin Salamon, Nicholas J. Bryan, Gautham J. Mysore, Xavier Serra: 
 Emotion Embedding Spaces for Matching Music to Stories. 777-785
- Abudukelimu Wuerkaixi, Christodoulos Benetatos, Zhiyao Duan, Changshui Zhang: 
 CollageNet: Fusing arbitrary melody and accompaniment into a coherent song. 786-793
- Kazuhiko Yamamoto: 
 Human-in-the-Loop Adaptation for Interactive Musical Beat Tracking. 794-801
- Daniel Yang, Timothy Tsai: 
 Composer Classification With Cross-Modal Transfer Learning and Musically-Informed Augmentation. 802-809
- Daniel Yang, Kevin Ji, Timothy Tsai: 
 Aligning Unsynchronized Part Recordings to a Full Mix Using Iterative Subtractive Alignment. 810-817
- Mickaël Zehren, Marco Alunno, Paolo Bientinesi: 
 ADTOF: A large dataset of non-synthetic music for automatic drum transcription. 818-824
- Huan Zhang, Yiliang Jiang, Tao Jiang, Hu Peng: 
 Learn by Referencing: Towards Deep Metric Learning for Singing Assessment. 825-832
- Jingwei Zhao, Gus Xia: 
 AccoMontage: Accompaniment Arrangement via Phrase Selection and Style Transfer. 833-840















