


Mark Hasegawa-Johnson (also published as: Mark A. Hasegawa-Johnson)
2020 – today
2024
- [j41] Bashima Islam, Nancy L. McElwain, Jialu Li, Maria I. Davila, Yannan Hu, Kexin Hu, Jordan M. Bodway, Ashutosh Dhekne, Romit Roy Choudhury, Mark Hasegawa-Johnson: Preliminary Technical Validation of LittleBeats™: A Multimodal Sensing Platform to Capture Cardiac Physiology, Motion, and Vocalizations. Sensors 24(3): 901 (2024)
- [c222] Eunseop Yoon, Hee Suk Yoon, SooHwan Eom, Gunsoo Han, Daniel Wontae Nam, Daejin Jo, Kyoung-Woon On, Mark Hasegawa-Johnson, Sungwoong Kim, Chang Dong Yoo: TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback. ACL (Findings) 2024: 14969-14981
- [c221] Mohammad Nur Hossain Khan, Nancy L. McElwain, Mark Hasegawa-Johnson, Bashima Islam: InfantMotion2Vec: Unlabeled Data-Driven Infant Pose Estimation Using a Single Chest IMU. BSN 2024: 1-4
- [c220] Mohammad Nur Hossain Khan, Jialu Li, Nancy L. McElwain, Mark Hasegawa-Johnson, Bashima Islam: Sound Tagging in Infant-centric Home Soundscapes. CHASE 2024: 142-146
- [c219] Maliha Jahan, Helin Wang, Thomas Thebaud, Yinglun Sun, Giang Ha Le, Zsuzsanna Fagyal, Odette Scharenborg, Mark Hasegawa-Johnson, Laureano Moro-Velázquez, Najim Dehak: Finding Spoken Identifications: Using GPT-4 Annotation for an Efficient and Fast Dataset Creation Pipeline. LREC/COLING 2024: 7296-7306
- [c218] SooHwan Eom, Jay Shim, Gwanhyeong Koo, Haebin Na, Mark Hasegawa-Johnson, Sungwoong Kim, Chang Dong Yoo: Query-based Cross-Modal Projector Bolstering Mamba Multimodal LLM. EMNLP (Findings) 2024: 14158-14167
- [c217] Abhayjeet Singh, Amala Nagireddi, Deekshitha G, Jesuraja Bandekar, Roopa R., Sandhya Badiger, Sathvik Udupa, Prasanta Kumar Ghosh, Hema A. Murthy, Pranaw Kumar, Keiichi Tokuda, Mark Hasegawa-Johnson, Philipp Olbrich: LIMMITS'24: Multi-Speaker, Multi-Lingual Indic TTS with Voice Cloning. ICASSP Workshops 2024: 61-62
- [c216] Jialu Li, Mark Hasegawa-Johnson, Nancy L. McElwain: Analysis of Self-Supervised Speech Models on Children's Speech and Infant Vocalizations. ICASSP Workshops 2024: 550-554
- [c215] Heting Gao, Mark Hasegawa-Johnson, Chang D. Yoo: G2PU: Grapheme-To-Phoneme Transducer with Speech Units. ICASSP 2024: 10061-10065
- [c214] Liming Wang, Mark Hasegawa-Johnson, Chang D. Yoo: Unsupervised Speech Recognition with N-skipgram and Positional Unigram Matching. ICASSP 2024: 10936-10940
- [c213] SooHwan Eom, Eunseop Yoon, Hee Suk Yoon, Chanwoo Kim, Mark Hasegawa-Johnson, Chang D. Yoo: AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition. ICASSP 2024: 12707-12711
- [c212] Hee Suk Yoon, Eunseop Yoon, Joshua Tian Jin Tee, Mark A. Hasegawa-Johnson, Yingzhen Li, Chang D. Yoo: C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion. ICLR 2024
- [c211] Heting Gao, Kaizhi Qian, Junrui Ni, Chuang Gan, Mark A. Hasegawa-Johnson, Shiyu Chang, Yang Zhang: Speech Self-Supervised Learning Using Diffusion Model Synthetic Data. ICML 2024
- [i62] Jialu Li, Mark Hasegawa-Johnson, Nancy L. McElwain: Analysis of Self-Supervised Speech Models on Children's Speech and Infant Vocalizations. CoRR abs/2402.06888 (2024)
- [i61] Hee Suk Yoon, Eunseop Yoon, Joshua Tian Jin Tee, Mark Hasegawa-Johnson, Yingzhen Li, Chang D. Yoo: C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion. CoRR abs/2403.14119 (2024)
- [i60] Junrui Ni, Liming Wang, Yang Zhang, Kaizhi Qian, Heting Gao, Mark Hasegawa-Johnson, Chang D. Yoo: Towards Unsupervised Speech Recognition Without Pronunciation Models. CoRR abs/2406.08380 (2024)
- [i59] Mohammad Nur Hossain Khan, Jialu Li, Nancy L. McElwain, Mark Hasegawa-Johnson, Bashima Islam: Sound Tagging in Infant-centric Home Soundscapes. CoRR abs/2406.17190 (2024)
- [i58] Eunseop Yoon, Hee Suk Yoon, SooHwan Eom, Gunsoo Han, Daniel Wontae Nam, Daejin Jo, Kyoung-Woon On, Mark A. Hasegawa-Johnson, Sungwoong Kim, Chang D. Yoo: TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback. CoRR abs/2407.16574 (2024)
- [i57] Eunseop Yoon, Hee Suk Yoon, John B. Harvill, Mark Hasegawa-Johnson, Chang D. Yoo: LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition. CoRR abs/2408.05769 (2024)
- [i56] Junkai Wu, Xulin Fan, Bo-Ru Lu, Xilin Jiang, Nima Mesgarani, Mark Hasegawa-Johnson, Mari Ostendorf: Just ASR + LLM? A Study on Speech Large Language Models' Ability to Identify and Understand Speaker in Spoken Dialogue. CoRR abs/2409.04927 (2024)
- [i55] Xiuwen Zheng, Bornali Phukon, Mark Hasegawa-Johnson: Fine-Tuning Automatic Speech Recognition for People with Parkinson's: An Effective Strategy for Enhancing Speech Technology Accessibility. CoRR abs/2409.19818 (2024)
- [i54] Sandeep Nagar, Mark Hasegawa-Johnson, David G. Beiser, Narendra Ahuja: R2I-rPPG: A Robust Region of Interest Selection Method for Remote Photoplethysmography to Extract Heart Rate. CoRR abs/2410.15851 (2024)

2023
- [j40] Oshane O. Thomas, Hongyu Shen, Ryan L. Raaum, William E. H. Harcourt-Smith, John D. Polk, Mark Hasegawa-Johnson: Automated morphological phenotyping using learned shape descriptors and functional maps: A novel approach to geometric morphometrics. PLoS Comput. Biol. 19(1) (2023)
- [c210] Liming Wang, Mark Hasegawa-Johnson, Chang Dong Yoo: A Theory of Unsupervised Speech Recognition. ACL (1) 2023: 1192-1215
- [c209] Liming Wang, Junrui Ni, Heting Gao, Jialu Li, Kai Chieh Chang, Xulin Fan, Junkai Wu, Mark Hasegawa-Johnson, Chang Dong Yoo: Listen, Decipher and Sign: Toward Unsupervised Speech-to-Sign Language Recognition. ACL (Findings) 2023: 6785-6800
- [c208] Eunseop Yoon, Hee Suk Yoon, John B. Harvill, Mark Hasegawa-Johnson, Chang Dong Yoo: INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition. ACL (Findings) 2023: 9893-9902
- [c207] Kai Chieh Chang, Mark Hasegawa-Johnson, Nancy L. McElwain, Bashima Islam: Classification of Infant Sleep/Wake States: Cross-Attention among Large Scale Pretrained Transformer Networks using Audio, ECG, and IMU Data. APSIPA ASC 2023: 2370-2377
- [c206] Abhayjeet Singh, Amala Nagireddi, Deekshitha G, Jesuraja Bandekar, Roopa R., Sandhya Badiger, Sathvik Udupa, Prasanta Kumar Ghosh, Hema A. Murthy, Heiga Zen, Pranaw Kumar, Kamal Kant, Amol Bole, Bira Chandra Singh, Keiichi Tokuda, Mark Hasegawa-Johnson, Philipp Olbrich: Lightweight, Multi-Speaker, Multi-Lingual Indic Text-to-Speech. ICASSP 2023: 1-2
- [c205] Zhongweiyang Xu, Xulin Fan, Mark Hasegawa-Johnson: Dual-Path Cross-Modal Attention for Better Audio-Visual Speech Extraction. ICASSP 2023: 1-5
- [c204] Jialu Li, Mark Hasegawa-Johnson, Nancy L. McElwain: Towards Robust Family-Infant Audio Analysis Based on Unsupervised Pretraining of Wav2vec 2.0 on Large-Scale Unlabeled Family Audio. INTERSPEECH 2023: 1035-1039
- [c203] Eunseop Yoon, Hee Suk Yoon, Dhananjaya Gowda, SooHwan Eom, Daehyeok Kim, John B. Harvill, Heting Gao, Mark Hasegawa-Johnson, Chanwoo Kim, Chang D. Yoo: Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction. INTERSPEECH 2023: 2028-2032
- [c202] Wonjune Kang, Mark Hasegawa-Johnson, Deb Roy: End-to-End Zero-Shot Voice Conversion with Location-Variable Convolutions. INTERSPEECH 2023: 2303-2307
- [c201] Wanyue Zhai, Mark Hasegawa-Johnson: Wav2ToBI: a new approach to automatic ToBI transcription. INTERSPEECH 2023: 2748-2752
- [c200] John B. Harvill, Mark Hasegawa-Johnson, Hee Suk Yoon, Chang D. Yoo, Eunseop Yoon: One-Shot Exemplification Modeling via Latent Sense Representations. RepL4NLP@ACL 2023: 303-314
- [i53] Jialu Li, Mark Hasegawa-Johnson, Nancy L. McElwain: Towards Robust Family-Infant Audio Analysis Based on Unsupervised Pretraining of Wav2vec 2.0 on Large-Scale Unlabeled Family Audio. CoRR abs/2305.12530 (2023)
- [i52] Eunseop Yoon, Hee Suk Yoon, John B. Harvill, Mark Hasegawa-Johnson, Chang D. Yoo: INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition. CoRR abs/2305.16371 (2023)
- [i51] Liming Wang, Mark A. Hasegawa-Johnson, Chang D. Yoo: A Theory of Unsupervised Speech Recognition. CoRR abs/2306.07926 (2023)
- [i50] Kai Chieh Chang, Mark Hasegawa-Johnson, Nancy L. McElwain, Bashima Islam: Classification of Infant Sleep/Wake States: Cross-Attention among Large Scale Pretrained Transformer Networks using Audio, ECG, and IMU Data. CoRR abs/2306.15808 (2023)
- [i49] Eunseop Yoon, Hee Suk Yoon, Dhananjaya Gowda, SooHwan Eom, Daehyeok Kim, John B. Harvill, Heting Gao, Mark Hasegawa-Johnson, Chanwoo Kim, Chang D. Yoo: Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction. CoRR abs/2308.08442 (2023)
- [i48] Jialu Li, Mark Hasegawa-Johnson, Karrie Karahalios: Enhancing Child Vocalization Classification in Multi-Channel Child-Adult Conversations Through Wav2vec2 Children ASR Features. CoRR abs/2309.07287 (2023)
- [i47] Liming Wang, Mark Hasegawa-Johnson, Chang D. Yoo: Unsupervised Speech Recognition with N-Skipgram and Positional Unigram Matching. CoRR abs/2310.02382 (2023)
- [i46] Zhonghao Wang, Wei Wei, Yang Zhao, Zhisheng Xiao, Mark Hasegawa-Johnson, Humphrey Shi, Tingbo Hou: HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion Models. CoRR abs/2312.00079 (2023)

2022
- [j39] Piotr Zelasko, Siyuan Feng, Laureano Moro-Velázquez, Ali Abavisani, Saurabhchand Bhati, Odette Scharenborg, Mark Hasegawa-Johnson, Najim Dehak: Discovering phonetic inventories with crosslingual automatic speech recognition. Comput. Speech Lang. 74: 101358 (2022)
- [j38] Heting Gao, Junrui Ni, Yang Zhang, Kaizhi Qian, Shiyu Chang, Mark Hasegawa-Johnson: Domain Generalization for Language-Independent Automatic Speech Recognition. Frontiers Artif. Intell. 5: 806274 (2022)
- [j37] Heting Gao, Xiaoxuan Wang, Sunghun Kang, Rusty Mina, Dias Issa, John B. Harvill, Leda Sari, Mark Hasegawa-Johnson, Chang D. Yoo: Seamless equal accuracy ratio for inclusive CTC speech recognition. Speech Commun. 136: 76-83 (2022)
- [j36] Jialu Li, Mark Hasegawa-Johnson: Autosegmental Neural Nets 2.0: An Extensive Study of Training Synchronous and Asynchronous Phones and Tones for Under-Resourced Tonal Languages. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1918-1926 (2022)
- [c199] Junghyun Lee, Gwangsu Kim, Mahbod Olfat, Mark Hasegawa-Johnson, Chang D. Yoo: Fast and Efficient MMD-Based Fair PCA via Optimization over Stiefel Manifold. AAAI 2022: 7363-7371
- [c198] Liming Wang, Siyuan Feng, Mark Hasegawa-Johnson, Chang Dong Yoo: Self-supervised Semantic-driven Phoneme Discovery for Zero-resource Speech Recognition. ACL (1) 2022: 8027-8047
- [c197] Raymond A. Yeh, Yuan-Ting Hu, Mark Hasegawa-Johnson, Alexander G. Schwing: Equivariance Discovery by Learned Parameter-Sharing. AISTATS 2022: 1527-1545
- [c196] John B. Harvill, Yash R. Wani, Mustafa Alam, Narendra Ahuja, Mark Hasegawa-Johnson, David Chestek, David G. Beiser: Estimation of Respiratory Rate from Breathing Audio. EMBC 2022: 4599-4603
- [c195] Hee Suk Yoon, Eunseop Yoon, John B. Harvill, Sunjae Yoon, Mark Hasegawa-Johnson, Chang Dong Yoo: SMSMix: Sense-Maintained Sentence Mixup for Word Sense Disambiguation. EMNLP (Findings) 2022: 1493-1502
- [c194] John B. Harvill, Yash R. Wani, Moitreya Chatterjee, Mustafa Alam, David G. Beiser, David Chestek, Mark Hasegawa-Johnson, Narendra Ahuja: Detection of Covid-19 from Joint Time and Frequency Analysis of Speech, Breathing and Cough Audio. ICASSP 2022: 3683-3687
- [c193] Chak Ho Chan, Kaizhi Qian, Yang Zhang, Mark Hasegawa-Johnson: SpeechSplit2.0: Unsupervised Speech Disentanglement for Voice Conversion without Tuning Autoencoder Bottlenecks. ICASSP 2022: 6332-6336
- [c192] Haeyong Kang, Rusty John Lloyd Mina, Sultan Rizky Hikmawan Madjid, Jaehong Yoon, Mark Hasegawa-Johnson, Sung Ju Hwang, Chang D. Yoo: Forget-free Continual Learning with Winning Subnetworks. ICML 2022: 10734-10750
- [c191] Kaizhi Qian, Yang Zhang, Heting Gao, Junrui Ni, Cheng-I Lai, David D. Cox, Mark Hasegawa-Johnson, Shiyu Chang: ContentVec: An Improved Self-Supervised Speech Representation by Disentangling Speakers. ICML 2022: 18003-18017
- [c190] Junrui Ni, Liming Wang, Heting Gao, Kaizhi Qian, Yang Zhang, Shiyu Chang, Mark Hasegawa-Johnson: Unsupervised Text-to-Speech Synthesis by Unsupervised Automatic Speech Recognition. INTERSPEECH 2022: 461-465
- [c189] Mahir Morshed, Mark Hasegawa-Johnson: Cross-lingual articulatory feature information transfer for speech recognition using recurrent progressive neural networks. INTERSPEECH 2022: 2298-2302
- [c188] Heting Gao, Junrui Ni, Kaizhi Qian, Yang Zhang, Shiyu Chang, Mark Hasegawa-Johnson: WavPrompt: Towards Few-Shot Spoken Language Understanding with Frozen Language Models. INTERSPEECH 2022: 2738-2742
- [c187] John B. Harvill, Mark Hasegawa-Johnson, Chang D. Yoo: Frame-Level Stutter Detection. INTERSPEECH 2022: 2843-2847
- [c186] John B. Harvill, Roxana Girju, Mark Hasegawa-Johnson: Syn2Vec: Synset Colexification Graphs for Lexical Semantic Similarity. NAACL-HLT 2022: 5259-5270
- [i45] Piotr Zelasko, Siyuan Feng, Laureano Moro-Velázquez, Ali Abavisani, Saurabhchand Bhati, Odette Scharenborg, Mark Hasegawa-Johnson, Najim Dehak: Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition. CoRR abs/2201.11207 (2022)
- [i44] Chak Ho Chan, Kaizhi Qian, Yang Zhang, Mark Hasegawa-Johnson: SpeechSplit 2.0: Unsupervised Speech Disentanglement for Voice Conversion Without Tuning Autoencoder Bottlenecks. CoRR abs/2203.14156 (2022)
- [i43] Jialu Li, Mark Hasegawa-Johnson, Nancy L. McElwain: Visualizations of Complex Sequences of Family-Infant Vocalizations Using Bag-of-Audio-Words Approach Based on Wav2vec 2.0 Features. CoRR abs/2203.15183 (2022)
- [i42] Junrui Ni, Liming Wang, Heting Gao, Kaizhi Qian, Yang Zhang, Shiyu Chang, Mark Hasegawa-Johnson: Unsupervised Text-to-Speech Synthesis by Unsupervised Automatic Speech Recognition. CoRR abs/2203.15796 (2022)
- [i41] Heting Gao, Junrui Ni, Kaizhi Qian, Yang Zhang, Shiyu Chang, Mark Hasegawa-Johnson: WAVPROMPT: Towards Few-Shot Spoken Language Understanding with Frozen Language Models. CoRR abs/2203.15863 (2022)
- [i40] Raymond A. Yeh, Yuan-Ting Hu, Mark Hasegawa-Johnson, Alexander G. Schwing: Equivariance Discovery by Learned Parameter-Sharing. CoRR abs/2204.03640 (2022)
- [i39] Kaizhi Qian, Yang Zhang, Heting Gao, Junrui Ni, Cheng-I Lai, David D. Cox, Mark Hasegawa-Johnson, Shiyu Chang: Improving Self-Supervised Speech Representations by Disentangling Speakers. CoRR abs/2204.09224 (2022)
- [i38] Zhongweiyang Xu, Xulin Fan, Mark Hasegawa-Johnson: Dual-path Attention is All You Need for Audio-Visual Speech Extraction. CoRR abs/2207.04213 (2022)
- [i37] Hee Suk Yoon, Eunseop Yoon, John B. Harvill, Sunjae Yoon, Mark Hasegawa-Johnson, Chang D. Yoo: SMSMix: Sense-Maintained Sentence Mixup for Word Sense Disambiguation. CoRR abs/2212.07072 (2022)

2021
- [j35] Jialu Li, Mark Hasegawa-Johnson, Nancy L. McElwain: Analysis of acoustic and voice quality features for the classification of infant and mother vocalizations. Speech Commun. 133: 41-61 (2021)
- [j34] Leda Sari, Mark Hasegawa-Johnson, Samuel Thomas: Auxiliary Networks for Joint Speaker Adaptation and Speaker Change Detection. IEEE ACM Trans. Audio Speech Lang. Process. 29: 324-333 (2021)
- [j33] Xinsheng Wang, Justin van der Hout, Jihua Zhu, Mark Hasegawa-Johnson, Odette Scharenborg: Synthesizing Spoken Descriptions of Images. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3242-3254 (2021)
- [j32] Leda Sari, Mark Hasegawa-Johnson, Chang D. Yoo: Counterfactually Fair Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3515-3525 (2021)
- [c185] Liming Wang, Mark Hasegawa-Johnson: A Translation Framework for Visually Grounded Spoken Unit Discovery. ACSCC 2021: 1419-1425
- [c184] Junzhe Zhu, Raymond A. Yeh, Mark Hasegawa-Johnson: Multi-Decoder DPRNN: Source Separation for Variable Number of Speakers. ICASSP 2021: 3420-3424
- [c183] Hui Shi, Yang Zhang, Hao Wu, Shiyu Chang, Kaizhi Qian, Mark Hasegawa-Johnson, Jishen Zhao: Continuous CNN for Nonuniform Time Series. ICASSP 2021: 3550-3554
- [c182] Xinsheng Wang, Siyuan Feng, Jihua Zhu, Mark Hasegawa-Johnson, Odette Scharenborg: Show and Speak: Directly Synthesize Spoken Description of Images. ICASSP 2021: 4190-4194
- [c181] John B. Harvill, Dias Issa, Mark Hasegawa-Johnson, Chang Dong Yoo: Synthesis of New Words for Improved Dysarthric Speech Recognition on an Expanded Vocabulary. ICASSP 2021: 6428-6432
- [c180] Junzhe Zhu, Mark Hasegawa-Johnson, Nancy L. McElwain: A Comparison Study on Infant-Parent Voice Diarization. ICASSP 2021: 7178-7182
- [c179] Siyuan Feng, Piotr Zelasko, Laureano Moro-Velázquez, Ali Abavisani, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak: How Phonotactics Affect Multilingual and Zero-Shot ASR Performance. ICASSP 2021: 7238-7242
- [c178] Liming Wang, Xinsheng Wang, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak: Align or Attend? Toward More Efficient and Accurate Spoken Word Discovery Using Speech-to-Image Retrieval. ICASSP 2021: 7603-7607
- [c177] Zhonghao Wang, Kai Wang, Mo Yu, Jinjun Xiong, Wen-Mei Hwu, Mark Hasegawa-Johnson, Humphrey Shi: Interpretable Visual Reasoning via Induced Symbolic Space. ICCV 2021: 1858-1867
- [c176] Kaizhi Qian, Yang Zhang, Shiyu Chang, Jinjun Xiong, Chuang Gan, David D. Cox, Mark Hasegawa-Johnson: Global Prosody Style Transfer Without Text Transcriptions. ICML 2021: 8650-8660
- [c175] John B. Harvill, Yash R. Wani, Mark Hasegawa-Johnson, Narendra Ahuja, David G. Beiser, David Chestek: Classification of COVID-19 from Cough Using Autoregressive Predictive Coding Pretraining and Spectral Data Augmentation. Interspeech 2021: 926-930
- [c174] Heting Gao, Junrui Ni, Yang Zhang, Kaizhi Qian, Shiyu Chang, Mark Hasegawa-Johnson: Zero-Shot Cross-Lingual Phonetic Recognition with External Language Embedding. Interspeech 2021: 1304-1308
- [c173] Kiran Ramnath, Leda Sari, Mark Hasegawa-Johnson, Chang D. Yoo: Worldly Wise (WoW) - Cross-Lingual Knowledge Fusion for Fact-based Visual Spoken-Question Answering. NAACL-HLT 2021: 1908-1919
- [i36] Kaizhi Qian, Yang Zhang, Shiyu Chang, Jinjun Xiong, Chuang Gan, David D. Cox, Mark Hasegawa-Johnson: Global Rhythm Style Transfer Without Text Transcriptions. CoRR abs/2106.08519 (2021)
- [i35] Junghyun Lee, Gwangsu Kim, Matt Olfat, Mark Hasegawa-Johnson, Chang D. Yoo: Fast and Efficient MMD-based Fair PCA via Optimization over Stiefel Manifold. CoRR abs/2109.11196 (2021)

2020
- [j31] Odette Scharenborg, Lucas Ondel, Shruti Palaskar, Philip Arthur, Francesco Ciannella, Mingxing Du, Elin Larsen, Danny Merkx, Rachid Riad, Liming Wang, Emmanuel Dupoux, Laurent Besacier, Alan W. Black, Mark Hasegawa-Johnson, Florian Metze, Graham Neubig, Sebastian Stüker, Pierre Godard, Markus Müller: Speech Technology for Unwritten Languages. IEEE ACM Trans. Audio Speech Lang. Process. 28: 964-975 (2020)
- [j30] Liming Wang, Mark Hasegawa-Johnson: Multimodal Word Discovery and Retrieval With Spoken Descriptions and Visual Concepts. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1560-1573 (2020)
- [c172] Tarek Sakakini, Jong Yoon Lee, Aditya Duri, Renato Ferreira Leitão Azevedo, Victor Sadauskas, Kuangxiao Gu, Suma Bhat, Daniel G. Morrow, James Graumlich, Saqib Walayat, Mark Hasegawa-Johnson, Thomas S. Huang, Ann Willemsen-Dunlap, Donald Halpin: Context-Aware Automatic Text Simplification of Health Materials in Low-Resource Domains. LOUHI@EMNLP 2020: 115-126
- [c171] Kaizhi Qian, Zeyu Jin, Mark Hasegawa-Johnson, Gautham J. Mysore: F0-Consistent Many-To-Many Non-Parallel Voice Conversion Via Conditional Autoencoder. ICASSP 2020: 6284-6288
- [c170] Leda Sari, Samuel Thomas, Mark Hasegawa-Johnson: Training Spoken Language Understanding Systems with Non-Parallel Speech and Text. ICASSP 2020: 8109-8113
- [c169] Kaizhi Qian, Yang Zhang, Shiyu Chang, Mark Hasegawa-Johnson, David D. Cox: Unsupervised Speech Decomposition via Triple Information Bottleneck. ICML 2020: 7836-7846
- [c168] Jialu Li, Mark Hasegawa-Johnson: Autosegmental Neural Nets: Should Phones and Tones be Synchronous or Asynchronous? INTERSPEECH 2020: 1027-1031
- [c167] Ali Abavisani, Mark Hasegawa-Johnson: Automatic Estimation of Intelligibility Measure for Consonants in Speech. INTERSPEECH 2020: 1161-1165
- [c166] Liming Wang, Mark Hasegawa-Johnson: A DNN-HMM-DNN Hybrid Model for Discovering Word-Like Units from Spoken Captions and Image Regions. INTERSPEECH 2020: 1456-1460
- [c165] Leda Sari, Mark Hasegawa-Johnson: Deep F-Measure Maximization for End-to-End Speech Understanding. INTERSPEECH 2020: 1580-1584
- [c164] Justin van der Hout, Zoltán D'Haese, Mark Hasegawa-Johnson, Odette Scharenborg: Evaluating Automatically Generated Phoneme Captions for Images. INTERSPEECH 2020: 2317-2321
- [c163] Junzhe Zhu, Mark Hasegawa-Johnson, Leda Sari: Identify Speakers in Cocktail Parties with End-to-End Attention. INTERSPEECH 2020: 3092-3096
- [c162] Piotr Zelasko, Laureano Moro-Velázquez, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak: That Sounds Familiar: An Analysis of Phonetic Representations Transfer Across Languages. INTERSPEECH 2020: 3705-3709
- [c161] Mark Hasegawa-Johnson, Leanne Rolston, Camille Goudeseune, Gina-Anne Levow, Katrin Kirchhoff: Grapheme-to-Phoneme Transduction for Cross-Language ASR. SLSP 2020: 3-19
- [i34] Kaizhi Qian, Zeyu Jin, Mark Hasegawa-Johnson, Gautham J. Mysore: F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder. CoRR abs/2004.07370 (2020)
- [i33] Kaizhi Qian, Yang Zhang, Shiyu Chang, David D. Cox, Mark Hasegawa-Johnson: Unsupervised Speech Decomposition via Triple Information Bottleneck. CoRR abs/2004.11284 (2020)
- [i32] Ali Abavisani, Mark Hasegawa-Johnson: Automatic Estimation of Intelligibility Measure for Consonants in Speech. CoRR abs/2005.06065 (2020)
- [i31] Piotr Zelasko, Laureano Moro-Velázquez, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak: That Sounds Familiar: An Analysis of Phonetic Representations Transfer Across Languages. CoRR abs/2005.08118 (2020)
- [i30] Junzhe Zhu, Mark Hasegawa-Johnson, Leda Sari: Identify Speakers in Cocktail Parties with End-to-End Attention. CoRR abs/2005.11408 (2020)
- [i29] Jialu Li, Mark Hasegawa-Johnson: Autosegmental Neural Nets: Should Phones and Tones be Synchronous or Asynchronous? CoRR abs/2007.14351 (2020)
- [i28] Justin van der Hout, Zoltán D'Haese, Mark Hasegawa-Johnson, Odette Scharenborg: Evaluating Automatically Generated Phoneme Captions for Images. CoRR abs/2007.15916 (2020)
- [i27] Leda Sari, Mark Hasegawa-Johnson: Deep F-measure Maximization for End-to-End Speech Understanding. CoRR abs/2008.03425 (2020)
- [i26] Wenda Chen, Jonathan Huang, Mark Hasegawa-Johnson: Utterance-level Intent Recognition from Keywords. CoRR abs/2009.08064 (2020)
- [i25] Siyuan Feng, Piotr Zelasko, Laureano Moro-Velázquez, Ali Abavisani, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak: How Phonotactics Affect Multilingual and Zero-shot ASR Performance. CoRR abs/2010.12104 (2020)
- [i24] Xinsheng Wang, Siyuan Feng, Jihua Zhu, Mark Hasegawa-Johnson, Odette Scharenborg: Show and Speak: Directly Synthesize Spoken Description of Images. CoRR abs/2010.12267 (2020)
- [i23] Junzhe Zhu, Mark Hasegawa-Johnson, Nancy McElwain: A Comparison Study on Infant-Parent Voice Diarization. CoRR abs/2011.02698 (2020)
- [i22] Zhonghao Wang, Mo Yu, Kai Wang, Jinjun Xiong, Wen-Mei Hwu, Mark Hasegawa-Johnson, Humphrey Shi: Interpretable Visual Reasoning via Induced Symbolic Space. CoRR abs/2011.11603 (2020)
- [i21] Junzhe Zhu, Raymond A. Yeh, Mark Hasegawa-Johnson: Multi-Decoder DPRNN: High Accuracy Source Counting and Separation. CoRR abs/2011.12022 (2020)
- [i20]