default search action
Suwon Shon
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c37]Siddhant Arora, Ankita Pasad, Chung-Ming Chien, Jionghao Han, Roshan S. Sharma, Jee-weon Jung, Hira Dhamyal, William Chen, Suwon Shon, Hung-yi Lee, Karen Livescu, Shinji Watanabe:
On the Evaluation of Speech Foundation Models for Spoken Language Understanding. ACL (Findings) 2024: 11923-11938 - [c36]Roshan Sharma, Suwon Shon, Mark Lindsey, Hira Dhamyal, Bhiksha Raj:
Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization? ACL (1) 2024: 14779-14797 - [c35]Suwon Shon, Kwangyoun Kim, Prashant Sridhar, Yi-Te Hsu, Shinji Watanabe, Karen Livescu:
Generative Context-Aware Fine-Tuning of Self-Supervised Speech Models. ICASSP 2024: 11156-11160 - [c34]Jiyang Tang, Kwangyoun Kim, Suwon Shon, Felix Wu, Prashant Sridhar:
Improving ASR Contextual Biasing with Guided Attention. ICASSP 2024: 12096-12100 - [i29]Jiyang Tang, Kwangyoun Kim, Suwon Shon, Felix Wu, Prashant Sridhar, Shinji Watanabe:
Improving ASR Contextual Biasing with Guided Attention. CoRR abs/2401.08835 (2024) - [i28]Suwon Shon, Kwangyoun Kim, Yi-Te Hsu, Prashant Sridhar, Shinji Watanabe, Karen Livescu:
DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding. CoRR abs/2406.09345 (2024) - [i27]Siddhant Arora, Ankita Pasad, Chung-Ming Chien, Jionghao Han, Roshan S. Sharma, Jee-weon Jung, Hira Dhamyal, William Chen, Suwon Shon, Hung-yi Lee, Karen Livescu, Shinji Watanabe:
On the Evaluation of Speech Foundation Models for Spoken Language Understanding. CoRR abs/2406.10083 (2024) - [i26]Roshan S. Sharma, Suwon Shon, Mark Lindsey, Hira Dhamyal, Rita Singh, Bhiksha Raj:
Speech vs. Transcript: Does It Matter for Human Annotators in Speech Summarization? CoRR abs/2408.07277 (2024) - 2023
- [c33]Suwon Shon, Siddhant Arora, Chyi-Jiunn Lin, Ankita Pasad, Felix Wu, Roshan S. Sharma, Wei-Lun Wu, Hung-yi Lee, Karen Livescu, Shinji Watanabe:
SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks. ACL (1) 2023: 8906-8937 - [c32]Suwon Shon, Felix Wu, Kwangyoun Kim, Prashant Sridhar, Karen Livescu, Shinji Watanabe:
Context-Aware Fine-Tuning of Self-Supervised Speech Models. ICASSP 2023: 1-5 - [c31]Yifan Peng, Kwangyoun Kim, Felix Wu, Brian Yan, Siddhant Arora, William Chen, Jiyang Tang, Suwon Shon, Prashant Sridhar, Shinji Watanabe:
A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks. INTERSPEECH 2023: 2208-2212 - [i25]Yifan Peng, Kwangyoun Kim, Felix Wu, Brian Yan, Siddhant Arora, William Chen, Jiyang Tang, Suwon Shon, Prashant Sridhar, Shinji Watanabe:
A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks. CoRR abs/2305.11073 (2023) - [i24]Suwon Shon, Kwangyoun Kim, Prashant Sridhar, Yi-Te Hsu, Shinji Watanabe, Karen Livescu:
Generative Context-aware Fine-tuning of Self-supervised Speech Models. CoRR abs/2312.09895 (2023) - 2022
- [c30]Suwon Shon, Ankita Pasad, Felix Wu, Pablo Brusco, Yoav Artzi, Karen Livescu, Kyu Jeong Han:
SLUE: New Benchmark Tasks For Spoken Language Understanding Evaluation on Natural Speech. ICASSP 2022: 7927-7931 - [c29]Ankita Pasad, Felix Wu, Suwon Shon, Karen Livescu, Kyu Jeong Han:
On the Use of External Data for Spoken Named Entity Recognition. NAACL-HLT 2022: 724-737 - [i23]Suwon Shon, Felix Wu, Kwangyoun Kim, Prashant Sridhar, Karen Livescu, Shinji Watanabe:
Context-aware Fine-tuning of Self-supervised Speech Models. CoRR abs/2212.08542 (2022) - [i22]Suwon Shon, Siddhant Arora, Chyi-Jiunn Lin, Ankita Pasad, Felix Wu, Roshan S. Sharma, Wei-Lun Wu, Hung-Yi Lee, Karen Livescu, Shinji Watanabe:
SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks. CoRR abs/2212.10525 (2022) - 2021
- [c28]Suwon Shon, Pablo Brusco, Jing Pan, Kyu Jeong Han, Shinji Watanabe:
Leveraging Pre-Trained Language Model for Speech Sentiment Analysis. Interspeech 2021: 3420-3424 - [i21]Suwon Shon, Pablo Brusco, Jing Pan, Kyu Jeong Han, Shinji Watanabe:
Leveraging Pre-trained Language Model for Speech Sentiment Analysis. CoRR abs/2106.06598 (2021) - [i20]Suwon Shon, Ankita Pasad, Felix Wu, Pablo Brusco, Yoav Artzi, Karen Livescu, Kyu Jeong Han:
SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech. CoRR abs/2111.10367 (2021) - [i19]Ankita Pasad, Felix Wu, Suwon Shon, Karen Livescu, Kyu Jeong Han:
On the Use of External Data for Spoken Named Entity Recognition. CoRR abs/2112.07648 (2021) - 2020
- [c27]Suwon Shon, Ahmed Ali, Younes Samih, Hamdy Mubarak, James R. Glass:
ADI17: A Fine-Grained Arabic Dialect Identification Dataset. ICASSP 2020: 8244-8248 - [c26]Shammur A. Chowdhury, Ahmed Ali, Suwon Shon, James R. Glass:
What Does an End-to-End Dialect Identification Model Learn About Non-Dialectal Information? INTERSPEECH 2020: 462-466 - [c25]Suwon Shon, James R. Glass:
Multimodal Association for Speaker Verification. INTERSPEECH 2020: 2247-2251
2010 – 2019
- 2019
- [j4]Achintya Kumar Sarkar, Zheng-Hua Tan, Hao Tang, Suwon Shon, James R. Glass:
Time-Contrastive Learning Based Deep Bottleneck Features for Text-Dependent Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 27(8): 1267-1279 (2019) - [c24]Ahmed Ali, Suwon Shon, Younes Samih, Hamdy Mubarak, Ahmed Abdelali, James R. Glass, Steve Renals, Khalid Choukri:
The MGB-5 Challenge: Recognition and Dialect Identification of Dialectal Arabic Speech. ASRU 2019: 1026-1033 - [c23]Seongkyu Mun, Suwon Shon:
Domain Mismatch Robust Acoustic Scene Classification Using Channel Information Conversion. ICASSP 2019: 845-849 - [c22]Suwon Shon, Tae-Hyun Oh, James R. Glass:
Noise-tolerant Audio-visual Online Person Verification Using an Attention-based Neural Network Fusion. ICASSP 2019: 3995-3999 - [c21]Suwon Shon, Ahmed Ali, James R. Glass:
Domain Attentive Fusion for End-to-end Dialect Identification with Unknown Target Domain. ICASSP 2019: 5951-5955 - [c20]Suwon Shon, Najim Dehak, Douglas A. Reynolds, James R. Glass:
MCE 2018: The 1st Multi-Target Speaker Detection and Identification Challenge Evaluation. INTERSPEECH 2019: 356-360 - [c19]Jesús Villalba, Nanxin Chen, David Snyder, Daniel Garcia-Romero, Alan McCree, Gregory Sell, Jonas Borgstrom, Fred Richardson, Suwon Shon, François Grondin, Réda Dehak, Leibny Paola García-Perera, Daniel Povey, Pedro A. Torres-Carrasquillo, Sanjeev Khudanpur, Najim Dehak:
State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18. INTERSPEECH 2019: 1488-1492 - [c18]Suwon Shon, Hao Tang, James R. Glass:
VoiceID Loss: Speech Enhancement for Speaker Verification. INTERSPEECH 2019: 2888-2892 - [c17]Suwon Shon, Younggun Lee, Taesu Kim:
Large-Scale Speaker Retrieval on Random Speaker Variability Subspace. INTERSPEECH 2019: 2963-2967 - [i18]Suwon Shon, Hao Tang, James R. Glass:
VoiceID Loss: Speech Enhancement for Speaker Verification. CoRR abs/1904.03601 (2019) - [i17]Suwon Shon, Najim Dehak, Douglas A. Reynolds, James R. Glass:
MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation. CoRR abs/1904.04240 (2019) - [i16]Achintya Kumar Sarkar, Zheng-Hua Tan, Hao Tang, Suwon Shon, James R. Glass:
Time-Contrastive Learning Based Deep Bottleneck Features for Text-Dependent Speaker Verification. CoRR abs/1905.04554 (2019) - 2018
- [c16]Maryam Najafian, Sameer Khurana, Suwon Shon, Ahmed Ali, James R. Glass:
Exploiting Convolutional Neural Networks for Phonotactic Based Dialect Identification. ICASSP 2018: 5174-5178 - [c15]Suwon Shon, Ahmed Ali, James R. Glass:
Convolutional Neural Network and Language Embeddings for End-to-End Dialect Recognition. Odyssey 2018: 98-104 - [c14]Suwon Shon, Wei-Ning Hsu, James R. Glass:
Unsupervised Representation Learning of Speech for Dialect Identification. SLT 2018: 105-111 - [c13]Suwon Shon, Hao Tang, James R. Glass:
Frame-Level Speaker Embeddings for Text-Independent Speaker Recognition and Analysis of End-to-End Model. SLT 2018: 1007-1013 - [c12]Marcos Zampieri, Shervin Malmasi, Preslav Nakov, Ahmed Ali, Suwon Shon, James R. Glass, Yves Scherrer, Tanja Samardzic, Nikola Ljubesic, Jörg Tiedemann, Chris van der Lee, Stefan Grondelaers, Nelleke Oostdijk, Dirk Speelman, Antal van den Bosch, Ritesh Kumar, Bornini Lahiri, Mayank Jain:
Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign. VarDial@COLING 2018 2018: 1-17 - [i15]Suwon Shon, Ahmed Ali, James R. Glass:
Convolutional Neural Networks and Language Embeddings for End-to-End Dialect Recognition. CoRR abs/1803.04567 (2018) - [i14]Suwon Shon, Najim Dehak, Douglas A. Reynolds, James R. Glass:
MCE 2018: The 1st Multi-target Speaker Detection and Identification Challenge Evaluation (MCE) Plan, Dataset and Baseline System. CoRR abs/1807.06663 (2018) - [i13]Suwon Shon, Hao Tang, James R. Glass:
Frame-level speaker embeddings for text-independent speaker recognition and analysis of end-to-end model. CoRR abs/1809.04437 (2018) - [i12]Suwon Shon, Wei-Ning Hsu, James R. Glass:
Unsupervised Representation Learning of Speech for Dialect Identification. CoRR abs/1809.04458 (2018) - [i11]Suwon Shon, Younggun Lee, Taesu Kim:
Large-scale Speaker Retrieval on Random Speaker Variability Subspace. CoRR abs/1811.10812 (2018) - [i10]Suwon Shon, Tae-Hyun Oh, James R. Glass:
Noise-tolerant Audio-visual Online Person Verification using an Attention-based Neural Network Fusion. CoRR abs/1811.10813 (2018) - [i9]Suwon Shon, Ahmed Ali, James R. Glass:
Domain Attentive Fusion for End-to-end Dialect Identification with Unknown Target Domain. CoRR abs/1812.01501 (2018) - [i8]Seongkyu Mun, Suwon Shon:
Domain Mismatch Robust Acoustic Scene Classification using Channel Information Conversion. CoRR abs/1812.01731 (2018) - 2017
- [j3]Seongkyu Mun, Minkyu Shin, Suwon Shon, Wooil Kim, David K. Han, Hanseok Ko:
DNN Transfer Learning Based Non-Linear Feature Extraction for Acoustic Event Classification. IEICE Trans. Inf. Syst. 100-D(9): 2249-2252 (2017) - [j2]Seongkyu Mun, Suwon Shon, Wooil Kim, David K. Han, Hanseok Ko:
A Novel Discriminative Feature Extraction for Acoustic Scene Classification Using RNN Based Source Separation. IEICE Trans. Inf. Syst. 100-D(12): 3041-3044 (2017) - [c11]Suwon Shon, Ahmed Ali, James R. Glass:
MIT-QCRI Arabic dialect identification system for the 2017 multi-genre broadcast challenge. ASRU 2017: 374-380 - [c10]Seongkyu Mun, Suwon Shon, Wooil Kim, David K. Han, Hanseok Ko:
Deep Neural Network based learning and transferring mid-level audio features for acoustic scene classification. ICASSP 2017: 796-800 - [c9]Suwon Shon, Seongkyu Mun, Wooil Kim, Hanseok Ko:
Autoencoder Based Domain Adaptation for Speaker Recognition Under Insufficient Channel Information. INTERSPEECH 2017: 1014-1018 - [c8]Suwon Shon, Seongkyu Mun, Hanseok Ko:
Recursive Whitening Transformation for Speaker Recognition on Language Mismatched Condition. INTERSPEECH 2017: 2869-2873 - [i7]Suwon Shon, Hanseok Ko:
KU-ISPL Speaker Recognition Systems under Language mismatch condition for NIST 2016 Speaker Recognition Evaluation. CoRR abs/1702.00956 (2017) - [i6]Suwon Shon, Seongkyu Mun, Wooil Kim, Hanseok Ko:
Autoencoder based Domain Adaptation for Speaker Recognition under Insufficient Channel Information. CoRR abs/1708.01227 (2017) - [i5]Suwon Shon, Seongkyu Mun, Hanseok Ko:
Recursive Whitening Transformation for Speaker Recognition on Language Mismatched Condition. CoRR abs/1708.01232 (2017) - [i4]Seongkyu Mun, Minkyu Shin, Suwon Shon, Wooil Kim, David K. Han, Hanseok Ko:
DNN Transfer Learning based Non-linear Feature Extraction for Acoustic Event Classification. CoRR abs/1708.03465 (2017) - [i3]Suwon Shon, Ahmed Ali, James R. Glass:
MIT-QCRI Arabic Dialect Identification System for the 2017 Multi-Genre Broadcast Challenge. CoRR abs/1709.00387 (2017) - 2016
- [c7]Seongkyu Mun, Suwon Shon, Wooil Kim, Hanseok Ko:
Deep Neural Network Bottleneck Features for Acoustic Event Recognition. INTERSPEECH 2016: 2954-2957 - [c6]Seong Jae Lee, Daehun Kim, Suwon Shon, Seongkyu Mun, Minkyu Shin, Youngseng Chen, Sejong Hyung, Mohammed Harris, Hanseok Ko:
KU-ISPL TRECVID 2016 Multimedia Event Detection System. TRECVID 2016 - [i2]Suwon Shon, Seongkyu Mun, John H. L. Hansen, Hanseok Ko:
KU-ISPL Language Recognition System for NIST 2015 i-Vector Machine Learning Challenge. CoRR abs/1609.06404 (2016) - [i1]Suwon Shon, Seongkyu Mun, David K. Han, Hanseok Ko:
Non-negative matrix factorization-based subband decomposition for acoustic source localization. CoRR abs/1610.04695 (2016) - 2015
- [c5]Suwon Shon, Seongkyu Mun, David K. Han, Hanseok Ko:
Maximum likelihood Linear Dimension Reduction of heteroscedastic feature for robust Speaker Recognition. AVSS 2015: 1-5 - [c4]Seongkyu Mun, Suwon Shon, Wooil Kim, Hanseok Ko:
Robust speaker direction estimation with microphone array using NMF for smart TV interaction. ICCE 2015: 112-113 - 2014
- [c3]Sungkyu Moon, Suwon Shon, Wooil Kim, David K. Han:
Generalized cross-correlation based noise robust abnormal acoustic event localization utilizing non-negative matrix factorization. AVSS 2014: 171-174 - 2013
- [c2]Suwon Shon, David K. Han, Hanseok Ko:
Abnormal acoustic event localization based on selective frequency bin in high noise environment for audio surveillance. AVSS 2013: 87-92 - 2012
- [j1]Suwon Shon, David K. Han, Jounghoon Beh, Hanseok Ko:
Full Azimuth Multiple Sound Source Localization with 3-Channel Microphone Array. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 95-A(4): 745-750 (2012) - [c1]Suwon Shon, Eric Kim, Jongsung Yoon, Hanseok Ko:
Sudden noise source localization system for intelligent automobile application with acoustic sensors. ICCE 2012: 233-234
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 21:19 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint