


Jim Glass
James R. Glass
Person information

- affiliation: Massachusetts Institute of Technology (MIT), CSAIL, Cambridge, MA, USA
2020 – today
- 2023
- [c328]Tianxing He, Jingyu Zhang, Tianle Wang, Sachin Kumar, Kyunghyun Cho, James R. Glass, Yulia Tsvetkov:
On the Blind Spots of Model-Based Evaluation Metrics for Text Generation. ACL (1) 2023: 12067-12097 - [c327]Yung-Sung Chuang, Wei Fang, Shang-Wen Li, Wen-tau Yih, James R. Glass:
Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering. ACL (Findings) 2023: 12131-12147 - [c326]Jiaxin Ge, Hongyin Luo, Yoon Kim, James R. Glass:
Entailment as Robust Self-Learner. ACL (1) 2023: 13803-13817 - [c325]Hongyin Luo, James R. Glass:
Logic Against Bias: Textual Entailment Mitigates Stereotypical Sentence Reasoning. EACL 2023: 1235-1246 - [c324]Nauman Dawalatabad, Sameer Khurana, Antoine Laurent, James R. Glass:
On Unsupervised Uncertainty-Driven Speech Pseudo-Label Filtering and Model Calibration. ICASSP 2023: 1-5 - [c323]Andrew Rouditchenko, Yung-Sung Chuang, Nina Shvetsova, Samuel Thomas, Rogério Feris, Brian Kingsbury, Leonid Karlinsky, David Harwath, Hilde Kuehne, James R. Glass:
C2KD: Cross-Lingual Cross-Modal Knowledge Distillation for Multilingual Text-Video Retrieval. ICASSP 2023: 1-5 - [c322]Yuan Gong, Andrew Rouditchenko, Alexander H. Liu, David Harwath, Leonid Karlinsky, Hilde Kuehne, James R. Glass:
Contrastive Audio-Visual Masked Autoencoder. ICLR 2023 - [c321]Samuel Okhuegbe, Chengwen Zhang, Jiaojiao Dong, Yilu Liu, Austin Walker, Jim Glass:
Flexible Boundary Design for a Chattanooga Microgrid Powered by Landfill Solar Photovoltaic and Battery Storage. ISC2 2023: 1-6 - [c320]David Cheng-Han Chiang, Hung-yi Lee, Yung-Sung Chuang, James R. Glass:
Revealing the Blind Spot of Sentence Encoder Evaluation by HEROS. RepL4NLP@ACL 2023: 289-302 - [c319]Jingyu Zhang, James R. Glass, Tianxing He:
PCFG-Based Natural Language Interface Improves Generalization for Controlled Text Generation. *SEM@ACL 2023: 295-313 - [i144]Hongyin Luo, James R. Glass:
Logic Against Bias: Textual Entailment Mitigates Stereotypical Sentence Reasoning. CoRR abs/2303.05670 (2023) - [i143]Brian Chen, Nina Shvetsova, Andrew Rouditchenko, Daniel Kondermann, Samuel Thomas, Shih-Fu Chang, Rogério Feris, James R. Glass, Hilde Kuehne:
What, when, and where? - Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions. CoRR abs/2303.16990 (2023) - [i142]Tianhua Zhang, Hongyin Luo, Yung-Sung Chuang, Wei Fang, Luc Gaitskell, Thomas Hartvigsen, Xixin Wu, Danny Fox, Helen Meng, James R. Glass:
Interpretable Unified Language Checking. CoRR abs/2304.03728 (2023) - [i141]Alexander H. Liu, Heng-Jui Chang, Michael Auli, Wei-Ning Hsu, James R. Glass:
DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning. CoRR abs/2305.10005 (2023) - [i140]Yuan Gong, Hongyin Luo, Alexander H. Liu, Leonid Karlinsky, James R. Glass:
Listen, Think, and Understand. CoRR abs/2305.10790 (2023) - [i139]Heng-Jui Chang, Alexander H. Liu, James R. Glass:
Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering. CoRR abs/2305.11072 (2023) - [i138]Andrew Rouditchenko, Sameer Khurana, Samuel Thomas, Rogério Feris, Leonid Karlinsky, Hilde Kuehne, David Harwath, Brian Kingsbury, James R. Glass:
Comparison of Multilingual Self-Supervised and Weakly-Supervised Speech Pre-Training for Adaptation to Unseen Languages. CoRR abs/2305.12606 (2023) - [i137]Hongyin Luo, Yung-Sung Chuang, Yuan Gong, Tianhua Zhang, Yoon Kim, Xixin Wu, Danny Fox, Helen Meng, James R. Glass:
SAIL: Search-Augmented Instruction Learning. CoRR abs/2305.15225 (2023) - [i136]Yung-Sung Chuang, Wei Fang, Shang-Wen Li, Wen-tau Yih, James R. Glass:
Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering. CoRR abs/2305.17080 (2023) - [i135]Jiaxin Ge, Hongyin Luo, Yoon Kim, James R. Glass:
Entailment as Robust Self-Learner. CoRR abs/2305.17197 (2023) - [i134]Sameer Khurana, Nauman Dawalatabad, Antoine Laurent, Luis Vicente, Pablo Gimeno, Victoria Mingote, James R. Glass:
Improved Cross-Lingual Transfer Learning For Automatic Speech Translation. CoRR abs/2306.00789 (2023) - [i133]David Cheng-Han Chiang, Yung-Sung Chuang, James R. Glass, Hung-yi Lee:
Revealing the Blind Spot of Sentence Encoder Evaluation by HEROS. CoRR abs/2306.05083 (2023) - [i132]Yuan Gong, Sameer Khurana, Leonid Karlinsky, James R. Glass:
Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong General Audio Event Taggers. CoRR abs/2307.03183 (2023) - [i131]Yung-Sung Chuang, Yujia Xie, Hongyin Luo, Yoon Kim, James R. Glass, Pengcheng He:
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models. CoRR abs/2309.03883 (2023) - [i130]Tianhua Zhang, Jiaxin Ge, Hongyin Luo, Yung-Sung Chuang, Mingye Gao, Yuan Gong, Xixin Wu, Yoon Kim, Helen Meng, James R. Glass:
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning. CoRR abs/2309.10814 (2023) - [i129]Yuan Gong, Alexander H. Liu, Hongyin Luo, Leonid Karlinsky, James R. Glass:
Joint Audio and Speech Understanding. CoRR abs/2309.14405 (2023) - [i128]Junmo Kang, Hongyin Luo, Yada Zhu, James R. Glass, David D. Cox, Alan Ritter, Rogério Feris, Leonid Karlinsky:
Self-Specialization: Uncovering Latent Expertise within Large Language Models. CoRR abs/2310.00160 (2023) - [i127]Cheng-I Jeff Lai, Freda Shi, Puyuan Peng, Yoon Kim, Kevin Gimpel, Shiyu Chang, Yung-Sung Chuang, Saurabhchand Bhati, David D. Cox, David Harwath, Yang Zhang, Karen Livescu, James R. Glass:
Audio-Visual Neural Syntax Acquisition. CoRR abs/2310.07654 (2023) - [i126]Heng-Jui Chang, James R. Glass:
R-Spin: Efficient Speaker and Noise-invariant Representation Learning with Acoustic Pieces. CoRR abs/2311.09117 (2023)
- 2022
- [j38]Kevin P. Schneider, Jim Glass, Cecilia Klauber, Thomas Ben Ollis, Matthew J. Reno, Michael Burck, Lelic Muhidin, Anamika Dubey, Wei Du, Thanh Long Vu, Jing Xie, David Nordy, William Dawson, Javier Hernandez-Alvidrez, Anjan Bose, Dan Ton, Guohui Yuan:
A Framework for Coordinated Self-Assembly of Networked Microgrids Using Consensus Algorithms. IEEE Access 10: 3864-3878 (2022) - [j37]Gene-Ping Yang, Sung-Lin Yeh, Yu-An Chung, James R. Glass, Hao Tang:
Autoregressive Predictive Coding: A Comprehensive Study. IEEE J. Sel. Top. Signal Process. 16(6): 1380-1390 (2022) - [j36]Sameer Khurana, Antoine Laurent, James R. Glass:
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-Level Cross-Lingual Speech Representation. IEEE J. Sel. Top. Signal Process. 16(6): 1493-1504 (2022) - [j35]Yuan Gong, Alexander H. Liu, Andrew Rouditchenko, James R. Glass:
UAVM: Towards Unifying Audio and Visual Models. IEEE Signal Process. Lett. 29: 2437-2441 (2022) - [c318]Yuan Gong, Cheng-I Lai, Yu-An Chung, James R. Glass:
SSAST: Self-Supervised Audio Spectrogram Transformer. AAAI 2022: 10699-10709 - [c317]Alexander H. Liu, SouYoung Jin, Cheng-I Lai, Andrew Rouditchenko, Aude Oliva, James R. Glass:
Cross-Modal Discrete Representation Learning. ACL (1) 2022: 3013-3035 - [c316]Jiabao Ji, Yoon Kim, James R. Glass, Tianxing He:
Controlling the Focus of Pretrained Language Generation Models. ACL (Findings) 2022: 3291-3306 - [c315]Nina Shvetsova, Brian Chen, Andrew Rouditchenko, Samuel Thomas, Brian Kingsbury, Rogério Feris, David Harwath, James R. Glass, Hilde Kuehne:
Everything at Once - Multi-modal Fusion Transformer for Video Retrieval. CVPR 2022: 19988-19997 - [c314]Nauman Dawalatabad, Yuan Gong, Sameer Khurana, Rhoda Au, James R. Glass:
Detecting Dementia from Long Neuropsychological Interviews. EMNLP (Findings) 2022: 5270-5283 - [c313]Yuan Gong, Jin Yu, James R. Glass:
Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition. ICASSP 2022: 151-155 - [c312]Sameer Khurana, Antoine Laurent, James R. Glass:
Magic Dust for Cross-Lingual Adaptation of Monolingual Wav2vec-2.0. ICASSP 2022: 6647-6651 - [c311]R'mani Haulcy, Katerina Placek, Brian Tracey, Adam Vogel, James R. Glass:
Repetition Assessment for Speech and Language Disorders: A Study of the Logopenic Variant of Primary Progressive Aphasia. ICASSP 2022: 6932-6936 - [c310]Yuan Gong, Ziyi Chen, Iek-Heng Chu, Peng Chang, James R. Glass:
Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment. ICASSP 2022: 7262-7266 - [c309]Cheng-I Jeff Lai, Erica Cooper, Yang Zhang, Shiyu Chang, Kaizhi Qian, Yi-Lun Liao, Yung-Sung Chuang, Alexander H. Liu, Junichi Yamagishi, David D. Cox, James R. Glass:
On the Interplay between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis. ICASSP 2022: 8447-8451 - [c308]Alexander H. Liu, Cheng-I Lai, Wei-Ning Hsu, Michael Auli, Alexei Baevski, James R. Glass:
Simple and Effective Unsupervised Speech Synthesis. INTERSPEECH 2022: 843-847 - [c307]Christopher Song, David Harwath, Tuka Alhanai, James R. Glass:
Speak: A Toolkit Using Amazon Mechanical Turk to Collect and Validate Speech Audio Recordings. LREC 2022: 7253-7258 - [c306]Hongyin Luo, Shang-Wen Li, Mingye Gao, Seunghak Yu, James R. Glass:
Cooperative Self-training of Machine Reading Comprehension. NAACL-HLT 2022: 244-257 - [c305]Yung-Sung Chuang, Rumen Dangovski, Hongyin Luo, Yang Zhang, Shiyu Chang, Marin Soljacic, Shang-Wen Li, Scott Yih, Yoon Kim, James R. Glass:
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings. NAACL-HLT 2022: 4207-4218 - [i125]Jiabao Ji, Yoon Kim, James R. Glass, Tianxing He:
Controlling the Focus of Pretrained Language Generation Models. CoRR abs/2203.01146 (2022) - [i124]Yuan Gong, Sameer Khurana, Andrew Rouditchenko, James R. Glass:
CMKD: CNN/Transformer-Based Cross-Model Knowledge Distillation for Audio Classification. CoRR abs/2203.06760 (2022) - [i123]Alexander H. Liu, Cheng-I Jeff Lai, Wei-Ning Hsu, Michael Auli, Alexei Baevski, James R. Glass:
Simple and Effective Unsupervised Speech Synthesis. CoRR abs/2204.02524 (2022) - [i122]Yung-Sung Chuang, Rumen Dangovski, Hongyin Luo, Yang Zhang, Shiyu Chang, Marin Soljacic, Shang-Wen Li, Wen-tau Yih, Yoon Kim, James R. Glass:
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings. CoRR abs/2204.10298 (2022) - [i121]Yuan Gong, Ziyi Chen, Iek-Heng Chu, Peng Chang, James R. Glass:
Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment. CoRR abs/2205.03432 (2022) - [i120]Yuan Gong, Jin Yu, James R. Glass:
Vocalsound: A Dataset for Improving Human Vocal Sounds Recognition. CoRR abs/2205.03433 (2022) - [i119]Sameer Khurana, Antoine Laurent, James R. Glass:
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation. CoRR abs/2205.08180 (2022) - [i118]Vijay Gadepally, Gregory Angelides, Andrei Barbu, Andrew Bowne, Laura J. Brattain, Tamara Broderick, Armando Cabrera, Glenn Carl, Ronisha Carter, Miriam Cha, Emilie Cowen, Jesse Cummings, Bill Freeman, James R. Glass, Sam Goldberg, Mark Hamilton, Thomas Heldt, Kuan Wei Huang, Phillip Isola, Boris Katz, Jamie Koerner, Yen-Chen Lin, David Mayo, Kyle McAlpin, Taylor Perron, Jean E. Piou, Hrishikesh M. Rao, Hayley Reynolds, Kaira Samuel, Siddharth Samsi, Morgan Schmidt, Leslie Shing, Olga Simek, Brandon Swenson, Vivienne Sze, Jonathan Taylor, Paul Tylkin, Mark Veillette, Matthew L. Weiss, Allan B. Wollaber, Sophia Yuditskaya, Jeremy Kepner:
Developing a Series of AI Challenges for the United States Department of the Air Force. CoRR abs/2207.07033 (2022) - [i117]Yuan Gong, Alexander H. Liu, Andrew Rouditchenko, James R. Glass:
UAVM: A Unified Model for Audio-Visual Learning. CoRR abs/2208.00061 (2022) - [i116]Andrew Rouditchenko, Yung-Sung Chuang, Nina Shvetsova, Samuel Thomas, Rogério Feris, Brian Kingsbury, Leonid Karlinsky, David Harwath, Hilde Kuehne, James R. Glass:
C2KD: Cross-Lingual Cross-Modal Knowledge Distillation for Multilingual Text-Video Retrieval. CoRR abs/2210.03625 (2022) - [i115]Jingyu Zhang, James R. Glass, Tianxing He:
PCFG-based Natural Language Interface Improves Generalization for Controlled Text Generation. CoRR abs/2210.07431 (2022) - [i114]Yuan Gong, Andrew Rouditchenko, Alexander H. Liu, David Harwath, Leonid Karlinsky, Hilde Kuehne, James R. Glass:
Contrastive Audio-Visual Masked Autoencoder. CoRR abs/2210.07839 (2022) - [i113]Nauman Dawalatabad, Sameer Khurana, Antoine Laurent, James R. Glass:
On Unsupervised Uncertainty-Driven Speech Pseudo-Label Filtering and Model Calibration. CoRR abs/2211.07795 (2022) - [i112]Tianxing He, Jingyu Zhang, Tianle Wang, Sachin Kumar, Kyunghyun Cho, James R. Glass, Yulia Tsvetkov:
On the Blind Spots of Model-Based Evaluation Metrics for Text Generation. CoRR abs/2212.10020 (2022)
- 2021
- [j34]Lin Zhu, Chengwen Zhang, He Yin, Dingrui Li, Yu Su, Ishita Ray, Jiaojiao Dong, Fred Wang, Leon M. Tolbert, Yilu Liu, Yiwei Ma, Bruce Rogers, Jim Glass, Lilian Bruce, Samuel Delay, Peter Gregory, Mario Garcia-Sanz, Mirjana Marden:
A Smart and Flexible Microgrid With a Low-Cost Scalable Open-Source Controller. IEEE Access 9: 162214-162230 (2021) - [j33]Yuan Gong, Yu-An Chung, James R. Glass:
PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3292-3306 (2021) - [c304]Wei-Ning Hsu, David Harwath, Tyler Miller, Christopher Song, James R. Glass:
Text-Free Image-to-Speech Synthesis Using Learned Segmental Units. ACL/IJCNLP (1) 2021: 5284-5300 - [c303]Mathew Monfort, SouYoung Jin, Alexander H. Liu, David Harwath, Rogério Feris, James R. Glass, Aude Oliva:
Spoken Moments: Learning Joint Audio-Visual Representations From Video Descriptions. CVPR 2021: 14871-14881 - [c302]Tianxing He, Jun Liu, Kyunghyun Cho, Myle Ott, Bing Liu, James R. Glass, Fuchun Peng:
Analyzing the Forgetting Problem in Pretrain-Finetuning of Open-domain Dialogue Response Models. EACL 2021: 1121-1133 - [c301]Tianxing He, Jingzhao Zhang, Zhiming Zhou, James R. Glass:
Exposure Bias versus Self-Recovery: Are Distortions Really Incremental for Autoregressive Text Generation? EMNLP (1) 2021: 5087-5102 - [c300]Yu-An Chung, Yonatan Belinkov, James R. Glass:
Similarity Analysis of Self-Supervised Speech Representations. ICASSP 2021: 3040-3044 - [c299]Cheng-I Lai, Yung-Sung Chuang, Hung-Yi Lee, Shang-Wen Li, James R. Glass:
Semi-Supervised Spoken Language Understanding via Self-Supervised Speech and Language Model Pretraining. ICASSP 2021: 7468-7472 - [c298]Brian Chen, Andrew Rouditchenko, Kevin Duarte, Hilde Kuehne, Samuel Thomas, Angie W. Boggust, Rameswar Panda, Brian Kingsbury, Rogério Feris, David Harwath, James R. Glass, Michael Picheny, Shih-Fu Chang:
Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos. ICCV 2021: 7992-8001 - [c297]Yuan Gong, Yu-An Chung, James R. Glass:
AST: Audio Spectrogram Transformer. Interspeech 2021: 571-575 - [c296]Andrew Rouditchenko, Angie W. Boggust, David Harwath, Brian Chen, Dhiraj Joshi, Samuel Thomas, Kartik Audhkhasi, Hilde Kuehne, Rameswar Panda, Rogério Schmidt Feris, Brian Kingsbury, Michael Picheny, Antonio Torralba, James R. Glass:
AVLnet: Learning Audio-Visual Language Representations from Instructional Videos. Interspeech 2021: 1584-1588 - [c295]R'mani Haulcy, James R. Glass:
CLAC: A Speech Corpus of Healthy English Speakers. Interspeech 2021: 2966-2970 - [c294]Andrew Rouditchenko, Angie W. Boggust, David Harwath, Samuel Thomas, Hilde Kuehne, Brian Chen, Rameswar Panda, Rogério Feris, Brian Kingsbury, Michael Picheny, James R. Glass:
Cascaded Multilingual Audio-Visual Learning from Videos. Interspeech 2021: 3006-3010 - [c293]Hongyin Luo, James R. Glass, Garima Lalwani, Yi Zhang, Shang-Wen Li:
Joint Retrieval-Extraction Training for Evidence-Aware Dialog Response Selection. Interspeech 2021: 3241-3245 - [c292]Ian Palmer, Andrew Rouditchenko, Andrei Barbu, Boris Katz, James R. Glass:
Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset. Interspeech 2021: 3650-3654 - [c291]Alexander H. Liu, Yu-An Chung, James R. Glass:
Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies. Interspeech 2021: 3730-3734 - [c290]Cheng-I Jeff Lai, Yang Zhang, Alexander H. Liu, Shiyu Chang, Yi-Lun Liao, Yung-Sung Chuang, Kaizhi Qian, Sameer Khurana, David D. Cox, Jim Glass:
PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition. NeurIPS 2021: 21256-21272 - [c289]Seunghak Yu, Giovanni Da San Martino, Mitra Mohtarami, James R. Glass, Preslav Nakov:
Interpretable Propaganda Detection in News Articles. RANLP 2021: 1597-1605 - [i111]Hongyin Luo, Shang-Wen Li, James R. Glass:
Knowledge Grounded Conversational Symptom Detection with Graph Memory Networks. CoRR abs/2101.09773 (2021) - [i110]Yuan Gong, Yu-An Chung, James R. Glass:
PSLA: Improving Audio Event Classification with Pretraining, Sampling, Labeling, and Aggregation. CoRR abs/2102.01243 (2021) - [i109]Hongyin Luo, Shang-Wen Li, Seunghak Yu, James R. Glass:
Cooperative Learning of Zero-Shot Machine Reading Comprehension. CoRR abs/2103.07449 (2021) - [i108]Yuan Gong, Yu-An Chung, James R. Glass:
AST: Audio Spectrogram Transformer. CoRR abs/2104.01778 (2021) - [i107]Brian Chen, Andrew Rouditchenko, Kevin Duarte, Hilde Kuehne, Samuel Thomas, Angie W. Boggust, Rameswar Panda, Brian Kingsbury, Rogério Schmidt Feris, David Harwath, James R. Glass, Michael Picheny, Shih-Fu Chang:
Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos. CoRR abs/2104.12671 (2021) - [i106]Mathew Monfort, SouYoung Jin, Alexander H. Liu, David Harwath, Rogério Feris, James R. Glass, Aude Oliva:
Spoken Moments: Learning Joint Audio-Visual Representations from Video Descriptions. CoRR abs/2105.04489 (2021) - [i105]Alexander H. Liu, SouYoung Jin, Cheng-I Jeff Lai, Andrew Rouditchenko, Aude Oliva, James R. Glass:
Cross-Modal Discrete Representation Learning. CoRR abs/2106.05438 (2021) - [i104]Cheng-I Jeff Lai, Yang Zhang, Alexander H. Liu, Shiyu Chang, Yi-Lun Liao, Yung-Sung Chuang, Kaizhi Qian, Sameer Khurana, David D. Cox, James R. Glass:
PARP: Prune, Adjust and Re-Prune for Self-Supervised Speech Recognition. CoRR abs/2106.05933 (2021) - [i103]Yung-Sung Chuang, Mingye Gao, Hongyin Luo, James R. Glass, Hung-Yi Lee, Yun-Nung Chen, Shang-Wen Li:
Mitigating Biases in Toxic Language Detection through Invariant Rationalization. CoRR abs/2106.07240 (2021) - [i102]Seunghak Yu, Giovanni Da San Martino, Mitra Mohtarami, James R. Glass, Preslav Nakov:
Interpretable Propaganda Detection in News Articles. CoRR abs/2108.12802 (2021) - [i101]Tianxing He, Kyunghyun Cho, James R. Glass:
An Empirical Study on Few-shot Knowledge Probing for Pretrained Language Models. CoRR abs/2109.02772 (2021) - [i100]Cheng-I Jeff Lai, Erica Cooper, Yang Zhang, Shiyu Chang, Kaizhi Qian, Yi-Lun Liao, Yung-Sung Chuang, Alexander H. Liu, Junichi Yamagishi, David D. Cox, James R. Glass:
On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis. CoRR abs/2110.01147 (2021) - [i99]Sameer Khurana, Antoine Laurent, James R. Glass:
Magic dust for cross-lingual adaptation of monolingual wav2vec-2.0. CoRR abs/2110.03560 (2021) - [i98]Ian Palmer, Andrew Rouditchenko, Andrei Barbu, Boris Katz, James R. Glass:
Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset. CoRR abs/2110.07575 (2021) - [i97]Yuan Gong, Cheng-I Jeff Lai, Yu-An Chung, James R. Glass:
SSAST: Self-Supervised Audio Spectrogram Transformer. CoRR abs/2110.09784 (2021) - [i96]Andrew Rouditchenko, Angie W. Boggust, David Harwath, Samuel Thomas, Hilde Kuehne, Brian Chen, Rameswar Panda, Rogério Feris, Brian Kingsbury, Michael Picheny, James R. Glass:
Cascaded Multilingual Audio-Visual Learning from Videos. CoRR abs/2111.04823 (2021) - [i95]Kevin Duarte, Brian Chen, Nina Shvetsova, Andrew Rouditchenko, Samuel Thomas, Alexander H. Liu, David Harwath, James R. Glass, Hilde Kuehne, Mubarak Shah:
Routing with Self-Attention for Multimodal Capsule Networks. CoRR abs/2112.00775 (2021) - [i94]Nina Shvetsova, Brian Chen, Andrew Rouditchenko, Samuel Thomas, Brian Kingsbury, Rogério Feris, David Harwath, James R. Glass, Hilde Kuehne:
Everything at Once - Multi-modal Fusion Transformer for Video Retrieval. CoRR abs/2112.04446 (2021)
- 2020
- [j32]Yonatan Belinkov, Nadir Durrani, Fahim Dalvi, Hassan Sajjad, James R. Glass:
On the Linguistic Representational Power of Neural Machine Translation Models. Comput. Linguistics 46(1): 1-52 (2020) - [j31]David Harwath, Adrià Recasens, Dídac Surís, Galen Chuang, Antonio Torralba, James R. Glass:
Jointly Discovering Visual Objects and Spoken Words from Raw Sensory Input. Int. J. Comput. Vis. 128(3): 620-641 (2020) - [c288]Tianxing He, James R. Glass:
Negative Training for Neural Dialogue Response Generation. ACL 2020: 2044-2058 - [c287]Yu-An Chung, James R. Glass:
Improved Speech Representations with Multi-Target Autoregressive Predictive Coding. ACL 2020: 2353-2358 - [c286]Ramy Baly, Georgi Karadzhov, Jisun An, Haewoon Kwak, Yoan Dinkov, Ahmed Ali, James R. Glass, Preslav Nakov:
What Was Written vs. Who Read It: News Media Profiling Using Text Analysis and Social Media Context. ACL 2020: 3364-3374 - [c285]John M. Wu, Yonatan Belinkov, Hassan Sajjad, Nadir Durrani, Fahim Dalvi, James R. Glass:
Similarity Analysis of Contextual Word Representation Models. ACL 2020: 4638-4655 - [c284]Hongyin Luo, Shang-Wen Li, James R. Glass:
Knowledge Grounded Conversational Symptom Detection with Graph Memory Networks. ClinicalNLP@EMNLP 2020: 136-145 - [c283]Ramy Baly, Giovanni Da San Martino, James R. Glass, Preslav Nakov:
We Can Detect Your Bias: Predicting the Political Ideology of News Articles. EMNLP (1) 2020: 4982-4991 - [c282]Yu-An Chung, James R. Glass:
Generative Pre-Training for Speech with Autoregressive Predictive Coding. ICASSP 2020: 3497-3501 - [c281]Yasunori Ohishi, Akisato Kimura, Takahito Kawanishi, Kunio Kashino, David Harwath, James R. Glass:
Trilingual Semantic Embeddings of Visually Grounded Speech with Self-Attention Mechanisms. ICASSP 2020: 4352-4356 - [c280]François Grondin, Hao Tang, James R. Glass:
Audio-Visual Calibration with Polynomial Regression for 2-D Projection Using SVD-PHAT. ICASSP 2020: 4856-4860 - [c279]Jennifer Drexler, James R. Glass:
Learning a Subword Inventory Jointly with End-to-End Automatic Speech Recognition. ICASSP 2020: 6439-6443 - [c278]Suwon Shon, Ahmed Ali, Younes Samih, Hamdy Mubarak, James R. Glass:
ADI17: A Fine-Grained Arabic Dialect Identification Dataset. ICASSP 2020: 8244-8248 - [c277]David Harwath, Wei-Ning Hsu, James R. Glass:
Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech. ICLR 2020 - [c276]Moin Nadeem, Tianxing He, Kyunghyun Cho, James R. Glass:
A Systematic Characterization of Sampling Algorithms for Open-ended Language Generation. AACL/IJCNLP 2020: 334-346 - [c275]Michael Gump, Wei-Ning Hsu, James R. Glass:
Unsupervised Methods for Evaluating Speech Representations. INTERSPEECH 2020: 170-174 - [c274]Shammur A. Chowdhury, Ahmed Ali, Suwon Shon, James R. Glass:
What Does an End-to-End Dialect Identification Model Learn About Non-Dialectal Information? INTERSPEECH 2020: 462-466 - [c273]