default search action
Ron J. Weiss
Person information
- affiliation: Google
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2022
- [c51]Gary Wang, Ekin D. Cubuk, Andrew Rosenberg, Shuyang Cheng, Ron J. Weiss, Bhuvana Ramabhadran, Pedro J. Moreno, Quoc V. Le, Daniel S. Park:
G-Augment: Searching for the Meta-Structure of Data Augmentation Policies for ASR. SLT 2022: 23-30 - [i32]Gary Wang, Ekin D. Cubuk, Andrew Rosenberg, Shuyang Cheng, Ron J. Weiss, Bhuvana Ramabhadran, Pedro J. Moreno, Quoc V. Le, Daniel S. Park:
G-Augment: Searching for the Meta-Structure of Data Augmentation Policies for ASR. CoRR abs/2210.10879 (2022) - 2021
- [c50]Ron J. Weiss, R. J. Skerry-Ryan, Eric Battenberg, Soroosh Mariooryad, Diederik P. Kingma:
Wave-Tacotron: Spectrogram-Free End-to-End Text-to-Speech Synthesis. ICASSP 2021: 5679-5683 - [c49]Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang, Ye Jia, Ron J. Weiss, Yonghui Wu:
Parallel Tacotron: Non-Autoregressive and Controllable TTS. ICASSP 2021: 5709-5713 - [c48]Nanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, William Chan:
WaveGrad: Estimating Gradients for Waveform Generation. ICLR 2021 - [c47]Peidong Wang, Tara N. Sainath, Ron J. Weiss:
Multitask Training with Text Data for End-to-End Speech Recognition. Interspeech 2021: 2566-2570 - [c46]Nanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, Najim Dehak, William Chan:
WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis. Interspeech 2021: 3765-3769 - [c45]Scott Wisdom, Aren Jansen, Ron J. Weiss, Hakan Erdogan, John R. Hershey:
Sparse, Efficient, and Semantic Mixture Invariant Training: Taming In-the-Wild Unsupervised Sound Separation. WASPAA 2021: 51-55 - [i31]Scott Wisdom, Aren Jansen, Ron J. Weiss, Hakan Erdogan, John R. Hershey:
Sparse, Efficient, and Semantic Mixture Invariant Training: Taming In-the-Wild Unsupervised Sound Separation. CoRR abs/2106.00847 (2021) - [i30]Nanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, Najim Dehak, William Chan:
WaveGrad 2: Iterative Refinement for Text-to-Speech Synthesis. CoRR abs/2106.09660 (2021) - 2020
- [c44]Guangzhi Sun, Yu Zhang, Ron J. Weiss, Yuan Cao, Heiga Zen, Yonghui Wu:
Fully-Hierarchical Fine-Grained Prosody Modeling For Interpretable Speech Synthesis. ICASSP 2020: 6264-6268 - [c43]Guangzhi Sun, Yu Zhang, Ron J. Weiss, Yuan Cao, Heiga Zen, Andrew Rosenberg, Bhuvana Ramabhadran, Yonghui Wu:
Generating Diverse and Natural Text-to-Speech Samples Using a Quantized Fine-Grained VAE and Autoregressive Prosody Prior. ICASSP 2020: 6699-6703 - [c42]Tara N. Sainath, Ruoming Pang, Ron J. Weiss, Yanzhang He, Chung-Cheng Chiu, Trevor Strohman:
An Attention-Based Joint Acoustic and Text on-Device End-To-End Model. ICASSP 2020: 7039-7043 - [c41]Scott Wisdom, Efthymios Tzinis, Hakan Erdogan, Ron J. Weiss, Kevin W. Wilson, John R. Hershey:
Unsupervised Sound Separation Using Mixture Invariant Training. NeurIPS 2020 - [i29]Guangzhi Sun, Yu Zhang, Ron J. Weiss, Yuan Cao, Heiga Zen, Yonghui Wu:
Fully-hierarchical fine-grained prosody modeling for interpretable speech synthesis. CoRR abs/2002.03785 (2020) - [i28]Guangzhi Sun, Yu Zhang, Ron J. Weiss, Yuan Cao, Heiga Zen, Andrew Rosenberg, Bhuvana Ramabhadran, Yonghui Wu:
Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and auto-regressive prosody prior. CoRR abs/2002.03788 (2020) - [i27]Scott Wisdom, Efthymios Tzinis, Hakan Erdogan, Ron J. Weiss, Kevin W. Wilson, John R. Hershey:
Unsupervised Sound Separation Using Mixtures of Mixtures. CoRR abs/2006.12701 (2020) - [i26]Nanxin Chen, Yu Zhang, Heiga Zen, Ron J. Weiss, Mohammad Norouzi, William Chan:
WaveGrad: Estimating Gradients for Waveform Generation. CoRR abs/2009.00713 (2020) - [i25]Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang, Ye Jia, Ron J. Weiss, Yonghui Wu:
Parallel Tacotron: Non-Autoregressive and Controllable TTS. CoRR abs/2010.11439 (2020) - [i24]Peidong Wang, Tara N. Sainath, Ron J. Weiss:
Multitask Training with Text Data for End-to-End Speech Recognition. CoRR abs/2010.14318 (2020) - [i23]Ron J. Weiss, R. J. Skerry-Ryan, Eric Battenberg, Soroosh Mariooryad, Diederik P. Kingma:
Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis. CoRR abs/2011.03568 (2020)
2010 – 2019
- 2019
- [j6]Jan Chorowski, Ron J. Weiss, Samy Bengio, Aäron van den Oord:
Unsupervised Speech Representation Learning Using WaveNet Autoencoders. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 2041-2053 (2019) - [c40]Joseph M. Antognini, Matt Hoffman, Ron J. Weiss:
Audio Texture Synthesis with Random Neural Networks: Improving Diversity and Quality. ICASSP 2019: 3587-3591 - [c39]Jinxi Guo, Tara N. Sainath, Ron J. Weiss:
A Spelling Correction Model for End-to-end Speech Recognition. ICASSP 2019: 5651-5655 - [c38]Wei-Ning Hsu, Yu Zhang, Ron J. Weiss, Yu-An Chung, Yuxuan Wang, Yonghui Wu, James R. Glass:
Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorization. ICASSP 2019: 5901-5905 - [c37]Ye Jia, Melvin Johnson, Wolfgang Macherey, Ron J. Weiss, Yuan Cao, Chung-Cheng Chiu, Naveen Ari, Stella Laurenzo, Yonghui Wu:
Leveraging Weakly Supervised Data to Improve End-to-end Speech-to-text Translation. ICASSP 2019: 7180-7184 - [c36]Wei-Ning Hsu, Yu Zhang, Ron J. Weiss, Heiga Zen, Yonghui Wu, Yuxuan Wang, Yuan Cao, Ye Jia, Zhifeng Chen, Jonathan Shen, Patrick Nguyen, Ruoming Pang:
Hierarchical Generative Modeling for Controllable Speech Synthesis. ICLR (Poster) 2019 - [c35]Ye Jia, Ron J. Weiss, Fadi Biadsy, Wolfgang Macherey, Melvin Johnson, Zhifeng Chen, Yonghui Wu:
Direct Speech-to-Speech Translation with a Sequence-to-Sequence Model. INTERSPEECH 2019: 1123-1127 - [c34]Heiga Zen, Viet Dang, Rob Clark, Yu Zhang, Ron J. Weiss, Ye Jia, Zhifeng Chen, Yonghui Wu:
LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech. INTERSPEECH 2019: 1526-1530 - [c33]Yu Zhang, Ron J. Weiss, Heiga Zen, Yonghui Wu, Zhifeng Chen, R. J. Skerry-Ryan, Ye Jia, Andrew Rosenberg, Bhuvana Ramabhadran:
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning. INTERSPEECH 2019: 2080-2084 - [c32]Quan Wang, Hannah Muckenhirn, Kevin W. Wilson, Prashant Sridhar, Zelin Wu, John R. Hershey, Rif A. Saurous, Ron J. Weiss, Ye Jia, Ignacio López-Moreno:
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking. INTERSPEECH 2019: 2728-2732 - [c31]Fadi Biadsy, Ron J. Weiss, Pedro J. Moreno, Dimitri Kanvesky, Ye Jia:
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation. INTERSPEECH 2019: 4115-4119 - [i22]Jan Chorowski, Ron J. Weiss, Samy Bengio, Aäron van den Oord:
Unsupervised speech representation learning using WaveNet autoencoders. CoRR abs/1901.08810 (2019) - [i21]Jinxi Guo, Tara N. Sainath, Ron J. Weiss:
A spelling correction model for end-to-end speech recognition. CoRR abs/1902.07178 (2019) - [i20]Jonathan Shen, Patrick Nguyen, Yonghui Wu, Zhifeng Chen, Mia Xu Chen, Ye Jia, Anjuli Kannan, Tara N. Sainath, Yuan Cao, Chung-Cheng Chiu, Yanzhang He, Jan Chorowski, Smit Hinsu, Stella Laurenzo, James Qin, Orhan Firat, Wolfgang Macherey, Suyog Gupta, Ankur Bapna, Shuyuan Zhang, Ruoming Pang, Ron J. Weiss, Rohit Prabhavalkar, Qiao Liang, Benoit Jacob, Bowen Liang, HyoukJoong Lee, Ciprian Chelba, Sébastien Jean, Bo Li, Melvin Johnson, Rohan Anil, Rajat Tibrewal, Xiaobing Liu, Akiko Eriguchi, Navdeep Jaitly, Naveen Ari, Colin Cherry, Parisa Haghani, Otavio Good, Youlong Cheng, Raziel Alvarez, Isaac Caswell, Wei-Ning Hsu, Zongheng Yang, Kuan-Chieh Wang, Ekaterina Gonina, Katrin Tomanek, Ben Vanik, Zelin Wu, Llion Jones, Mike Schuster, Yanping Huang, Dehao Chen, Kazuki Irie, George F. Foster, John Richardson, Klaus Macherey, Antoine Bruguier, Heiga Zen, Colin Raffel, Shankar Kumar, Kanishka Rao, David Rybach, Matthew Murray, Vijayaditya Peddinti, Maxim Krikun, Michiel Bacchiani, Thomas B. Jablin, Robert Suderman, Ian Williams, Benjamin Lee, Deepti Bhatia, Justin Carlson, Semih Yavuz, Yu Zhang, Ian McGraw, Max Galkin, Qi Ge, Golan Pundak, Chad Whipkey, Todd Wang, Uri Alon, Dmitry Lepikhin, Ye Tian, Sara Sabour, William Chan, Shubham Toshniwal, Baohua Liao, Michael Nirschl, Pat Rondon:
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling. CoRR abs/1902.08295 (2019) - [i19]Heiga Zen, Viet Dang, Rob Clark, Yu Zhang, Ron J. Weiss, Ye Jia, Zhifeng Chen, Yonghui Wu:
LibriTTS: A Corpus Derived from LibriSpeech for Text-to-Speech. CoRR abs/1904.02882 (2019) - [i18]Fadi Biadsy, Ron J. Weiss, Pedro J. Moreno, Dimitri Kanvesky, Ye Jia:
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation. CoRR abs/1904.04169 (2019) - [i17]Ye Jia, Ron J. Weiss, Fadi Biadsy, Wolfgang Macherey, Melvin Johnson, Zhifeng Chen, Yonghui Wu:
Direct speech-to-speech translation with a sequence-to-sequence model. CoRR abs/1904.06037 (2019) - [i16]Yu Zhang, Ron J. Weiss, Heiga Zen, Yonghui Wu, Zhifeng Chen, R. J. Skerry-Ryan, Ye Jia, Andrew Rosenberg, Bhuvana Ramabhadran:
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning. CoRR abs/1907.04448 (2019) - 2018
- [c30]Jan Chorowski, Ron J. Weiss, Rif A. Saurous, Samy Bengio:
On Using Backpropagation for Speech Texture Generation and Voice Conversion. ICASSP 2018: 2256-2260 - [c29]Chung-Cheng Chiu, Tara N. Sainath, Yonghui Wu, Rohit Prabhavalkar, Patrick Nguyen, Zhifeng Chen, Anjuli Kannan, Ron J. Weiss, Kanishka Rao, Ekaterina Gonina, Navdeep Jaitly, Bo Li, Jan Chorowski, Michiel Bacchiani:
State-of-the-Art Speech Recognition with Sequence-to-Sequence Models. ICASSP 2018: 4774-4778 - [c28]Jonathan Shen, Ruoming Pang, Ron J. Weiss, Mike Schuster, Navdeep Jaitly, Zongheng Yang, Zhifeng Chen, Yu Zhang, Yuxuan Wang, R. J. Skerry-Ryan, Rif A. Saurous, Yannis Agiomyrgiannakis, Yonghui Wu:
Natural TTS Synthesis by Conditioning Wavenet on MEL Spectrogram Predictions. ICASSP 2018: 4779-4783 - [c27]Shubham Toshniwal, Tara N. Sainath, Ron J. Weiss, Bo Li, Pedro J. Moreno, Eugene Weinstein, Kanishka Rao:
Multilingual Speech Recognition with a Single End-to-End Model. ICASSP 2018: 4904-4908 - [c26]R. J. Skerry-Ryan, Eric Battenberg, Ying Xiao, Yuxuan Wang, Daisy Stanton, Joel Shor, Ron J. Weiss, Rob Clark, Rif A. Saurous:
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron. ICML 2018: 4700-4709 - [c25]Ye Jia, Yu Zhang, Ron J. Weiss, Quan Wang, Jonathan Shen, Fei Ren, Zhifeng Chen, Patrick Nguyen, Ruoming Pang, Ignacio López-Moreno, Yonghui Wu:
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis. NeurIPS 2018: 4485-4495 - [i15]R. J. Skerry-Ryan, Eric Battenberg, Ying Xiao, Yuxuan Wang, Daisy Stanton, Joel Shor, Ron J. Weiss, Rob Clark, Rif A. Saurous:
Towards End-to-End Prosody Transfer for Expressive Speech Synthesis with Tacotron. CoRR abs/1803.09047 (2018) - [i14]Ye Jia, Yu Zhang, Ron J. Weiss, Quan Wang, Jonathan Shen, Fei Ren, Zhifeng Chen, Patrick Nguyen, Ruoming Pang, Ignacio López-Moreno, Yonghui Wu:
Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis. CoRR abs/1806.04558 (2018) - [i13]Joseph M. Antognini, Matt Hoffman, Ron J. Weiss:
Synthesizing Diverse, High-Quality Audio Textures. CoRR abs/1806.08002 (2018) - [i12]Quan Wang, Hannah Muckenhirn, Kevin W. Wilson, Prashant Sridhar, Zelin Wu, John R. Hershey, Rif A. Saurous, Ron J. Weiss, Ye Jia, Ignacio López-Moreno:
VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking. CoRR abs/1810.04826 (2018) - [i11]Wei-Ning Hsu, Yu Zhang, Ron J. Weiss, Heiga Zen, Yonghui Wu, Yuxuan Wang, Yuan Cao, Ye Jia, Zhifeng Chen, Jonathan Shen, Patrick Nguyen, Ruoming Pang:
Hierarchical Generative Modeling for Controllable Speech Synthesis. CoRR abs/1810.07217 (2018) - [i10]Ye Jia, Melvin Johnson, Wolfgang Macherey, Ron J. Weiss, Yuan Cao, Chung-Cheng Chiu, Naveen Ari, Stella Laurenzo, Yonghui Wu:
Leveraging Weakly Supervised Data to Improve End-to-End Speech-to-Text Translation. CoRR abs/1811.02050 (2018) - 2017
- [j5]Tara N. Sainath, Ron J. Weiss, Kevin W. Wilson, Bo Li, Arun Narayanan, Ehsan Variani, Michiel Bacchiani, Izhak Shafran, Andrew W. Senior, Kean K. Chin, Ananya Misra, Chanwoo Kim:
Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 25(5): 965-979 (2017) - [c24]Shawn Hershey, Sourish Chaudhuri, Daniel P. W. Ellis, Jort F. Gemmeke, Aren Jansen, R. Channing Moore, Manoj Plakal, Devin Platt, Rif A. Saurous, Bryan Seybold, Malcolm Slaney, Ron J. Weiss, Kevin W. Wilson:
CNN architectures for large-scale audio classification. ICASSP 2017: 131-135 - [c23]Colin Raffel, Minh-Thang Luong, Peter J. Liu, Ron J. Weiss, Douglas Eck:
Online and Linear-Time Attention by Enforcing Monotonic Alignments. ICML 2017: 2837-2846 - [c22]Bo Li, Tara N. Sainath, Arun Narayanan, Joe Caroselli, Michiel Bacchiani, Ananya Misra, Izhak Shafran, Hasim Sak, Golan Pundak, Kean K. Chin, Khe Chai Sim, Ron J. Weiss, Kevin W. Wilson, Ehsan Variani, Chanwoo Kim, Olivier Siohan, Mitchel Weintraub, Erik McDermott, Richard Rose, Matt Shannon:
Acoustic Modeling for Google Home. INTERSPEECH 2017: 399-403 - [c21]Ron J. Weiss, Jan Chorowski, Navdeep Jaitly, Yonghui Wu, Zhifeng Chen:
Sequence-to-Sequence Models Can Directly Translate Foreign Speech. INTERSPEECH 2017: 2625-2629 - [c20]Yuxuan Wang, R. J. Skerry-Ryan, Daisy Stanton, Yonghui Wu, Ron J. Weiss, Navdeep Jaitly, Zongheng Yang, Ying Xiao, Zhifeng Chen, Samy Bengio, Quoc V. Le, Yannis Agiomyrgiannakis, Rob Clark, Rif A. Saurous:
Tacotron: Towards End-to-End Speech Synthesis. INTERSPEECH 2017: 4006-4010 - [p1]Tara N. Sainath, Ron J. Weiss, Kevin W. Wilson, Arun Narayanan, Michiel Bacchiani, Bo Li, Ehsan Variani, Izhak Shafran, Andrew W. Senior, Kean K. Chin, Ananya Misra, Chanwoo Kim:
Raw Multichannel Processing Using Deep Neural Networks. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 105-133 - [i9]Ron J. Weiss, Jan Chorowski, Navdeep Jaitly, Yonghui Wu, Zhifeng Chen:
Sequence-to-Sequence Models Can Directly Transcribe Foreign Speech. CoRR abs/1703.08581 (2017) - [i8]Yuxuan Wang, R. J. Skerry-Ryan, Daisy Stanton, Yonghui Wu, Ron J. Weiss, Navdeep Jaitly, Zongheng Yang, Ying Xiao, Zhifeng Chen, Samy Bengio, Quoc V. Le, Yannis Agiomyrgiannakis, Rob Clark, Rif A. Saurous:
Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model. CoRR abs/1703.10135 (2017) - [i7]Colin Raffel, Minh-Thang Luong, Peter J. Liu, Ron J. Weiss, Douglas Eck:
Online and Linear-Time Attention by Enforcing Monotonic Alignments. CoRR abs/1704.00784 (2017) - [i6]Shubham Toshniwal, Tara N. Sainath, Ron J. Weiss, Bo Li, Pedro J. Moreno, Eugene Weinstein, Kanishka Rao:
Multilingual Speech Recognition With A Single End-To-End Model. CoRR abs/1711.01694 (2017) - [i5]Chung-Cheng Chiu, Tara N. Sainath, Yonghui Wu, Rohit Prabhavalkar, Patrick Nguyen, Zhifeng Chen, Anjuli Kannan, Ron J. Weiss, Kanishka Rao, Katya Gonina, Navdeep Jaitly, Bo Li, Jan Chorowski, Michiel Bacchiani:
State-of-the-art Speech Recognition With Sequence-to-Sequence Models. CoRR abs/1712.01769 (2017) - [i4]Jonathan Shen, Ruoming Pang, Ron J. Weiss, Mike Schuster, Navdeep Jaitly, Zongheng Yang, Zhifeng Chen, Yu Zhang, Yuxuan Wang, R. J. Skerry-Ryan, Rif A. Saurous, Yannis Agiomyrgiannakis, Yonghui Wu:
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. CoRR abs/1712.05884 (2017) - [i3]Jan Chorowski, Ron J. Weiss, Rif A. Saurous, Samy Bengio:
On Using Backpropagation for Speech Texture Generation and Voice Conversion. CoRR abs/1712.08363 (2017) - 2016
- [c19]Tara N. Sainath, Ron J. Weiss, Kevin W. Wilson, Arun Narayanan, Michiel Bacchiani:
Factored spatial and spectral multichannel raw waveform CLDNNs. ICASSP 2016: 5075-5079 - [c18]Tara N. Sainath, Arun Narayanan, Ron J. Weiss, Ehsan Variani, Kevin W. Wilson, Michiel Bacchiani, Izhak Shafran:
Reducing the Computational Complexity of Multimicrophone Acoustic Models with Integrated Feature Extraction. INTERSPEECH 2016: 1971-1975 - [c17]Bo Li, Tara N. Sainath, Ron J. Weiss, Kevin W. Wilson, Michiel Bacchiani:
Neural Network Adaptive Beamforming for Robust Multichannel Speech Recognition. INTERSPEECH 2016: 1976-1980 - [i2]Shawn Hershey, Sourish Chaudhuri, Daniel P. W. Ellis, Jort F. Gemmeke, Aren Jansen, R. Channing Moore, Manoj Plakal, Devin Platt, Rif A. Saurous, Bryan Seybold, Malcolm Slaney, Ron J. Weiss, Kevin W. Wilson:
CNN Architectures for Large-Scale Audio Classification. CoRR abs/1609.09430 (2016) - 2015
- [c16]Tara N. Sainath, Ron J. Weiss, Kevin W. Wilson, Arun Narayanan, Michiel Bacchiani, Andrew W. Senior:
Speaker location and microphone spacing invariant acoustic modeling from raw multichannel waveforms. ASRU 2015: 30-36 - [c15]Yedid Hoshen, Ron J. Weiss, Kevin W. Wilson:
Speech acoustic modeling from raw multichannel waveforms. ICASSP 2015: 4624-4628 - [c14]Tara N. Sainath, Ron J. Weiss, Andrew W. Senior, Kevin W. Wilson, Oriol Vinyals:
Learning the speech front-end with raw waveform CLDNNs. INTERSPEECH 2015: 1-5 - 2014
- [c13]Jason Weston, Ron J. Weiss, Hector Yee:
Affinity Weighted Embedding. ICML 2014: 1215-1223 - 2013
- [c12]Jason Weston, Ron J. Weiss, Hector Yee:
Nonlinear latent factorization by embedding multiple user interests. RecSys 2013: 65-68 - [c11]Jason Weston, Hector Yee, Ron J. Weiss:
Learning to rank recommendations with the k-order statistic loss. RecSys 2013: 245-248 - [c10]Jason Weston, Ron J. Weiss, Hector Yee:
Affinity Weighted Embedding. ICLR (Workshop) 2013 - 2012
- [c9]Jason Weston, Chong Wang, Ron J. Weiss, Adam Berenzweig:
Latent Collaborative Retrieval. ICML 2012 - [i1]Jason Weston, Chong Wang, Ron J. Weiss, Adam Berenzweig:
Latent Collaborative Retrieval. CoRR abs/1206.4603 (2012) - 2011
- [j4]Ron J. Weiss, Juan Pablo Bello:
Unsupervised Discovery of Temporal Structure in Music. IEEE J. Sel. Top. Signal Process. 5(6): 1240-1251 (2011) - [j3]Ron J. Weiss, Michael I. Mandel, Daniel P. W. Ellis:
Combining localization cues and source model constraints for binaural source separation. Speech Commun. 53(5): 606-621 (2011) - [c8]Thierry Bertin-Mahieux, Graham Grindlay, Ron J. Weiss, Daniel P. W. Ellis:
Evaluating music sequence models through missing data. ICASSP 2011: 177-180 - 2010
- [j2]Ron J. Weiss, Daniel P. W. Ellis:
Speech separation using speaker-adapted eigenvoice speech models. Comput. Speech Lang. 24(1): 16-29 (2010) - [j1]Michael I. Mandel, Ron J. Weiss, Daniel P. W. Ellis:
Model-Based Expectation-Maximization Source Separation and Localization. IEEE Trans. Speech Audio Process. 18(2): 382-394 (2010) - [c7]Thierry Bertin-Mahieux, Ron J. Weiss, Daniel P. W. Ellis:
Clustering Beat-Chroma Patterns in a Large Music Database. ISMIR 2010: 111-116 - [c6]Ron J. Weiss, Juan Pablo Bello:
Identifying Repeated Patterns in Music Using Sparse Convolutive Non-negative Matrix Factorization. ISMIR 2010: 123-128
2000 – 2009
- 2009
- [c5]Ron J. Weiss, Daniel P. W. Ellis:
A variational EM algorithm for learning eigenvoice parameters in mixed signals. ICASSP 2009: 113-116 - 2008
- [c4]Ron J. Weiss, Trausti T. Kristjansson:
DySANA: dynamic speech and noise adaptation for voice activity detection. INTERSPEECH 2008: 127-130 - [c3]Ron J. Weiss, Michael I. Mandel, Daniel P. W. Ellis:
Source separation based on binaural cues and source model constraints. INTERSPEECH 2008: 419-422 - 2006
- [c2]Daniel P. W. Ellis, Ron J. Weiss:
Model-Based Monaural Source Separation Using a Vector-Quantized Phase-Vocoder Representation. ICASSP (5) 2006: 957-960 - [c1]Ron J. Weiss, Daniel P. W. Ellis:
Estimating single-channel source separation masks: relevance vector machine classifiers vs. pitch-based masking. SAPA@INTERSPEECH 2006: 31-36
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-08-07 21:31 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint