


Остановите войну!
for scientists:


default search action
Gordon Wichern
Person information

Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2023
- [j11]Zhong-Qiu Wang
, Gordon Wichern
, Shinji Watanabe
, Jonathan Le Roux
:
STFT-Domain Neural Speech Enhancement With Very Low Algorithmic Latency. IEEE ACM Trans. Audio Speech Lang. Process. 31: 397-410 (2023) - 2022
- [c33]Satvik Venkatesh, Gordon Wichern, Aswin Shanmugam Subramanian, Jonathan Le Roux:
Improved Domain Generalization via Disentangled Multi-Task Learning in Unsupervised Anomalous Sound Detection. DCASE 2022 - [c32]Darius Petermann, Gordon Wichern, Zhong-Qiu Wang, Jonathan Le Roux:
The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World Soundtracks. ICASSP 2022: 526-530 - [c31]Olga Slizovskaia, Gordon Wichern, Zhong-Qiu Wang, Jonathan Le Roux:
Locate This, Not that: Class-Conditioned Sound Event DOA Estimation. ICASSP 2022: 711-715 - [c30]Efthymios Tzinis, Gordon Wichern, Aswin Shanmugam Subramanian, Paris Smaragdis, Jonathan Le Roux:
Heterogeneous Target Speech Separation. INTERSPEECH 2022: 1796-1800 - [i28]Olga Slizovskaia, Gordon Wichern, Zhong-Qiu Wang, Jonathan Le Roux:
Locate This, Not That: Class-Conditioned Sound Event DOA Estimation. CoRR abs/2203.04197 (2022) - [i27]Efthymios Tzinis, Gordon Wichern, Aswin Shanmugam Subramanian, Paris Smaragdis, Jonathan Le Roux:
Heterogeneous Target Speech Separation. CoRR abs/2204.03594 (2022) - [i26]Zhong-Qiu Wang, Gordon Wichern, Shinji Watanabe, Jonathan Le Roux:
STFT-Domain Neural Speech Enhancement with Very Low Algorithmic Latency. CoRR abs/2204.09911 (2022) - [i25]Zexu Pan, Gordon Wichern, François G. Germain, Aswin Shanmugam Subramanian, Jonathan Le Roux:
Towards End-to-end Speaker Diarization in the Wild. CoRR abs/2211.01299 (2022) - [i24]Hao Yen, François G. Germain, Gordon Wichern, Jonathan Le Roux:
Cold Diffusion for Speech Enhancement. CoRR abs/2211.02527 (2022) - [i23]Efthymios Tzinis, Gordon Wichern, Paris Smaragdis, Jonathan Le Roux:
Optimal Condition Training for Target Source Separation. CoRR abs/2211.05927 (2022) - [i22]Ankush Chakrabarty, Gordon Wichern, Christopher R. Laughman:
Meta-Learning of Neural State-Space Models Using Data From Similar Systems. CoRR abs/2211.07768 (2022) - [i21]Rohith Aralikatti, Christoph Böddeker, Gordon Wichern, Aswin Shanmugam Subramanian, Jonathan Le Roux:
Reverberation as Supervision for Speech Separation. CoRR abs/2211.08303 (2022) - [i20]Dimitrios Bralios, Efthymios Tzinis, Gordon Wichern, Paris Smaragdis, Jonathan Le Roux:
Latent Iterative Refinement for Modular Source Separation. CoRR abs/2211.11917 (2022) - [i19]Darius Petermann, Gordon Wichern, Aswin Shanmugam Subramanian, Jonathan Le Roux:
Hyperbolic Audio Source Separation. CoRR abs/2212.05008 (2022) - [i18]Darius Petermann, Gordon Wichern, Aswin Shanmugam Subramanian, Zhong-Qiu Wang, Jonathan Le Roux:
Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks. CoRR abs/2212.07327 (2022) - 2021
- [j10]Zhong-Qiu Wang
, Gordon Wichern
, Jonathan Le Roux
:
On the Compensation Between Magnitude and Phase in Speech Separation. IEEE Signal Process. Lett. 28: 2018-2022 (2021) - [j9]Zhong-Qiu Wang
, Gordon Wichern
, Jonathan Le Roux
:
Convolutive Prediction for Monaural Speech Dereverberation and Noisy-Reverberant Speaker Separation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3476-3490 (2021) - [c29]Yun-Ning Hung, Gordon Wichern, Jonathan Le Roux:
Transcription Is All You Need: Learning To Separate Musical Mixtures With Score As Supervision. ICASSP 2021: 46-50 - [c28]Zhong-Qiu Wang, Gordon Wichern, Jonathan Le Roux:
Convolutive Prediction for Reverberant Speech Separation. WASPAA 2021: 56-60 - [c27]Gordon Wichern, Ankush Chakrabarty, Zhong-Qiu Wang, Jonathan Le Roux:
Anomalous Sound Detection Using Attentive Neural Processes. WASPAA 2021: 186-190 - [i17]Ankush Chakrabarty, Gordon Wichern, Christopher R. Laughman:
Attentive Neural Processes and Batch Bayesian Optimization for Scalable Calibration of Physics-Informed Digital Twins. CoRR abs/2106.15502 (2021) - [i16]Zhong-Qiu Wang, Gordon Wichern, Jonathan Le Roux:
On The Compensation Between Magnitude and Phase in Speech Separation. CoRR abs/2108.05470 (2021) - [i15]Zhong-Qiu Wang, Gordon Wichern, Jonathan Le Roux:
Convolutive Prediction for Reverberant Speech Separation. CoRR abs/2108.07194 (2021) - [i14]Zhong-Qiu Wang, Gordon Wichern, Jonathan Le Roux:
Convolutive Prediction for Monaural Speech Dereverberation and Noisy-Reverberant Speaker Separation. CoRR abs/2108.07376 (2021) - [i13]Zhong-Qiu Wang, Gordon Wichern, Jonathan Le Roux:
Leveraging Low-Distortion Target Estimates for Improved Speech Enhancement. CoRR abs/2110.00570 (2021) - [i12]Darius Petermann, Gordon Wichern, Zhong-Qiu Wang, Jonathan Le Roux:
The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World Soundtracks. CoRR abs/2110.09958 (2021) - 2020
- [j8]Fatemeh Pishdadian, Gordon Wichern, Jonathan Le Roux:
Finding Strength in Weakness: Learning to Separate Sounds With Weak Supervision. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2386-2399 (2020) - [c26]Fatemeh Pishdadian, Gordon Wichern, Jonathan Le Roux:
Learning to Separate Sounds from Weakly Labeled Scenes. ICASSP 2020: 91-95 - [c25]Matthew Maciejewski, Gordon Wichern, Emmett McQuinn, Jonathan Le Roux:
WHAMR!: Noisy and Reverberant Single-Channel Speech Separation. ICASSP 2020: 696-700 - [c24]Niko Moritz, Gordon Wichern, Takaaki Hori, Jonathan Le Roux:
All-in-One Transformer: Unifying Speech Recognition, Audio Tagging, and Event Detection. INTERSPEECH 2020: 3112-3116 - [c23]Ethan Manilow, Gordon Wichern, Jonathan Le Roux:
Hierarchical Musical Instrument Separation. ISMIR 2020: 376-383 - [c22]Prem Seetharaman, Gordon Wichern, Bryan Pardo, Jonathan Le Roux:
Autoclip: Adaptive Gradient Clipping for Source Separation Networks. MLSP 2020: 1-6 - [i11]Prem Seetharaman, Gordon Wichern, Bryan Pardo, Jonathan Le Roux:
AutoClip: Adaptive Gradient Clipping for Source Separation Networks. CoRR abs/2007.14469 (2020) - [i10]Yun-Ning Hung, Gordon Wichern, Jonathan Le Roux:
Transcription Is All You Need: Learning to Separate Musical Mixtures with Score as Supervision. CoRR abs/2010.11904 (2020)
2010 – 2019
- 2019
- [j7]Jonathan Le Roux
, Gordon Wichern
, Shinji Watanabe
, Andy M. Sarroff, John R. Hershey:
Phasebook and Friends: Leveraging Discrete Representations for Source Separation. IEEE J. Sel. Top. Signal Process. 13(2): 370-382 (2019) - [c21]Jonathan Le Roux, Gordon Wichern, Shinji Watanabe
, Andy M. Sarroff, John R. Hershey:
The Phasebook: Building Complex Masks via Discrete Representations for Source Separation. ICASSP 2019: 66-70 - [c20]Prem Seetharaman, Gordon Wichern, Shrikant Venkataramani, Jonathan Le Roux:
Class-conditional Embeddings for Music Source Separation. ICASSP 2019: 301-305 - [c19]Prem Seetharaman, Gordon Wichern, Jonathan Le Roux, Bryan Pardo:
Bootstrapping Single-channel Source Separation via Unsupervised Spatial Clustering on Stereo Mixtures. ICASSP 2019: 356-360 - [c18]Ryo Aihara, Toshiyuki Hanazawa, Yohei Okato, Gordon Wichern, Jonathan Le Roux:
Teacher-student Deep Clustering for Low-delay Single Channel Speech Separation. ICASSP 2019: 690-694 - [c17]Chiori Hori, Huda AlAmri, Jue Wang, Gordon Wichern, Takaaki Hori, Anoop Cherian, Tim K. Marks, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Irfan Essa, Dhruv Batra, Devi Parikh:
End-to-end Audio Visual Scene-aware Dialog Using Multimodal Attention-based Video Features. ICASSP 2019: 2352-2356 - [c16]Gordon Wichern, Joe Antognini, Michael Flynn, Licheng Richard Zhu, Emmett McQuinn, Dwight Crow, Ethan Manilow, Jonathan Le Roux:
WHAM!: Extending Speech Separation to Noisy Environments. INTERSPEECH 2019: 1368-1372 - [c15]Ethan Manilow, Gordon Wichern, Prem Seetharaman, Jonathan Le Roux:
Cutting Music Source Separation Some Slakh: A Dataset to Study the Impact of Training Data Quality and Quantity. WASPAA 2019: 45-49 - [i9]Gordon Wichern, Joe Antognini, Michael Flynn, Licheng Richard Zhu, Emmett McQuinn, Dwight Crow, Ethan Manilow, Jonathan Le Roux:
WHAM!: Extending Speech Separation to Noisy Environments. CoRR abs/1907.01160 (2019) - [i8]Ethan Manilow, Gordon Wichern, Prem Seetharaman, Jonathan Le Roux:
Cutting Music Source Separation Some Slakh: A Dataset to Study the Impact of Training Data Quality and Quantity. CoRR abs/1909.08494 (2019) - [i7]Matthew Maciejewski, Gordon Wichern, Emmett McQuinn, Jonathan Le Roux:
WHAMR!: Noisy and Reverberant Single-Channel Speech Separation. CoRR abs/1910.10279 (2019) - [i6]Prem Seetharaman, Gordon Wichern, Jonathan Le Roux, Bryan Pardo:
Bootstrapping deep music separation from primitive auditory grouping principles. CoRR abs/1910.11133 (2019) - [i5]Fatemeh Pishdadian, Gordon Wichern, Jonathan Le Roux:
Finding Strength in Weakness: Learning to Separate Sounds with Weak Supervision. CoRR abs/1911.02182 (2019) - 2018
- [c14]Chiori Hori, Takaaki Hori, Gordon Wichern, Jue Wang, Teng-Yok Lee, Anoop Cherian, Tim K. Marks:
Multimodal Attention for Fusion of Audio and Spatiotemporal Features for Video Description. CVPR Workshops 2018: 2528-2531 - [c13]Gordon Wichern, Jonathan Le Roux:
Phase Reconstruction with Learned Time-Frequency Representations for Single-Channel Speech Separation. IWAENC 2018: 396-400 - [i4]Chiori Hori, Huda AlAmri, Jue Wang, Gordon Wichern, Takaaki Hori, Anoop Cherian, Tim K. Marks, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Irfan Essa, Dhruv Batra, Devi Parikh:
End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features. CoRR abs/1806.08409 (2018) - [i3]Jonathan Le Roux, Gordon Wichern, Shinji Watanabe, Andy M. Sarroff, John R. Hershey:
Phasebook and Friends: Leveraging Discrete Representations for Source Separation. CoRR abs/1810.01395 (2018) - [i2]Prem Seetharaman, Gordon Wichern, Jonathan Le Roux, Bryan Pardo:
Bootstrapping single-channel source separation via unsupervised spatial clustering on stereo mixtures. CoRR abs/1811.02130 (2018) - [i1]Prem Seetharaman, Gordon Wichern, Shrikant Venkataramani, Jonathan Le Roux:
Class-conditional embeddings for music source separation. CoRR abs/1811.03076 (2018) - 2017
- [c12]Gordon Wichern, Alexey Lukin:
Low-Latency approximation of bidirectional recurrent networks for speech denoising. WASPAA 2017: 66-70 - 2013
- [j6]Makoto Yamada, Gordon Wichern, Kazunobu Kondo, Masashi Sugiyama
, Hiroshi Sawada:
Noise adaptive optimization of matrix initialization for frequency-domain independent component analysis. Digit. Signal Process. 23(1): 1-8 (2013) - 2011
- [j5]Makoto Yamada, Masashi Sugiyama, Gordon Wichern, Jaak Simm:
Improving the Accuracy of Least-Squares Probabilistic Classifiers. IEICE Trans. Inf. Syst. 94-D(6): 1337-1340 (2011) - 2010
- [j4]Gordon Wichern, Brandon Mechtley, Alex Fink, Harvey D. Thornburg, Andreas Spanias:
An Ontological Framework for Retrieving Environmental Sounds Using Semantics and Acoustic Content. EURASIP J. Audio Speech Music. Process. 2010 (2010) - [j3]Makoto Yamada, Masashi Sugiyama, Gordon Wichern, Jaak Simm:
Direct Importance Estimation with a Mixture of Probabilistic Principal Component Analyzers. IEICE Trans. Inf. Syst. 93-D(10): 2846-2849 (2010) - [j2]Gordon Wichern, Jiachen Xue, Harvey D. Thornburg, Brandon Mechtley, Andreas Spanias:
Segmentation, Indexing, and Retrieval for Environmental and Natural Sounds. IEEE Trans. Speech Audio Process. 18(3): 688-707 (2010) - [c11]Gordon Wichern, Makoto Yamada, Harvey D. Thornburg, Masashi Sugiyama, Andreas Spanias:
Automatic audio tagging using covariate shift adaptation. ICASSP 2010: 253-256 - [c10]Makoto Yamada, Masashi Sugiyama, Gordon Wichern, Tomoko Matsui:
Acceleration of sequence kernel computation for real-time speaker identification. ICASSP 2010: 1626-1629 - [c9]Makoto Yamada, Masashi Sugiyama, Gordon Wichern:
Direct importance estimation with probabilistic principal component analyzers. ICASSP 2010: 1962-1965 - [c8]Brandon Mechtley, Gordon Wichern, Harvey D. Thornburg, Andreas Spanias:
Combining semantic, social, and acoustic similarity for retrieval of environmental sounds. ICASSP 2010: 2402-2405
2000 – 2009
- 2009
- [c7]Gordon Wichern, Harvey D. Thornburg, Andreas Spanias:
Multi-channel audio segmentation for continuous observation and archival of large spaces. ICASSP 2009: 237-240 - [c6]Gordon Wichern, Homin Kwon, Andreas Spanias, Alex Fink, Harvey D. Thornburg:
Continuous observation and archival of acoustic scenes using wireless sensor networks. DPS 2009: 1-6 - [c5]Gordon Wichern, Harvey D. Thornburg, Andreas Spanias:
Unifying semantic and content-based approaches for retrieval of environmental sounds. WASPAA 2009: 13-16 - 2008
- [c4]Jiachen Xue, Gordon Wichern, Harvey D. Thornburg, Andreas Spanias:
Fast query by example of environmental sounds via robust and efficient cluster-based indexing. ICASSP 2008: 5-8 - 2007
- [j1]Gordon Wichern, Mahmood R. Azimi-Sadjadi, Michael Mungiole:
Environmentally adaptive acoustic transmission loss prediction in turbulent and nonturbulent atmospheres. Neural Networks 20(4): 484-497 (2007) - [c3]Gordon Wichern, Harvey D. Thornburg, Brandon Mechtley, Alex Fink, Kai Tu, Andreas Spanias:
Robust Multi-Features Segmentation and Indexing for Natural Sound Environments. CBMI 2007: 69-76 - [c2]Michael McCarron, Mahmood R. Azimi-Sadjadi, Gordon Wichern, Michael Mungiole:
An Operationally Adaptive System for Rapid Acoustic Transmission Loss Prediction. IJCNN 2007: 2262-2267 - 2006
- [c1]Gordon Wichern, Mahmood R. Azimi-Sadjadi, Michael Mungiole:
An Environmentally Adaptive System for Rapid Acoustic Transmission Loss Prediction. IJCNN 2006: 5118-5125
Coauthor Index
aka: Zhong-Qiu Wang

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
load content from web.archive.org
Privacy notice: By enabling the option above, your browser will contact the API of web.archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
Tweets on dblp homepage
Show tweets from on the dblp homepage.
Privacy notice: By enabling the option above, your browser will contact twitter.com and twimg.com to load tweets curated by our Twitter account. At the same time, Twitter will persistently store several cookies with your web browser. While we did signal Twitter to not track our users by setting the "dnt" flag, we do not have any control over how Twitter uses your data. So please proceed with care and consider checking the Twitter privacy policy.
last updated on 2023-01-03 21:29 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint