default search action
Search dblp
Full-text search
- > Home
Please enter a search query
- case-insensitive prefix search: default
e.g., sig matches "SIGIR" as well as "signal" - exact word search: append dollar sign ($) to word
e.g., graph$ matches "graph", but not "graphics" - boolean and: separate words by space
e.g., codd model - boolean or: connect words by pipe symbol (|)
e.g., graph|network
Update May 7, 2017: Please note that we had to disable the phrase search operator (.) and the boolean not operator (-) due to technical problems. For the time being, phrase search queries will yield regular prefix search result, and search terms preceded by a minus will be interpreted as regular (positive) search terms.
Author search results
Exact matches
Venue search results
no matches
Refine list
refine by author
- no options
- temporarily not available
refine by venue
- no options
- temporarily not available
refine by type
- no options
- temporarily not available
refine by access
- no options
- temporarily not available
refine by year
- no options
- temporarily not available
Publication search results
found 30 matches
- 2024
- Thilo von Neumann, Christoph Böddeker, Tobias Cord-Landwehr, Marc Delcroix, Reinhold Haeb-Umbach:
Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization. ICASSP Workshops 2024: 775-779 - 2023
- Thilo von Neumann, Keisuke Kinoshita, Christoph Böddeker, Marc Delcroix, Reinhold Haeb-Umbach:
Segment-Less Continuous Speech Separation of Meetings: Training and Evaluation Criteria. IEEE ACM Trans. Audio Speech Lang. Process. 31: 576-589 (2023) - Thilo von Neumann, Christoph Böddeker, Keisuke Kinoshita, Marc Delcroix, Reinhold Haeb-Umbach:
On Word Error Rate Definitions and Their Efficient Computation for Multi-Speaker Speech Recognition Systems. ICASSP 2023: 1-5 - Thilo von Neumann, Christoph Böddeker, Marc Delcroix, Reinhold Haeb-Umbach:
MeetEval: A Toolkit for Computation of Word Error Rates for Meeting Transcription Systems. CoRR abs/2307.11394 (2023) - Peter Vieting, Simon Berger, Thilo von Neumann, Christoph Böddeker, Ralf Schlüter, Reinhold Haeb-Umbach:
Mixture Encoder Supporting Continuous Speech Separation for Meeting Recognition. CoRR abs/2309.08454 (2023) - Thilo von Neumann, Christoph Böddeker, Tobias Cord-Landwehr, Marc Delcroix, Reinhold Haeb-Umbach:
Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization. CoRR abs/2309.16482 (2023) - 2022
- Thilo von Neumann, Keisuke Kinoshita, Christoph Böddeker, Marc Delcroix, Reinhold Haeb-Umbach:
SA-SDR: A Novel Loss Function for Separation of Meeting Style Data. ICASSP 2022: 6022-6026 - Christoph Böddeker, Tobias Cord-Landwehr, Thilo von Neumann, Reinhold Haeb-Umbach:
An Initialization Scheme for Meeting Separation with Spatial Mixture Models. INTERSPEECH 2022: 271-275 - Keisuke Kinoshita, Thilo von Neumann, Marc Delcroix, Christoph Böddeker, Reinhold Haeb-Umbach:
Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. INTERSPEECH 2022: 1486-1490 - Tobias Cord-Landwehr, Christoph Böddeker, Thilo von Neumann, Catalin Zorila, Rama Doddipatla, Reinhold Haeb-Umbach:
Monaural Source Separation: From Anechoic To Reverberant Environments. IWAENC 2022: 1-5 - Tobias Cord-Landwehr, Thilo von Neumann, Christoph Böddeker, Reinhold Haeb-Umbach:
MMS-MSG: A Multi-Purpose Multi-Speaker Mixture Signal Generator. IWAENC 2022: 1-5 - Christoph Böddeker, Tobias Cord-Landwehr, Thilo von Neumann, Reinhold Haeb-Umbach:
An Initialization Scheme for Meeting Separation with Spatial Mixture Models. CoRR abs/2204.01338 (2022) - Tobias Gburrek, Christoph Böddeker, Thilo von Neumann, Tobias Cord-Landwehr, Joerg Schmalenstroeer, Reinhold Haeb-Umbach:
A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network. CoRR abs/2205.00944 (2022) - Keisuke Kinoshita, Thilo von Neumann, Marc Delcroix, Christoph Böddeker, Reinhold Haeb-Umbach:
Utterance-by-utterance overlap-aware neural diarization with Graph-PIT. CoRR abs/2207.13888 (2022) - Thilo von Neumann, Christoph Böddeker, Keisuke Kinoshita, Marc Delcroix, Reinhold Haeb-Umbach:
On Word Error Rate Definitions and their Efficient Computation for Multi-Speaker Speech Recognition Systems. CoRR abs/2211.16112 (2022) - 2021
- Thilo von Neumann, Christoph Böddeker, Keisuke Kinoshita, Marc Delcroix, Reinhold Haeb-Umbach:
Speeding Up Permutation Invariant Training for Source Separation. ITG Conference on Speech Communication 2021: 1-5 - Thilo von Neumann, Keisuke Kinoshita, Christoph Böddeker, Marc Delcroix, Reinhold Haeb-Umbach:
Graph-PIT: Generalized Permutation Invariant Training for Continuous Separation of Arbitrary Numbers of Speakers. Interspeech 2021: 3490-3494 - Thilo von Neumann, Christoph Böddeker, Keisuke Kinoshita, Marc Delcroix, Reinhold Haeb-Umbach:
Speeding Up Permutation Invariant Training for Source Separation. CoRR abs/2107.14445 (2021) - Thilo von Neumann, Keisuke Kinoshita, Christoph Böddeker, Marc Delcroix, Reinhold Haeb-Umbach:
Graph-PIT: Generalized permutation invariant training for continuous separation of arbitrary numbers of speakers. CoRR abs/2107.14446 (2021) - Thilo von Neumann, Keisuke Kinoshita, Christoph Böddeker, Marc Delcroix, Reinhold Haeb-Umbach:
SA-SDR: A novel loss function for separation of meeting style data. CoRR abs/2110.15581 (2021) - Tobias Cord-Landwehr, Christoph Böddeker, Thilo von Neumann, Catalin Zorila, Rama Doddipatla, Reinhold Haeb-Umbach:
Monaural source separation: From anechoic to reverberant environments. CoRR abs/2111.07578 (2021) - 2020
- Thilo von Neumann, Keisuke Kinoshita, Lukas Drude, Christoph Böddeker, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach:
End-to-End Training of Time Domain Audio Separation and Recognition. ICASSP 2020: 7004-7008 - Keisuke Kinoshita, Thilo von Neumann, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach:
Multi-Path RNN for Hierarchical Modeling of Long Sequential Data and its Application to Speaker Stream Separation. INTERSPEECH 2020: 2652-2656 - Thilo von Neumann, Christoph Böddeker, Lukas Drude, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach:
Multi-Talker ASR for an Unknown Number of Sources: Joint Training of Source Counting, Separation and ASR. INTERSPEECH 2020: 3097-3101 - Thilo von Neumann, Christoph Böddeker, Lukas Drude, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach:
Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR. CoRR abs/2006.02786 (2020) - Keisuke Kinoshita, Thilo von Neumann, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach:
Multi-path RNN for hierarchical modeling of long sequential data and its application to speaker stream separation. CoRR abs/2006.13579 (2020) - 2019
- Thilo von Neumann, Keisuke Kinoshita, Marc Delcroix, Shoko Araki, Tomohiro Nakatani, Reinhold Haeb-Umbach:
All-neural Online Source Separation, Counting, and Diarization for Meeting Analysis. ICASSP 2019: 91-95 - Thilo von Neumann, Keisuke Kinoshita, Marc Delcroix, Shoko Araki, Tomohiro Nakatani, Reinhold Haeb-Umbach:
All-neural online source separation, counting, and diarization for meeting analysis. CoRR abs/1902.07881 (2019) - Thilo von Neumann, Keisuke Kinoshita, Lukas Drude, Christoph Böddeker, Marc Delcroix, Tomohiro Nakatani, Reinhold Haeb-Umbach:
End-to-end training of time domain audio separation and recognition. CoRR abs/1912.08462 (2019) - 2018
- Lukas Drude, Thilo von Neumann, Reinhold Haeb-Umbach:
Deep Attractor Networks for Speaker Re-Identification and Blind Source Separation. ICASSP 2018: 11-15
loading more results
failed to load more results, please try again later
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
retrieved on 2024-11-04 23:07 CET from data curated by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint