Soroosh Mariooryad

2020 – today

2025
[c13]
https://dblp.org/rec/conf/naacl/BattenbergSSMSSK25
Eric Battenberg, R. J. Skerry-Ryan, Daisy Stanton, Soroosh Mariooryad, Matt Shannon, Julian Salazar, David Kao:
Robust and Unbounded Length Generalization in Autoregressive Transformer-Based Text-to-Speech. NAACL (Long Papers) 2025: 11789-11806
[i11]
https://dblp.org/rec/journals/corr/abs-2507-23292
R. J. Skerry-Ryan, Julian Salazar, Soroosh Mariooryad, David Kao, Daisy Stanton, Eric Battenberg, Matt Shannon, Ron J. Weiss, Robin Scheibler, Jonas Rothfuss, Tom Bagby:
SequenceLayers: Sequence Processing and Streaming Neural Networks Made Easy. CoRR abs/2507.23292 (2025)
2024
[c12]
https://dblp.org/rec/conf/iclr/NachmaniLHSAMRS24
Eliya Nachmani, Alon Levkovitch, Roy Hirsch, Julian Salazar, Chulayuth Asawaroengchai, Soroosh Mariooryad, Ehud Rivlin, R. J. Skerry-Ryan, Michelle Tadmor Ramanovich:
Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM. ICLR 2024
[i10]
https://dblp.org/rec/journals/corr/abs-2410-22179
Eric Battenberg, R. J. Skerry-Ryan, Daisy Stanton, Soroosh Mariooryad, Matt Shannon, Julian Salazar, David Kao:
Very Attentive Tacotron: Robust and Unbounded Length Generalization in Autoregressive Transformer-Based Text-to-Speech. CoRR abs/2410.22179 (2024)
[i9]
https://dblp.org/rec/journals/corr/abs-2412-08356
Alon Levkovitch, Julian Salazar, Soroosh Mariooryad, R. J. Skerry-Ryan, Nadav Bar, Bastiaan Kleijn, Eliya Nachmani:
Zero-Shot Mono-to-Binaural Speech Synthesis. CoRR abs/2412.08356 (2024)
2023
[i8]
https://dblp.org/rec/journals/corr/abs-2305-15255
Eliya Nachmani, Alon Levkovitch, Julian Salazar, Chulayuth Asawaroengchai, Soroosh Mariooryad, R. J. Skerry-Ryan, Michelle Tadmor Ramanovich:
LMs with a Voice: Spoken Language Modeling beyond Speech Tokens. CoRR abs/2305.15255 (2023)
2022
[c11]
https://dblp.org/rec/conf/icassp/StantonSMSBBK22
Daisy Stanton, Matt Shannon, Soroosh Mariooryad, R. J. Skerry-Ryan, Eric Battenberg, Tom Bagby, David Kao:
Speaker Generation. ICASSP 2022: 7897-7901
[i7]
https://dblp.org/rec/journals/corr/abs-2212-03232
Soroosh Mariooryad, Matt Shannon, Siyuan Ma, Tom Bagby, David Kao, Daisy Stanton, Eric Battenberg, R. J. Skerry-Ryan:
Learning the joint distribution of two sequences using little or no paired data. CoRR abs/2212.03232 (2022)
2021
[c10]
https://dblp.org/rec/conf/icassp/WeissSBMK21
Ron J. Weiss, R. J. Skerry-Ryan, Eric Battenberg, Soroosh Mariooryad, Diederik P. Kingma:
Wave-Tacotron: Spectrogram-Free End-to-End Text-to-Speech Synthesis. ICASSP 2021: 5679-5683
[i6]
https://dblp.org/rec/journals/corr/abs-2111-05095
Daisy Stanton, Matt Shannon, Soroosh Mariooryad, R. J. Skerry-Ryan, Eric Battenberg, Tom Bagby, David Kao:
Speaker Generation. CoRR abs/2111.05095 (2021)
2020
[c9]
https://dblp.org/rec/conf/icassp/BattenbergSMSKS20
Eric Battenberg, R. J. Skerry-Ryan, Soroosh Mariooryad, Daisy Stanton, David Kao, Matt Shannon, Tom Bagby:
Location-Relative Attention Mechanisms for Robust Long-Form Speech Synthesis. ICASSP 2020: 6194-6198
[c8]
https://dblp.org/rec/conf/iclr/HabibMSBSSKB20
Raza Habib, Soroosh Mariooryad, Matt Shannon, Eric Battenberg, R. J. Skerry-Ryan, Daisy Stanton, David Kao, Tom Bagby:
Semi-Supervised Generative Modeling for Controllable Speech Synthesis. ICLR 2020
[i5]
https://dblp.org/rec/journals/corr/abs-2010-08029
Matt Shannon, Ben Poole, Soroosh Mariooryad, Tom Bagby, Eric Battenberg, David Kao, Daisy Stanton, R. J. Skerry-Ryan:
Non-saturating GAN training as divergence minimization. CoRR abs/2010.08029 (2020)
[i4]
https://dblp.org/rec/journals/corr/abs-2011-03568
Ron J. Weiss, R. J. Skerry-Ryan, Eric Battenberg, Soroosh Mariooryad, Diederik P. Kingma:
Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis. CoRR abs/2011.03568 (2020)

2010 – 2019

2019
[i3]
https://dblp.org/rec/journals/corr/abs-1906-03402
Eric Battenberg, Soroosh Mariooryad, Daisy Stanton, R. J. Skerry-Ryan, Matt Shannon, David Kao, Tom Bagby:
Effective Use of Variational Embedding Capacity in Expressive End-to-End Speech Synthesis. CoRR abs/1906.03402 (2019)
[i2]
https://dblp.org/rec/journals/corr/abs-1910-01709
Raza Habib, Soroosh Mariooryad, Matt Shannon, Eric Battenberg, R. J. Skerry-Ryan, Daisy Stanton, David Kao, Tom Bagby:
Semi-Supervised Generative Modeling for Controllable Speech Synthesis. CoRR abs/1910.01709 (2019)
[i1]
https://dblp.org/rec/journals/corr/abs-1910-10288
Eric Battenberg, R. J. Skerry-Ryan, Soroosh Mariooryad, Daisy Stanton, David Kao, Matt Shannon, Tom Bagby:
Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis. CoRR abs/1910.10288 (2019)
2017
[j7]
https://dblp.org/rec/journals/taffco/MariooryadB17
Soroosh Mariooryad, Carlos Busso:
The Cost of Dichotomizing Continuous Labels for Binary Classification Problems: Deriving a Bayesian-Optimal Classifier. IEEE Trans. Affect. Comput. 8(1): 119-130 (2017)
2016
[j6]
https://dblp.org/rec/journals/taffco/MariooryadB16
Soroosh Mariooryad, Carlos Busso:
Facial Expression Recognition in the Presence of Speech Using Blind Lexical Compensation. IEEE Trans. Affect. Comput. 7(4): 346-359 (2016)
2015
[j5]
https://dblp.org/rec/journals/taffco/MariooryadB15
Soroosh Mariooryad, Carlos Busso:
Correcting Time-Continuous Emotional Labels by Modeling the Reaction Lag of Evaluators. IEEE Trans. Affect. Comput. 6(2): 97-108 (2015)
2014
[j4]
https://dblp.org/rec/journals/speech/MariooryadB14
Soroosh Mariooryad, Carlos Busso:
Compensating for speaker or lexical variabilities in speech for emotion recognition. Speech Commun. 57: 1-12 (2014)
[c7]
https://dblp.org/rec/conf/icassp/MariooryadKHS14
Soroosh Mariooryad, Anitha Kannan, Dilek Hakkani-Tür, Elizabeth Shriberg:
Automatic characterization of speaking styles in educational videos. ICASSP 2014: 4848-4852
[c6]
https://dblp.org/rec/conf/interspeech/MariooryadLB14
Soroosh Mariooryad, Reza Lotfian, Carlos Busso:
Building a naturalistic emotional speech corpus by retrieving expressive behaviors from existing speech corpora. INTERSPEECH 2014: 238-242
2013
[j3]
https://dblp.org/rec/journals/taffco/MariooryadB13
Soroosh Mariooryad, Carlos Busso:
Exploring Cross-Modality Affective Reactions for Audiovisual Emotion Recognition. IEEE Trans. Affect. Comput. 4(2): 183-196 (2013)
[j2]
https://dblp.org/rec/journals/taffco/BussoMMN13
Carlos Busso, Soroosh Mariooryad, Angeliki Metallinou, Shrikanth S. Narayanan:
Iterative Feature Normalization Scheme for Automatic Emotion Detection from Speech. IEEE Trans. Affect. Comput. 4(4): 386-397 (2013)
[c5]
https://dblp.org/rec/conf/acii/MariooryadB13
Soroosh Mariooryad, Carlos Busso:
Analysis and Compensation of the Reaction Lag of Evaluators in Continuous Emotional Annotations. ACII 2013: 85-90
[c4]
https://dblp.org/rec/conf/fgr/MariooryadB13
Soroosh Mariooryad, Carlos Busso:
Feature and model level compensation of lexical content for facial emotion recognition. FG 2013: 1-6
[c3]
https://dblp.org/rec/conf/icassp/TranMB13
Tam Tran, Soroosh Mariooryad, Carlos Busso:
Audiovisual corpus to analyze whisper speech. ICASSP 2013: 8101-8105
2012
[j1]
https://dblp.org/rec/journals/taslp/MariooryadB12
Soroosh Mariooryad, Carlos Busso:
Generating Human-Like Behaviors Using Joint, Speech-Driven Models for Conversational Agents. IEEE Trans. Audio Speech Lang. Process. 20(8): 2329-2340 (2012)
[c2]
https://dblp.org/rec/conf/icip/MariooryadB12
Soroosh Mariooryad, Carlos Busso:
Factorizing speaker, lexical and emotional variabilities observed in facial expressions. ICIP 2012: 2605-2608
2011
[c1]
https://dblp.org/rec/conf/interspeech/RahmanMKLHB11
Tauhidur Rahman, Soroosh Mariooryad, Shalini Keshavamurthy, Gang Liu, John H. L. Hansen, Carlos Busso:
Detecting Sleepiness by Fusing Classifiers Trained with Novel Acoustic Features. INTERSPEECH 2011: 3285-3288
