Stop the war!

Остановите войну!

for scientists:

default search action

combined dblp search
author search
venue search
publication search

ask others

EUROSPEECH/INTERSPEECH 2003: Geneva, Switzerland

> Home > Conferences and Workshops > EUROSPEECH
> Home > Conferences and Workshops > INTERSPEECH

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/2003
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/2003
8th European Conference on Speech Communication and Technology, EUROSPEECH 2003 - INTERSPEECH 2003, Geneva, Switzerland, September 1-4, 2003. ISCA 2003

Plenary Talks

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Church03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Church03
Kenneth Ward Church:
Speech and language processing: where have we been and where are we going? 1-4
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Kollmeier03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Kollmeier03
Birger Kollmeier:
Auditory principles in speech processing - do computers need silicon ears ? 5-8

Aurora Noise Robustness on SMALL Vocabulary Databases

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YaoVKL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YaoVKL03
Kaisheng Yao, Erik M. Visser, Oh-Wook Kwon, Te-Won Lee:
A speech processing front-end with eigenspace normalization for robust speech recognition in noisy automobile environments. 9-12
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LaiS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LaiS03
Yiu-Pong Lai, Man-Hung Siu:
Maximum likelihood normalization for robust speech recognition. 13-16
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/StoutenHDW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/StoutenHDW03
Veronique Stouten, Hugo Van hamme, Kris Demuynck, Patrick Wambacq:
Robust speech recognition using model-based feature enhancement. 17-20
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuH03
Jian Wu, Qiang Huo:
Several HKU approaches for robust speech recognition and their evaluation on Aurora connected digit recognition tasks. 21-24
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangHAK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangHAK03
Yadong Wang, Jesse Hansen, Gopi Krishna Allu, Ramdas Kumaresan:
Average instantaneous frequency (AIF) and average log-envelopes (ALE) for ASR with the Aurora 2 database. 25-28
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SasouATN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SasouATN03
Akira Sasou, Futoshi Asano, Kazuyo Tanaka, Satoshi Nakamura:
Adaptation of acoustic model using the gain-adapted HMM decomposition method. 29-32

ISCA Special Interest Group Session: "Hot Topics" in Speech Science and Technology

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BonastreBBCRM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BonastreBBCRM03
Jean-François Bonastre, Frédéric Bimbot, Louis-Jean Boë, Joseph P. Campbell, Douglas A. Reynolds, Ivan Magrin-Chagnolleau:
Person authentication by voice: a need for caution. 33-36
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BaillyCM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BaillyCM03
Gérard Bailly, Nick Campbell, Bernd Möbius:
ISCA special session: hot topics in speech synthesis. 37-40
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Gelder03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Gelder03
Béatrice de Gelder:
Perceiving emotions by ear and by eye. 41-44
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Greenberg03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Greenberg03
Steven Greenberg:
Strategies for automatic multi-tier annotation of spoken language corpora. 45-48
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeHCC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeHCC03
Lin-Shan Lee, Yuan Ho, Jia-fu Chen, Shun-Chuan Chen:
Why is the special structure of the language important for Chinese spoken language processing? - examples on spoken document retrieval, segmentation and summarization. 49-52

Speech Signal Processing 1-4

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WeruagaK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WeruagaK03
Luis Weruaga, Marián Képesi:
Speech analysis with the short-time chirp transform. 53-56
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ArroabarrenC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ArroabarrenC03
Ixone Arroabarren, Alfonso Carlosena:
Glottal spectrum based inverse filtering. 57-60
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KiranS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KiranS03
G. V. Kiran, Thippur V. Sreenivas:
A novel method of analysing and comparing responses of hearing aid algorithms using auditory time-frequency representation. 61-64
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PaliwalA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PaliwalA03
Kuldip K. Paliwal, Bishnu S. Atal:
Frequency-related representation of speech. 65-68
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RaykarDYP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RaykarDYP03
Vikas C. Raykar, Ramani Duraiswami, B. Yegnanarayana, S. R. Mahadeva Prasanna:
Tracking a moving speaker using excitation source information. 69-72
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DengBA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DengBA03
Li Deng, Issam Bazzi, Alex Acero:
Tracking vocal tract resonances using an analytical nonlinear predictor and a target-guided temporal constraint. 73-76
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LashkariM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LashkariM03
Khosrow Lashkari, Toshio Miki:
Optimization of the CELP model in the LSP domain. 1709-1712
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GillettK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GillettK03
Ben Gillett, Simon King:
Transforming voice quality. 1713-1716
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HiokaH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HiokaH03
Yusuke Hioka, Nozomu Hamada:
DOA estimation of speech signal using equilateral-triangular microphone array. 1717-1720
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PotamitisTFK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PotamitisTFK03
Ilyas Potamitis, George Tremoulis, Nikos Fakotakis, George Kokkinakis:
Multi-array fusion for beamforming and localization of moving speakers. 1721-1724
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShaoMC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShaoMC03
Xu Shao, Ben P. Milner, Stephen J. Cox:
Integrated pitch and MFCC extraction for speech reconstruction and speech recognition applications. 1725-1728
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LaaksonenHHN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LaaksonenHHN03
Lasse Laaksonen, Sakari Himanen, Ari Heikkinen, Jani Nurminen:
Exploiting time warping in AMR-NB and AMR-WB speech coders. 1729-1732
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Grashey03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Grashey03
Stephan Grashey:
A new approach to voice activity detection based on self-organizing maps. 1733-1736
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShigaK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShigaK03
Yoshinori Shiga, Simon King:
Estimating the spectral envelope of voiced speech using multi-frame analysis. 1737-1740
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JaferM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JaferM03
Essa Jafer, Abdulhussain E. Mahdi:
Adaptive noise estimation using second generation and perceptual wavelet transforms. 1741-1744
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Bourgeois03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Bourgeois03
Julien Bourgeois:
A clustering approach to on-line audio source separation. 1745-1748
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShigaK03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShigaK03a
Yoshinori Shiga, Simon King:
Estimation of voice source and vocal tract characteristics based on multi-frame analysis. 1749-1752
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/En-NajjaryRC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/En-NajjaryRC03
Taoufik En-Najjary, Olivier Rosec, Thierry Chonavel:
A new method for pitch prediction from spectral envelope and its application in voice conversion. 1753-1756
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OrlandiSF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OrlandiSF03
Marco Orlandi, Alfiero Santarelli, Daniele Falavigna:
Maximum likelihood endpoint detection with time-domain features. 1757-1760
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ArroabarrenC03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ArroabarrenC03a
Ixone Arroabarren, Alfonso Carlosena:
Unified analysis of glottal source spectrum. 1761-1764
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BouzidE03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BouzidE03
Aïcha Bouzid, Noureddine Ellouze:
Local regularity analysis at glottal opening and closure instants in electroglottogram signal using wavelet transform modulus maxima. 2837-2840
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SchaffonerKKW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SchaffonerKKW03
Martin Schafföner, Marcel Katz, Sven E. Krüger, Andreas Wendemuth:
Improved robustness of automatic speech recognition using a new class definition in linear discriminant analysis. 2841-2844
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TurkA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TurkA03
Oytun Türk, Levent M. Arslan:
Voice conversion methods for vocal tract and pitch contour modification. 2845-2848
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Schreiner03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Schreiner03
Olaf Schreiner:
Modulation spectrum for pitch and speech pause detection. 2849-2852
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DimitriadisM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DimitriadisM03
Dimitrios Dimitriadis, Petros Maragos:
Robust energy demodulation based on continuous models with application to speech recognition. 2853-2856
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KimKY03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimKY03
Jong Uk Kim, Sang-Gyun Kim, Chang D. Yoo:
A robust and sensitive word boundary decision algorithm. 2857-2860
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SeoJLY03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SeoJLY03
Seongho Seo, Dalwon Jang, Sunil Lee, Chang D. Yoo:
A novel transcoding algorithm for SMV and g.723.1 speech coders via direct parameter transformation. 2861-2864
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JangSLY03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JangSLY03
Dalwon Jang, Seongho Seo, Sunil Lee, Chang D. Yoo:
A novel rate selection algorithm for transcoding CELP-type codec and SMV. 2865-2868
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChoyHBSSC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChoyHBSSC03
Gary Choy, David Hermann, Robert L. Brennan, Todd Schneider, Hamid Sheikhzadeh, Etienne Cornu:
Subband-based acoustic shock limiting algorithm on a low-resource DSP system. 2869-2872
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PelleC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PelleC03
Patricia A. Pelle, Matias L. Capeletto:
Pitch estimation using phase locked loops. 2873-2876
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ArifiantoK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ArifiantoK03
Dhany Arifianto, Takao Kobayashi:
Performance evaluation of IFAS-based fundamental frequency estimator in noisy environment. 2877-2880
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KruschkeL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KruschkeL03
Hans Kruschke, Michael Lenz:
Estimation of the parameters of the quantitative intonation model with continuous wavelet analysis. 2881-2884
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RodriguezLEM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RodriguezLEM03
Francisco Romero Rodriguez, Wei Ming Liu, Nicholas W. D. Evans, John S. D. Mason:
Morphological filtering of speech spectrograms in the context of additive noise. 2885-2888
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LathoudMM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LathoudMM03
Guillaume Lathoud, Iain McCowan, Darren Moore:
Segmenting multiple concurrent speakers using microphone arrays. 2889-2892
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NagarajanMH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NagarajanMH03
T. Nagarajan, Hema A. Murthy, Rajesh M. Hegde:
Segmentation of speech into syllable-like units. 2893-2896
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PetrilloC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PetrilloC03
Massimo Petrillo, Francesco Cutugno:
A syllable segmentation algorithm for English and italian. 2913-2916
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VermaK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VermaK03
Ashish Verma, Arun Kumar:
Modeling speaking rate for voice fonts. 2917-2920
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Pohjalainen03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Pohjalainen03
Jouni Pohjalainen:
A new HMM-based approach to broad phonetic classification of speech. 2921-2924
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhongCL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhongCL03
Xin Zhong, Mark A. Clements, Sung Lim:
Acoustic change detection and segment clustering of two-way telephone conversations. 2925-2928
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Levin03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Levin03
David N. Levin:
Blind normalization of speech from different channels. 2929-2932
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GurijalaD03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GurijalaD03
Aparna Gurijala, John R. Deller Jr.:
Speech watermarking by parametric embedding with an l_(infinity) fidelity criterion. 2933-2936

Phonology and Phonetics I

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Tseng03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Tseng03
Shu-Chuan Tseng:
Features of contracted syllables of spontaneous Mandarin. 77-80
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Samudravijaya03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Samudravijaya03
K. Samudravijaya:
Durational characteristics of hindi stop consonants. 81-84
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Isei-Jaakkola03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Isei-Jaakkola03
Toshiko Isei-Jaakkola:
Quantity comparison of Japanese and finnish in various word structures. 85-88
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Baltazani03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Baltazani03
Mary Baltazani:
Broad focus across sentence types in greek. 89-92
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HansakunbuntheungTSS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HansakunbuntheungTSS03
Chatchawarn Hansakunbuntheung, Virongrong Tesprasit, Rungkarn Siricharoenchai, Yoshinori Sagisaka:
Analysis and modeling of syllable duration for Thai speech synthesis. 93-96
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Chen03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Chen03
Aoju Chen:
Reaction time as an indicator of discrete intonational contrasts in English. 97-100
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Gibbon03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Gibbon03
Dafydd Gibbon:
Corpus-based syntax-prosody tree matching. 761-764
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YingGW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YingGW03
D. W. Ying, W. Gao, W. Q. Wang:
A new approach to segment and detect syllables from high-speed speech. 765-768
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SonP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SonP03
R. J. J. H. van Son, Louis C. W. Pols:
Information structure and efficiency in speech production. 769-772
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CorazzaB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CorazzaB03
Anna Corazza, Louis ten Bosch:
Learning rule ranking by dynamic construction of context-free grammars using AND/OR graphs. 773-776
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZvonikC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZvonikC03
Elena Zvonik, Fred Cummins:
The effect of surrounding phrase lengths on pause duration. 777-780
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OkawaS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OkawaS03
Shigeki Okawa, Katsuhiko Shirai:
Statistical estimation of phoneme's most stable point based on universal constraint. 781-784
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Beringer03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Beringer03
Nicole Beringer:
Independent automatic segmentation by self-learning categorial pronunciation rules. 785-788
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BraunL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BraunL03
Bettina Braun, D. Robert Ladd:
Prosodic correlates of contrastive and non-contrastive themes in German. 789-792
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Chen03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Chen03a
Yiya Chen:
Accentual lengthening in standard Chinese: evidence from four-syllable constituents. 793-796
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Kanokphara03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Kanokphara03
Supphanat Kanokphara:
Syllable structure based phonetic units for context-dependent continuous Thai speech recognition. 797-800
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Hu03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Hu03
Fang Hu:
An acoustic phonetic analysis of diphthongs in ningbo Chinese. 801-804
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OtakeS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OtakeS03
Takashi Otake, Yoko Sakamoto:
Latent ability to manipulate phonemes by Japanese preliterates in roman alphabet. 805-808
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Pfitzinger03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Pfitzinger03
Hartmut R. Pfitzinger:
The /i/-/a/-/u/-ness of spoken vowels. 809-812

Topics in Prosody and Emotional Speech

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GillettK03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GillettK03a
Ben Gillett, Simon King:
Transforming F0 contours. 101-104
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CookFT03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CookFT03
Norman D. Cook, Takeshi Fujisawa, Kazuaki Takami:
Evaluation of the affect of speech intonation using a model of the perception of interval dissonance and harmonic tension. 105-108
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LaiWC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LaiWC03
Wen-Hsing Lai, Yih-Ru Wang, Sin-Horng Chen:
A new pitch modeling approach for Mandarin speech. 109-112
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZervasMFK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZervasMFK03
Panagiotis Zervas, Manolis Maragoudakis, Nikos Fakotakis, George Kokkinakis:
Bayesian induction of intonational phrase breaks. 113-116
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/EhretteCdM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/EhretteCdM03
Thibaut Ehrette, Noël Chateau, Christophe d'Alessandro, Valérie Maffiolo:
Predicting the perceptive judgment of voices in a telecom context: selection of acoustic parameters. 117-120
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Mattys03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Mattys03
Sven L. Mattys:
Stress-based speech segmentation revisited. 121-124
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KwonCHL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KwonCHL03
Oh-Wook Kwon, Kwokleung Chan, Jiucang Hao, Te-Won Lee:
Emotion recognition by speech signals. 125-128
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Tamburini03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Tamburini03
Fabio Tamburini:
Automatic prosodic prominence detection in speech using acoustic features: an unsupervised system. 129-132
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HozjanK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HozjanK03
Vladimir Hozjan, Zdravko Kacic:
Improved emotion recognition with large set of statistical features. 133-136
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CharnvivitTMLJ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CharnvivitTMLJ03
Patavee Charnvivit, Nuttakorn Thubthong, Ekkarit Maneenoi, Sudaporn Luksaneeyanawin, Somchai Jitapunkul:
Recognition of intonation patterns in Thai utterance. 137-140
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HiroseFNMF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HiroseFNMF03
Keikichi Hirose, Yusuke Furuyama, Shuichi Narusawa, Nobuaki Minematsu, Hiroya Fujisaki:
Use of linguistic information for automatic extraction of f_0 contour generation process model parameters. 141-144
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DohenLCS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DohenLCS03
Marion Dohen, Hélène Loevenbruck, Marie-Agnès Cathiard, Jean-Luc Schwartz:
Potential audiovisual correlates of contrastive focus in French. 145-148
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HatanoHI03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HatanoHI03
Toshie Hatano, Yasuo Horiuchi, Akira Ichikawa:
How does human segment the speech by prosody ? 149-152
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WalkerLMS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WalkerLMS03
Brenton D. Walker, Bradley C. Lackey, Jennifer S. Muller, Patrick John Schone:
Language-reconfigurable universal phone recognition. 153-156
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeN03
Chul Min Lee, Shrikanth S. Narayanan:
Emotion recognition using a data-driven fuzzy inference system. 157-160
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SuzukiYTK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SuzukiYTK03
Noriko Suzuki, Yohei Yabuta, Yugo Takeuchi, Yasuhiro Katagiri:
Effects of voice prosody by computers on human behaviors. 161-164
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JokischK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JokischK03
Oliver Jokisch, Marco Kühne:
An investigation of intensity patterns for German. 165-168
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TeixeiraF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TeixeiraF03
João Paulo Ramos Teixeira, Diamantino Freitas:
Segmental durations predicted with a neural network. 169-172
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YamashitaS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YamashitaS03
Takumi Yamashita, Yoshinori Sagisaka:
Generation and perception of f_0 markedness in conversational speech with adverbs expressing degrees. 173-176
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MixdorffBFM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MixdorffBFM03
Hansjörg Mixdorff, Nguyen Hung Bach, Hiroya Fujisaki, Chi Mai Luong:
Quantitative analysis and synthesis of syllabic tones in vietnamese. 177-180
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KiriyamaMHHIK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KiriyamaMHHIK03
Shinya Kiriyama, Yoshifumi Mitsuta, Yuta Hosokawa, Yoshikazu Hashimoto, Toshihiko Itoh, Shigeyoshi Kitazawa:
Japanese prosodic labeling support system utilizing linguistic information. 181-184
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AubergeAR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AubergeAR03
Véronique Aubergé, Nicolas Audibert, Albert Rilliard:
Why and how to control the authentic emotional speech corpora. 185-188
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DevillersV03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DevillersV03
Laurence Devillers, Ioana Vasilescu:
Prosodic cues for emotion characterization in real-life spoken dialogs. 189-192

Language Modeling, Discourse and Dialog

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PolifroniCS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PolifroniCS03
Joseph Polifroni, Grace Chung, Stephanie Seneff:
Towards the automatic generation of mixed-initiative dialogue systems from web content. 193-196
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FiliskoS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FiliskoS03
Edward Filisko, Stephanie Seneff:
A context resolution server for the galaxy conversational systems. 197-200
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HardyBBDRS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HardyBBDRS03
Hilda Hardy, Kirk Baker, Hélène Bonneau-Maynard, Laurence Devillers, Sophie Rosset, Tomek Strzalkowski:
Semantic and dialogic annotation for automated multilingual customer service. 201-204
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NicholsonBAFKSMLC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NicholsonBAFKSMLC03
H. B. M. Nicholson, Ellen Gurman Bard, Anne H. Anderson, María L. Flecha-García, David Kenicer, Lucy Smallwood, Jim Mullin, Robin J. Lickley, Yiya Chen:
Disfluency under feedback and time-pressure. 205-208
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HeemanYS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HeemanYS03
Peter A. Heeman, Fan Yang, Susan E. Strayer:
Control in task-oriented dialogues. 209-212
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/McTaitA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/McTaitA03
Kevin McTait, Martine Adda-Decker:
The 300k LIMSI German broadcast news transcription system. 213-216
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TianSH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TianSH03
Jilei Tian, Janne Suontausta, Juha Häkkinen:
Weighted entropy training for the decision tree based text-to-phoneme mapping. 217-220
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OgawaYSK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OgawaYSK03
Yoshihiko Ogawa, Hirofumi Yamamoto, Yoshinori Sagisaka, Gen-ichiro Kikui:
Word class modeling for speech recognition with out-of-task words using a hierarchical language model. 221-224
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OrdelmanHJ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OrdelmanHJ03
Roeland Ordelman, Arjan van Hessen, Franciska de Jong:
Compound decomposition in dutch large vocabulary speech recognition. 225-228
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SavovaB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SavovaB03
Guergana K. Savova, Joan Bachenko:
Designing for errors: similarities and differences of disfluency rates and prosodic characteristics across domains. 229-232
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Wester03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Wester03
Mirjam Wester:
Syllable classification using articulatory-acoustic features. 233-236
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZitouniSL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZitouniSL03
Imed Zitouni, Olivier Siohan, Chin-Hui Lee:
Hierarchical class n-gram language models: towards better estimation of unseen events in speech recognition. 237-240
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BarrachinaV03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BarrachinaV03
Sergio Barrachina, Juan Miguel Vilar:
Incremental and iterative monolingual clustering algorithms. 241-244
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VenkataramanW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VenkataramanW03
Anand Venkataraman, Wen Wang:
Techniques for effective vocabulary selection. 245-248
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Galescu03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Galescu03
Lucian Galescu:
Recognition of out-of-vocabulary words with sub-lexical language models. 249-252
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Bonneau-MaynardR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Bonneau-MaynardR03
Hélène Bonneau-Maynard, Sophie Rosset:
A semantic representation for spoken dialogs. 253-256
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Adda-Decker03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Adda-Decker03
Martine Adda-Decker:
A corpus-based decompounding algorithm for German lexical modeling in LVCSR. 257-260
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeC03
Kyong-Nim Lee, Minhwa Chung:
Modeling cross-morpheme pronunciation variations for korean large vocabulary continuous speech recognition. 261-264

Speech Synthesis: Unit Selection 1, 2

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhouZ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhouZ03
Yi Zhou, Yiqing Zu:
Unit selection based on voice recognition. 265-268
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuCDGL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuCDGL03
Jun Xu, Thomas Choy, Minghui Dong, Cuntai Guan, Haizhou Li:
On unit analysis for Cantonese corpus-based TTS. 269-272
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LambertBECM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LambertBECM03
Tanya Lambert, Andrew P. Breen, Barry Eggleton, Stephen J. Cox, Ben P. Milner:
Unit selection in concatenative TTS synthesis systems based on mel filter bank amplitudes and phonetic context. 273-276
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BozkurtOD03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BozkurtOD03
Baris Bozkurt, Özlem Öztürk, Thierry Dutoit:
Text design for TTS speech corpus building using a modified greedy selection. 277-280
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ParkKK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ParkKK03
Seung Seop Park, Chong Kyu Kim, Nam Soo Kim:
Discriminative weight training for unit-selection based speech synthesis. 281-284
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RuttenF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RuttenF03
Peter Rutten, Justin Fackrell:
The application of interactive speech unit selection in TTS systems. 285-288
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DiazB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DiazB03
Francisco Campillo Díaz, Eduardo Rodríguez Banga:
On the design of cost functions for unit-selection speech synthesis. 289-292
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VepaK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VepaK03
Jithendra Vepa, Simon King:
Kalman-filter based join cost for unit-selection speech synthesis. 293-296
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TodaKT03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TodaKT03
Tomoki Toda, Hisashi Kawai, Minoru Tsuzaki:
Optimizing integrated cost function for segment selection in concatenative speech synthesis based on perceptual evaluations. 297-300
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MatousekTP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MatousekTP03
Jindrich Matousek, Daniel Tihelka, Josef Psutka:
Automatic segmentation for czech concatenative speech synthesis using statistical approach with boundary-specific correction. 301-304
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KuoKCC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KuoKCC03
Chih-Chung Kuo, Chi-Shiang Kuo, Jau-Hung Chen, Sen-Chia Chang:
Automatic speech segmentation and verification for concatenative synthesis. 305-308
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PauloO03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PauloO03
Sérgio Paulo, Luís C. Oliveira:
DTW-based phonetic alignment using multiple acoustic features. 309-312
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KominekBB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KominekBB03
John Kominek, Christina L. Bennett, Alan W. Black:
Evaluating and correcting phoneme segmentation for unit selection synthesis. 313-316
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KlabbersS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KlabbersS03
Esther Klabbers, Jan P. H. van Santen:
Control and prediction of the impact of pitch modification on synthetic speech quality. 317-320
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AylettFR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AylettFR03
Matthew P. Aylett, Justin Fackrell, Peter Rutten:
My voice, your prosody: sharing a speaker specific prosody model across speakers in unit selection TTS. 321-324
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TesprasitCS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TesprasitCS03
Virongrong Tesprasit, Paisarn Charoenpornsawat, Virach Sornlertlamvanich:
Learning phrase break detection in Thai text-to-speech. 325-328
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KainS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KainS03
Alexander Kain, Jan P. H. van Santen:
A speech model of acoustic inventories based on asynchronous interpolation. 329-332
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HiroseOM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HiroseOM03
Keikichi Hirose, Takayuki Ono, Nobuaki Minematsu:
Corpus-based synthesis of fundamental frequency contours of Japanese using automatically-generated prosodic corpus and generation process model. 333-336
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KishoreB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KishoreB03
S. Prahallad Kishore, Alan W. Black:
Unit size in unit selection speech synthesis. 1317-1320
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SchweitzerBKMS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SchweitzerBKMS03
Antje Schweitzer, Norbert Braunschweiler, Tanja Klankert, Bernd Möbius, Bettina Säuberlich:
Restricted unlimited domain synthesis. 1321-1324
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FrancoisB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FrancoisB03
Hélène François, Olivier Boëffard:
Evaluation of units selection criteria in corpus-based speech synthesis. 1325-1328
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PucherNRNG03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PucherNRNG03
Michael Pucher, Friedrich Neubarth, Erhard Rank, Georg Niklfeld, Qi Guan:
Combining non-uniform unit selection with diphone based synthesis. 1329-1332
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AliasL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AliasL03
Francesc Alías, Xavier Llorà:
Evolutionary weight tuning based on diphone pairs for unit selection speech synthesis. 1333-1336
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AndersenH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AndersenH03
Ove Andersen, Charles Hoequist:
Keeping rare events rare. 1337-1340

Aurora Noise Robustness on LARGE Vocabulary Databases

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PariharP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PariharP03
Naveen Parihar, Joseph Picone:
Analysis of the Aurora large vocabulary evaluations. 337-340
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HilgerN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HilgerN03
Florian Hilger, Hermann Ney:
Evaluation of quantile based histogram equalization with filter combination on the Aurora 3 and 4 databases. 341-344
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RigazioNKJ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RigazioNKJ03
Luca Rigazio, Patrick Nguyen, David Kryze, Jean-Claude Junqua:
Large vocabulary noise robustness on Aurora4. 345-348
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/StoutenHDW03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/StoutenHDW03a
Veronique Stouten, Hugo Van hamme, Jacques Duchateau, Patrick Wambacq:
Evaluation of model-based feature enhancement on the AURORA-4 task. 349-352
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SeguraRBTR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SeguraRBTR03
José C. Segura, Javier Ramírez, M. Carmen Benítez, Ángel de la Torre, Antonio J. Rubio:
Improved feature extraction based on spectral noise reduction and nonlinear feature normalization. 353-356
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KimKLK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimKLK03
Young Joon Kim, Hyun Woo Kim, Woohyung Lim, Nam Soo Kim:
Feature compensation technique for robust speech recognition in noisy environments. 357-360

Multilingual Speech-to-Speech Translation

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Ney03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Ney03
Hermann Ney:
The statistical approach to machine translation and a roadmap for speech translation. 361-364
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Gao03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Gao03
Yuqing Gao:
Coupling vs. unifying: modeling techniques for speech-to-speech translation. 365-368
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WaibelBBFGLLLTRSWWZ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WaibelBBFGLLLTRSWWZ03
Alex Waibel, Ahmed Badran, Alan W. Black, Robert E. Frederking, Donna Gates, Alon Lavie, Lori S. Levin, Kevin A. Lenzo, Laura Mayfield Tomokiyo, Jürgen Reichert, Tanja Schultz, Dorcas Wallace, Monika Woszczyna, Jing Zhang:
Speechalator: two-way speech-to-speech translation on a consumer PDA. 369-372
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FrancoZPCAVVBRS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FrancoZPCAVVBRS03
Horacio Franco, Jing Zheng, Kristin Precoda, Federico Cesari, Victor Abrash, Dimitra Vergyri, Anand Venkataraman, Harry Bratt, Colleen Richey, Ace Sarich:
Development of phrase translation systems for handheld computers: from concept to field. 373-376
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Federico03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Federico03
Marcello Federico:
Evaluation frameworks for speech translation technologies. 377-380
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KikuiSTY03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KikuiSTY03
Gen-ichiro Kikui, Eiichiro Sumita, Toshiyuki Takezawa, Seiichi Yamamoto:
Creating corpora for speech-to-speech translation. 381-384

Prosody

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MinematsuMH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MinematsuMH03
Nobuaki Minematsu, Bungo Matsuoka, Keikichi Hirose:
Prosodic analysis and modeling of the NAGAUTA singing to synthesize its prosodic patterns from the standard notation. 385-388
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GharavianA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GharavianA03
Davood Gharavian, Seyed Mohammad Ahadi:
Statistical evaluation of the influence of stress on pitch frequency and phoneme durations in farsi language. 389-392
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenBHC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenBHC03
Ken Chen, Sarah Borys, Mark Hasegawa-Johnson, Jennifer Cole:
Prosody dependent speech recognition with explicit duration modelling at intonational phrase boundaries. 393-396
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TeixeiraFF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TeixeiraFF03
João Paulo Ramos Teixeira, Diamantino Freitas, Hiroya Fujisaki:
Prediction of fujisaki model's phrase commands. 397-400
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MutoSNMKS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MutoSNMKS03
Makiko Muto, Yoshinori Sagisaka, Takuro Naito, Daiju Maeki, Aki Kondo, Katsuhiko Shirai:
Corpus-based modeling of naturalness estimation in timing control for non-native speech. 401-404
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/IshiMC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/IshiMC03
Carlos Toshinori Ishi, Parham Mokhtari, Nick Campbell:
Perceptually-related acoustic-prosodic features of phrase finals in spontaneous speech. 405-408

Language Modeling

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LangloisSH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LangloisSH03
David Langlois, Kamel Smaïli, Jean Paul Haton:
Efficient linear combination for distant n-gram models. 409-412
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Emami03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Emami03
Ahmad Emami:
Improving a connectionist based syntactical language model. 413-416
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NakanoH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NakanoH03
Mikio Nakano, Timothy J. Hazen:
Using untranscribed user utterances for improving language models based on confidence scoring. 417-420
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChangLL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChangLL03
Pi-Chuan Chang, Shuo-Peng Liao, Lin-Shan Lee:
Improved Chinese broadcast news transcription by language modeling with temporally consistent training corpora and iterative phrase extraction. 421-424
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MoriNI03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MoriNI03
Shinsuke Mori, Masafumi Nishimura, Nobuyasu Itoh:
Language model adaptation using word clustering. 425-428
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LaneKMN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LaneKMN03
Ian R. Lane, Tatsuya Kawahara, Tomoko Matsui, Satoshi Nakamura:
Hierarchical topic classification for dialog speech recognition based on language model switching. 429-432

Speech Modeling and Features 1-4

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AlkuB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AlkuB03
Paavo Alku, Tom Bäckström:
Linear predictive method with low-frequency emphasis. 433-436
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JainH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JainH03
Pratibha Jain, Hynek Hermansky:
Beyond a single critical-band in TRAP based ASR. 437-440
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ValenteW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ValenteW03
Fabio Valente, Christian Wellekens:
Variational Bayesian GMM for speech recognition. 441-444
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WadaS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WadaS03
Yamato Wada, Masahide Sugiyama:
Time alignment for scenario and sounds with voice, music and BGM. 445-448
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NguyenA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NguyenA03
Phu Chien Nguyen, Masato Akagi:
Efficient quantization of speech excitation parameters using temporal decomposition. 449-452
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KommerH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KommerH03
Robert van Kommer, Béat Hirsbrunner:
Distributed genetic algorithm to discover a wavelet packet best basis for speech recognition. 453-456
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuangLW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangLW03
Chao-Shih Huang, Chin-Hui Lee, Hsiao-Chuan Wang:
New model-based HMM distances with applications to run-time ASR error estimation and model tuning. 457-460
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KaburagiK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KaburagiK03
Tokihiko Kaburagi, Koji Kawai:
Analysis of voice source characteristics using a constrained polynomial model. 461-464
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NiK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NiK03
Jinfu Ni, Hisashi Kawai:
Tone pattern discrimination combining parametric modeling and maximum likelihood estimation. 465-468
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WrigleyBWR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WrigleyBWR03
Stuart N. Wrigley, Guy J. Brown, Vincent Wan, Steve Renals:
Feature selection for the classification of crosstalk in multi-channel audio. 469-472
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Liu03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Liu03
Jingwei Liu:
A DTW-based DAG technique for speech and speaker feature analysis. 473-476
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SomervuoCZ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SomervuoCZ03
Panu Somervuo, Barry Y. Chen, Qifeng Zhu:
Feature transformations and combinations for improving ASR performance. 477-480
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Tseng03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Tseng03a
Chiu-yu Tseng:
On the role of intonation in the organization of Mandarin Chinese speech prosody. 481-484
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OhkawaYSIM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OhkawaYSIM03
Yuichi Ohkawa, Akihiro Yoshida, Motoyuki Suzuki, Akinori Ito, Shozo Makino:
An optimized multi-duration HMM for spontaneous speech recognition. 485-488
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KimBMS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimBMS03
Hyoung-Gook Kim, Edgar Berdahl, Nicolas Moreau, Thomas Sikora:
Speaker recognition using MPEG-7 descriptors. 489-492
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MachereyN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MachereyN03
Wolfgang Macherey, Hermann Ney:
A comparative study on maximum entropy and discriminative training for acoustic modeling in automatic speech recognition. 493-496
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZolnaySN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZolnaySN03
András Zolnay, Ralf Schlüter, Hermann Ney:
Extraction methods of voicing feature for robust speech recognition. 497-500
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ArmaniMOS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ArmaniMOS03
Luca Armani, Marco Matassoni, Maurizio Omologo, Piergiorgio Svaizer:
Use of a CSP-based voice activity detector for distant-talking ASR. 501-504
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OmarH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OmarH03
Mohamed Kamal Omar, Mark Hasegawa-Johnson:
Maximum conditional mutual information projection for speech recognition. 505-508
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GibbonGHLTT03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GibbonGHLTT03
Dafydd Gibbon, Ulrike Gut, Benjamin Hell, Karin Looks, Alexandra Thies, Thorsten Trippel:
A computational model of arm gestures in conversation. 813-816
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PitsikalisKM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PitsikalisKM03
Vassilis Pitsikalis, Iasonas Kokkinos, Petros Maragos:
Nonlinear analysis of speech signals: generalized dimensions and lyapunov exponents. 817-820
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MotlicekC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MotlicekC03
Petr Motlícek, Jan Cernocký:
Time-domain based temporal processing with application of orthogonal transformations. 821-824
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SchwarzMC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SchwarzMC03
Petr Schwarz, Pavel Matejka, Jan Cernocký:
Recognition of phoneme strings using TRAP technique. 825-828
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FegyoMT03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FegyoMT03
Tibor Fegyó, Péter Mihajlik, Péter Tatai:
Comparative study on hungarian acoustic model sets and training methods. 829-832
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CheveigneB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CheveigneB03
Alain de Cheveigné, Alexis Baskind:
F_0 estimation of one or several voices. 833-836
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SivadasH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SivadasH03
Sunil Sivadas, Hynek Hermansky:
In search of target class definition in tandem feature extraction. 837-840
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AdamiH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AdamiH03
André Gustavo Adami, Hynek Hermansky:
Segmentation of speech for speaker and language recognition. 841-844
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiS03
Xiang Li, Richard M. Stern:
Feature generation based on maximum classification probability for improved speech recognition. 845-848
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YaoPL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YaoPL03
Kaisheng Yao, Kuldip K. Paliwal, Te-Won Lee:
Speech recognition with a generative factor analyzed hidden Markov model. 849-852
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenCS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenCS03
Barry Y. Chen, Shuangyu Chang, Sunil Sivadas:
Learning discriminative temporal patterns in speech: development of novel TRAPS-like classifiers. 853-856
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ScanlonER03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ScanlonER03
Patricia Scanlon, Daniel P. W. Ellis, Richard B. Reilly:
Using mutual information to design class-specific phone recognizers. 857-860
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DuxansB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DuxansB03
Helenca Duxans, Antonio Bonafonte:
Estimation of GMM in voice conversion including unaligned data. 861-864
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TokudaZK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TokudaZK03
Keiichi Tokuda, Heiga Zen, Tadashi Kitamura:
Trajectory modeling based on HMMs with the explicit relationship between static and dynamic features. 865-868
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BauereckerNP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BauereckerNP03
Hermann Bauerecker, Climent Nadeu, Jaume Padrell:
On the advantage of frequency-filtering features for speech recognition with variable sampling frequencies. experiments with speechdatcar databases. 869-872
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MixdorffFCH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MixdorffFCH03
Hansjörg Mixdorff, Hiroya Fujisaki, Gao Peng Chen, Yu Hu:
Towards the automatic extraction of fujisaki model parameters for Mandarin. 873-876
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AireyG03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AireyG03
S. S. Airey, Mark J. F. Gales:
Product of Gaussians as a distributed representation for speech recognition. 877-880
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Petrinovic03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Petrinovic03
Davor Petrinovic:
Harmonic weighting for all-pole modeling of the voiced speech. 881-884
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NishizawaHM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NishizawaHM03
Nobuyuki Nishizawa, Keikichi Hirose, Nobuaki Minematsu:
Estimation of resonant characteristics based on AR-HMM modeling and spectral envelope conversion of vowel sounds. 885-888
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HermanskyJ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HermanskyJ03
Hynek Hermansky, Pratibha Jain:
Band-independent speech-event categories for TRAP based ASR. 1013-1016
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GrezlH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GrezlH03
Frantisek Grézl, Hynek Hermansky:
Local averaging and differentiating of spectral plane for TRAP-based ASR. 1017-1020
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WolfelMW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WolfelMW03
Matthias Wölfel, John W. McDonough, Alex Waibel:
Minimum variance distortionless response on a warped frequency scale. 1021-1024
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangO03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangO03
Xuechuan Wang, Douglas D. O'Shaughnessy:
Improving the efficiency of automatic speech recognition by feature transformation and dimensionality reduction. 1025-1028
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/StadermannR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/StadermannR03
Jan Stadermann, Gerhard Rigoll:
Distributed speech recognition on the WSJ task. 1029-1032
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/StukerMSW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/StukerMSW03
Sebastian Stüker, Florian Metze, Tanja Schultz, Alex Waibel:
Integrating multilingual articulatory features into speech recognition. 1033-1036
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Petek03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Petek03
Bojan Petek:
Locus equations determination using the speechdat(II). 2301-2304
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/EmontsL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/EmontsL03
Michael Emonts, Deryle Lonsdale:
A memory-based approach to Cantonese tone recognition. 2305-2308
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ManceboCB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ManceboCB03
David Escudero Mancebo, Valentín Cardeñoso-Payo, Antonio Bonafonte:
Experimental evaluation of the relevance of prosodic features in Spanish using machine learning techniques. 2309-2312
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NakataniIZ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NakataniIZ03
Tomohiro Nakatani, Toshio Irino, Parham Zolfaghari:
Dominance spectrum based v/UV classification and f_0 estimation. 2313-2316
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FujisakiNOF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FujisakiNOF03
Hiroya Fujisaki, Shuichi Narusawa, Sumio Ohno, Diamantino Freitas:
Analysis and modeling of f_0 contours of portuguese utterances based on the command-response model. 2317-2320
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JacksonMRH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JacksonMRH03
Philip J. B. Jackson, David M. Moreno, Martin J. Russell, Javier Hernando:
Covariation and weighting of harmonically decomposed streams for ASR. 2321-2324

Speech Enhancement 1, 2

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HeracleousNS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HeracleousNS03
Panikos Heracleous, Satoshi Nakamura, Kiyohiro Shikano:
A semi-blind source separation method for hands-free speech recognition of multiple talkers. 509-512
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KrasnyK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KrasnyK03
Leonid G. Krasny, Ali S. Khayrallah:
Influence of the waveguide propagation on the antenna performance in a car cabin. 513-516
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PotamitisTF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PotamitisTF03
Ilyas Potamitis, George Tremoulis, Nikos Fakotakis:
Multi-speaker DOA tracking using interactive multiple models and probabilistic data association. 517-520
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LuW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LuW03
Ching-Ta Lu, Hsiao-Chuan Wang:
Speech enhancement using weighting function based on the variance of wavelet coefficients. 521-524
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PotamitisF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PotamitisF03
Ilyas Potamitis, Eran Fishler:
Microphone array voice activity detection and noise suppression using wideband generalized likelihood ratio. 525-528
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaricJ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaricJ03
Zoran Saric, Slobodan Jovicic:
Adaptive beamforming in room with reverberation. 529-532
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JuL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JuL03
Gwo-hwa Ju, Lin-Shan Lee:
Perceptually-constrained generalized singular value decomposition-based approach for enhancing speech corrupted by colored noise. 533-536
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YamajoSTNS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YamajoSTNS03
Hiroaki Yamajo, Hiroshi Saruwatari, Tomoya Takatani, Tsuyoki Nishikawa, Kiyohiro Shikano:
Blind separation and deconvolution for convolutive mixture of speech using SIMO-model-based ICA and multichannel inverse filtering. 537-540
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RazaC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RazaC03
D. G. Raza, C. F. Chan:
Quality enhancement of CELP coded speech by using an MFCC based Gaussian mixture model. 541-544
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KimSMS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimSMS03
Hyoung-Gook Kim, Markus Schwab, Nicolas Moreau, Thomas Sikora:
Enhancement of noisy speech for noise robust front-end and speech reconstruction at back-end of DSR system. 545-548
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WeiDYZ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WeiDYZ03
Jianqiang Wei, Limin Du, Zhaoli Yan, Hui Zeng:
Improved kalman filter-based speech enhancement. 549-552
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/IrinoPK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/IrinoPK03
Toshio Irino, Roy D. Patterson, Hideki Kawahara:
Speech segregation based on fundamental event information using an auditory vocoder. 553-556
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YanDWZ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YanDWZ03
Zhaoli Yan, Limin Du, Jianqiang Wei, Hui Zeng:
Time delay estimation based on hearing characteristic. 557-560
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/StolbovKK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/StolbovKK03
Mikhail Stolbov, Serguei Koval, Mikhail Khitrov:
Parametric multi-band automatic gain control for noisy speech enhancement. 561-564
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/IserS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/IserS03
Bernd Iser, Gerhard Schmidt:
Neural networks versus codebooks in an application for bandwidth extension of speech signals. 565-568
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JaferM03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JaferM03a
Essa Jafer, Abdulhussain E. Mahdi:
Wavelet-based perceptual speech enhancement using adaptive threshold estimation. 569-572
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PotamitisFK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PotamitisFK03
Ilyas Potamitis, Nikos Fakotakis, George Kokkinakis:
A trainable speech enhancement technique based on mixture models for speech and noise. 573-576
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FuW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FuW03
Qiang Fu, Eric A. Wan:
Perceptual wavelet adaptive denoising of speech. 577-580
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YegnanarayanaPM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YegnanarayanaPM03
B. Yegnanarayana, S. R. Mahadeva Prasanna, Mathew Magimai-Doss:
Enhancement of speech in multispeaker environment. 581-584
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MizumachiN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MizumachiN03
Mitsunori Mizumachi, Satoshi Nakamura:
Noise reduction using paired-microphones on non-equally-spaced microphone arrangement. 585-588
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HodoshimaAIKK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HodoshimaAIKK03
Nao Hodoshima, Takayuki Arai, Tsuyoshi Inoue, Keisuke Kinoshita, Akiko Kusumoto:
Improving speech intelligibility by steady-state suppression as pre-processing in small to medium sized halls. 1365-1368
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeYCC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeYCC03
Chen-Long Lee, Ya-Ru Yang, Wen-Whei Chang, Yuan-Chuan Chiang:
Enhancement of hearing-impaired Mandarin speech. 1369-1372
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AlvarezNGM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AlvarezNGM03
Agustín Álvarez, Victor Nieto Lluis, Pedro Gómez Vilda, Rafael Martínez:
Speech enhancement for a car environment using LP residual signal and spectral subtraction. 1373-1376
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JuL03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JuL03a
Gwo-hwa Ju, Lin-Shan Lee:
Speech enhancement and improved recognition accuracy by integrating wavelet transform and spectral subtraction algorithm. 1377-1380
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MaheG03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MaheG03
Gaël Mahé, André Gilloire:
Multi-referenced correction of the voice timbre distortions in telephone networks. 1381-1384
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeLL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeLL03
J. J. Lee, J. H. Lee, K. Y. Lee:
Efficient speech enhancement based on left-right HMM with state sequence detection using LRT. 1385-1388
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GnabaAJS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GnabaAJS03
H. Gnaba, Monia Turki-Hadj Alouane, Meriem Jaïdane-Saïdane, Pascal Scalart:
Introduction of the CELP structure of the GSM coder in the acoustic echo canceller for the GSM network. 1389-1392
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SodoyerGJS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SodoyerGJS03
David Sodoyer, Laurent Girin, Christian Jutten, Jean-Luc Schwartz:
Extracting an AV speech source from a mixture of signals. 1393-1396
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Puder03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Puder03
Henning Puder:
Speech enhancement for hands-free car phones by adaptive compensation of harmonic engine noise components. 1397-1400
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HouJ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HouJ03
Zhaorong Hou, Ying Jia:
Enhance low-frequency suppression of GSC beamforming. 1401-1404
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SrinivasanSK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SrinivasanSK03
Sriram Srinivasan, Jonas Samuelsson, W. Bastiaan Kleijn:
Speech enhancement using a-priori information. 1405-1408
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HogdenVKM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HogdenVKM03
John Hogden, Patrick Valdez, Shigeru Katagiri, Erik McDermott:
Blind inversion of multidimensional functions for speech enhancement. 1409-1412
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AbutalebiSBF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AbutalebiSBF03
Hamid Reza Abutalebi, Hamid Sheikhzadeh, Robert L. Brennan, George H. Freeman:
Convergence improvement for oversampled subband adaptive noise and echo cancellation. 1413-1416
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/UnokiSA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/UnokiSA03
Masashi Unoki, Keigo Sakata, Masato Akagi:
A speech dereverberation method based on the MTF concept. 1417-1420
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KimKY03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimKY03a
Sang-Gyun Kim, Jong Uk Kim, Chang D. Yoo:
Accuracy improved double-talk detector based on state transition diagram. 1421-1424
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NatarajanHAR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NatarajanHAR03
Ajay Natarajan, John H. L. Hansen, Kathryn Hoberg Arehart, Jessica Rossi-Katz:
Perceptual based speech enhancement for normal-hearing and hearing-impaired individuals. 1425-1428
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OrtegaLM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OrtegaLM03
Alfonso Ortega, Eduardo Lleida, Enrique Masgrau:
Residual echo power estimation for speech reinforcement systems in vehicles. 1429-1432
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/QianK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/QianK03
Yasheng Qian, Peter Kabal:
Dual-mode wideband speech recovery from narrowband speech. 1433-1436
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Al-NaimiSK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Al-NaimiSK03
Khaldoon Al-Naimi, Christian Sturt, Ahmet M. Kondoz:
A robust noise and echo canceller. 1437-1440
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NixKH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NixKH03
Johannes Nix, Michael Kleinschmidt, Volker Hohmann:
Computational auditory scene analysis by using statistics of high-dimensional speech dynamics and sound source direction. 1441-1444

Spoken Dialog Systems 1, 2

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WittW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WittW03
Silke M. Witt, Jason D. Williams:
Two studies of open vs. directed dialog strategies in spoken dialog systems. 589-592
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ONeillHLM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ONeillHLM03
Ian M. O'Neill, Philip Hanna, Xingkun Liu, Michael F. McTear:
The queen's communicator: an object-oriented dialogue manager. 593-596
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BohusR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BohusR03
Dan Bohus, Alexander I. Rudnicky:
Ravenclaw: dialog management using hierarchical task decomposition and an expectation agenda. 597-600
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MachereyN03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MachereyN03a
Klaus Macherey, Hermann Ney:
Features for tree based dialogue course management. 601-604
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TorresSS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TorresSS03
Francisco Torres, Emilio Sanchis, Encarna Segarra:
Development of a stochastic dialog manager driven by semantics. 605-608
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TakeuchiKN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TakeuchiKN03
Masashi Takeuchi, Norihide Kitaoka, Seiichi Nakagawa:
Generation of natural response timing using decision tree based on prosodic and linguistic information. 609-612
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BellG03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BellG03
Linda Bell, Joakim Gustafson:
Child and adult speaker adaptation during error resolution in a publicly available spoken dialogue system. 613-616
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/EsteveRBM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/EsteveRBM03
Yannick Estève, Christian Raymond, Frédéric Béchet, Renato de Mori:
Conceptual decoding for spoken dialog systems. 617-620
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangL03
Huei-Ming Wang, Yi-Chung Lin:
Sentence verification in spoken dialogue system. 621-624
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KitaokaKN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KitaokaKN03
Norihide Kitaoka, Naoko Kakutani, Seiichi Nakagawa:
Detection and recognition of correction utterance in spontaneously spoken dialog. 625-628
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/EkanadhamH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/EkanadhamH03
Chaitanya Ekanadham, Juan M. Huerta:
Topic-specific parser design in an air travel natural language understanding application. 629-632
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CoxC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CoxC03
Stephen J. Cox, Gavin C. Cawley:
The use of confidence measures in vector based call-routing. 633-636
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BechetRH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BechetRH03
Frédéric Béchet, Giuseppe Riccardi, Dilek Z. Hakkani-Tür:
Multi-channel sentence classification for spoken dialogue language modeling. 637-640
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SeneffWH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SeneffWH03
Stephanie Seneff, Chao Wang, Timothy J. Hazen:
Automatic induction of n-gram language models from a natural language grammar. 641-644
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VilarCS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VilarCS03
David Vilar, María José Castro, Emilio Sanchis:
Connectionist classification and specific stochastic models in the understanding process of a dialogue system. 645-648
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BoyeW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BoyeW03
Johan Boye, Mats Wirén:
Robust parsing of utterances in negotiative dialogue. 649-652
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuY03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuY03
Chung-Hsien Wu, Gwo-Lang Yan:
Flexible speech act identification of spontaneous speech with disfluency. 653-656
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DohsakaYA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DohsakaYA03
Kohji Dohsaka, Norihito Yasuda, Kiyoaki Aikawa:
Efficient spoken dialogue control depending on the speech recognition rate and system's database. 657-660
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TakahashiMMT03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TakahashiMMT03
Shinya Takahashi, Tsuyoshi Morimoto, Sakashi Maeda, Naoyuki Tsuruta:
Robust speech understanding based on expected discourse plan. 661-664
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/IsobeHMMTI03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/IsobeHMMTI03
Toshihiro Isobe, Shoji Hayakawa, Hiroya Murao, Tatsuji Mizutani, Kazuya Takeda, Fumitada Itakura:
A study on domain recognition of spoken dialogue systems. 1889-1892
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HeLY03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HeLY03
Wei He, Honglian Li, Baozong Yuan:
Domain adaptation augmented by state-dependence in spoken dialog systems. 1893-1896
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PorteleGEKTV03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PorteleGEKTV03
Thomas Portele, Silke Goronzy, Martin C. Emele, Andreas Kellner, Sunna Torge, Jürgen te Vrugt:
Smartkom-home - an advanced multi-modal interface to home entertainment. 1897-1900
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuDAN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuDAN03
Yunbiao Xu, Fengying Di, Masahiro Araki, Yasuhisa Niimi:
Methods to improve its portability of a spoken dialog system both on task domains and languages. 1901-1904
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FegyoMSTT03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FegyoMSTT03
Tibor Fegyó, Péter Mihajlik, Máté Szarvas, Péter Tatai, Gábor Tatai:
Voxenter^TM - intelligent voice enabled call center for hungarian. 1905-1908
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuangC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangC03
Qiang Huang, Stephen J. Cox:
Automatic call-routing without transcriptions. 1909-1912
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TurunenH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TurunenH03
Markku Turunen, Jaakko Hakulinen:
Jaspis^2 - an architecture for supporting distributed spoken dialogues. 1913-1916
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZibertMHIM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZibertMHIM03
Janez Zibert, Sanda Martincic-Ipsic, Melita Hajdinjak, Ivo Ipsic, France Mihelic:
Development of a bilingual spoken dialog system for weather information retrieval. 1917-1920
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AllenADF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AllenADF03
James Allen, David Attwater, Peter J. Durston, Mark Farrell:
Improving "how may i help you?" systems using the output of recognition lattices. 1921-1924
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AndornoFLNPRV03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AndornoFLNPRV03
Marco Andorno, Luciano Fissore, Pietro Laface, Mario Nigra, Cosmin Popovici, Franco Ravera, Claudio Vair:
Incremental learning of new user formulations in automatic directory assistance. 1925-1928
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BacaZGP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BacaZGP03
Julie Baca, Feng Zheng, Hualin Gao, Joseph Picone:
Dialog systems for automotive environments. 1929-1932
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NetoMCO03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NetoMCO03
João Paulo Neto, Nuno J. Mamede, Renato Cassaca, Luís C. Oliveira:
The development of a multi-purpose spoken dialogue system. 1933-1936
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GoronzyVES03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GoronzyVES03
Silke Goronzy, Zica Valsan, Martin C. Emele, Juergen Schimanowski:
The dynamic, multi-lingual lexicon in smartkom. 1937-1940
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HigashinakaMNA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HigashinakaMNA03
Ryuichiro Higashinaka, Noboru Miyazaki, Mikio Nakano, Kiyoaki Aikawa:
Evaluating discourse understanding in spoken dialogue systems. 1941-1944
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Larsen03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Larsen03
Lars Bo Larsen:
Assessment of spoken dialogue system usability - what are we really measuring? 1945-1948
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SmeeleW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SmeeleW03
Paula M. T. Smeele, Juliette A. J. S. Waals:
Evaluation of a speech-driven telephone information service using the PARADISE framework: a closer look at subjective measures. 1949-1952
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MollerS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MollerS03
Sebastian Möller, Janto Skowronek:
Quantifying the impact of system characteristics on perceived quality dimensions of a spoken dialogue service. 1953-1956
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RamaswamyZA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RamaswamyZA03
Ganesh N. Ramaswamy, Ran D. Zilca, Oleg Alecksandrovich:
A programmable policy manager for conversational biometrics. 1957-1960
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HazenJPKR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HazenJPKR03
Timothy J. Hazen, Douglas A. Jones, Alex Park, Linda C. Kukolich, Douglas A. Reynolds:
Integration of speaker recognition into conversational spoken dialogue systems. 1961-1964

Robust Speech Recognition - Noise Compensation

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ObuchiS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ObuchiS03
Yasunari Obuchi, Richard M. Stern:
Normalization of time-derivative parameters using histogram equalization. 665-668
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangOF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangOF03
Zhipeng Zhang, Kiyotaka Otsuji, Sadaoki Furui:
Tree-structured noise-adapted HMM modeling for piecewise linear-transformation-based adaptation. 669-672
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhuNPW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhuNPW03
Donglai Zhu, Satoshi Nakamura, Kuldip K. Paliwal, Ren-Hua Wang:
Maximum likelihood sub-band weighting for robust speech recognition. 673-676
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KimAK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimAK03
Wooil Kim, Sungjoo Ahn, Hanseok Ko:
Feature compensation scheme based on parallel combined mixture model. 677-680
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DroppoDA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DroppoDA03
Jasha Droppo, Li Deng, Alex Acero:
A comparison of three non-linear observation models for noisy speech features. 681-684
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DaoudiD03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DaoudiD03
Khalid Daoudi, Murat Deviren:
A new supervised-predictive compensation scheme for noisy speech recognition. 685-688

Forensic Speaker Recognition

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DrygajloMA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DrygajloMA03
Andrzej Drygajlo, Didier Meuwly, Anil Alexander:
Statistical methods and Bayesian interpretation of evidence in forensic automatic speaker recognition. 689-692
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Gonzalez-RodriguezGGRO03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Gonzalez-RodriguezGGRO03
Joaquin Gonzalez-Rodriguez, Daniel Garcia-Romero, Marta Garcia-Gomar, Daniel Ramos, Javier Ortega-Garcia:
Robust likelihood ratio estimation in Bayesian forensic speaker recognition. 693-696
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Nakasone03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Nakasone03
Hirotaka Nakasone:
Automated speaker recognition in real world conditions: controlling the uncontrollable. 697-700
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PfisterB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PfisterB03
Beat Pfister, René Beutler:
Estimating the weight of evidence in forensic speaker verification. 701-704
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Gfrorer03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Gfrorer03
Stefan G. Gfrörer:
Auditory-instrumental forensic speaker recognition. 705-708
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KerstholtJAB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KerstholtJAB03
Jose H. Kerstholt, E. J. M. Jansen, A. G. van Amelsvoort, A. P. A. Broeders:
Earwitness line-ups: effects of speech duration, retention interval and acoustic environment on identification accuracy. 709-712

Emotion in Speech

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AmirZC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AmirZC03
Noam Amir, Shirley Ziv, Rachel Cohen:
Characteristics of authentic anger in hebrew speech. 713-716
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SeppanenVT03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SeppanenVT03
Tapio Seppänen, Eero Väyrynen, Juhani Toivanen:
Prosody-based classification of emotions in spoken finnish. 717-720
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RahurkarH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RahurkarH03
Mandar A. Rahurkar, John H. L. Hansen:
Frequency distribution based weighted sub-band approach for classification of emotional/stressful content in speech. 721-724
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiscombeVH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiscombeVH03
Jackson Liscombe, Jennifer J. Venditti, Julia Hirschberg:
Classifying subject ratings of emotional speech using acoustic features. 725-728
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YacoubSLB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YacoubSLB03
Sherif M. Yacoub, Steven J. Simske, Xiaofan Lin, John Burns:
Recognition of emotions in interactive voice response systems. 729-732
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BatlinerZFASN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BatlinerZFASN03
Anton Batliner, Viktor Zeißler, Carmen Frank, Johann Adelhardt, Rui Ping Shi, Elmar Nöth:
We are not amused - but how do you know? user states in a multi-modal dialogue system. 733-736

Dialog System User and Domain Modeling

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Bernsen03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Bernsen03
Niels Ole Bernsen:
On-line user modelling in a mobile spoken dialogue system. 737-740
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Pakucs03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Pakucs03
Botond Pakucs:
Towards dynamic multi-domain dialogue processing. 741-744
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KomataniUKO03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KomataniUKO03
Kazunori Komatani, Shinichi Ueno, Tatsuya Kawahara, Hiroshi G. Okuno:
User modeling in spoken dialogue systems for flexible guidance generation. 745-748
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SeneffCW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SeneffCW03
Stephanie Seneff, Grace Chung, Chao Wang:
Empowering end users to personalize dialogue systems through spoken interaction. 749-752
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RauxLBE03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RauxLBE03
Antoine Raux, Brian Langner, Alan W. Black, Maxine Eskénazi:
LET's GO: improving spoken dialog systems for the elderly and non-natives. 753-756
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HakulinenTS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HakulinenTS03
Jaakko Hakulinen, Markku Turunen, Esa-Pekka Salonen:
Agents for integrated tutoring in spoken dialogue systems. 757-760

Topics in Speech Recognition and Segmentation

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KimK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimK03
Taeyoon Kim, Hanseok Ko:
Utterance verification under distributed detection and fusion framework. 889-892
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HoM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HoM03
Simon Ka-Lung Ho, Brian Mak:
Joint estimation of thresholds in a bi-threshold verification problem. 893-896
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NeftiBM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NeftiBM03
Samir Nefti, Olivier Boëffard, Thierry Moudenc:
Confidence measures for phonetic segmentation of continuous speech. 897-900
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WiggersR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WiggersR03
Pascal Wiggers, Léon J. M. Rothkrantz:
Using confidence measures and domain knowledge to improve speech recognition. 901-904
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ThambiratnamS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ThambiratnamS03
Kishan Thambiratnam, Sridha Sridharan:
Isolated word verification using cohort word-level verification. 905-908
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AuS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AuS03
Wing-Hei Au, Man-Hung Siu:
A new approach to minimize utterance verification error rate for a specific operating point. 909-912
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YanGZ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YanGZ03
Binfeng Yan, Rui Guo, Xiaoyan Zhu:
Continuous speech recognition and verification based on a combination score. 913-916
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FabianLRT03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FabianLRT03
Tibor Fábián, Robert Lieb, Günther Ruske, Matthias Thomae:
Impact of word graph density on the quality of posterior probability based confidence measures. 917-920
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HeracleousS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HeracleousS03
Panikos Heracleous, Tohru Shimizu:
An efficient keyword spotting technique using a complementary language for filler models training. 921-924
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LevitAGN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LevitAGN03
Michael Levit, Hiyan Alshawi, Allen L. Gorin, Elmar Nöth:
Context-sensitive evaluation and correction of phone recognition output. 925-928
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DengMA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DengMA03
Yonggang Deng, Milind Mahajan, Alex Acero:
Estimating speech recognition error rate without acoustic test data. 929-932
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BisaniN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BisaniN03
Maximilian Bisani, Hermann Ney:
Multigram-based grapheme-to-phoneme conversion for LVCSR. 933-936
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BeutlerP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BeutlerP03
René Beutler, Beat Pfister:
Integrating statistical and rule-based knowledge for continuous German speech recognition. 937-940
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VandecatseyeM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VandecatseyeM03
An Vandecatseye, Jean-Pierre Martens:
A fast, accurate and stream-based speaker segmentation and clustering algorithm. 941-944
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChengW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChengW03
Shih-Sian Cheng, Hsin-Min Wang:
A sequential metric-based audio segmentation method via the Bayesian information criterion. 945-948
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SrivastavaK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SrivastavaK03
Amit Srivastava, Francis Kubala:
Sentence boundary detection in arabic speech. 949-952
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FranzRWP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FranzRWP03
Martin Franz, Bhuvana Ramabhadran, Todd Ward, Michael Picheny:
Automated transcription and topic segmentation of large spoken archives. 953-956
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuSS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuSS03
Yang Liu, Elizabeth Shriberg, Andreas Stolcke:
Automatic disfluency identification in conversational speech using multiple knowledge sources. 957-960
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YamamotoOA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YamamotoOA03
Natsuo Yamamoto, Jun Ogata, Yasuo Ariki:
Topic segmentation and retrieval system for lecture videos based on spontaneous speech recognition. 961-964

Robust Speech Recognition - Acoustic Modeling

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MarkovDIN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MarkovDIN03
Konstantin Markov, Jianwu Dang, Yosuke Iizuka, Satoshi Nakamura:
Hybrid HMM/BN ASR system integrating spectrum and articulatory features. 965-968
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/StemmerZHNN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/StemmerZHNN03
Georg Stemmer, Viktor Zeißler, Christian Hacker, Elmar Nöth, Heinrich Niemann:
Context-dependent output densities for hidden Markov models in speech recognition. 969-972
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShinozakiF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShinozakiF03
Takahiro Shinozaki, Sadaoki Furui:
Time adjustable mixture weights for speaking rate fluctuation. 973-976
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuH03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuH03a
Jian Wu, Qiang Huo:
A switching linear Gaussian hidden Markov model and its application to nonstationary noise compensation for robust speech recognition. 977-980
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TyagiMBM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TyagiMBM03
Vivek Tyagi, Iain McCowan, Hervé Bourlard, Hemant Misra:
On factorizing spectral dynamics for robust speech recognition. 981-984
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JiaDX03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JiaDX03
Chuan Jia, Peng Ding, Bo Xu:
Joint model and feature based compensation for robust speech recognition under non-stationary noise environments. 985-988

Advanced Machine Learning Algorithms for Speech and Language Processing

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CortesHM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CortesHM03
Corinna Cortes, Patrick Haffner, Mehryar Mohri:
Weighted automata kernels - general framework and algorithms. 989-992
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AltunH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AltunH03
Yasemin Altun, Thomas Hofmann:
Large margin methods for label sequence learning. 993-996
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Ratsch03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Ratsch03
Gunnar Rätsch:
Robust multi-class boosting. 997-1000
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaulSL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaulSL03
Lawrence K. Saul, Fei Sha, Daniel D. Lee:
Statistical signal processing with nonnegativity constraints. 1001-1004
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GargW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GargW03
Ashutosh Garg, Manfred K. Warmuth:
Inline updates for HMMs. 1005-1008
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Roweis03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Roweis03
Sam T. Roweis:
Factorial models and refiltering for speech separation and denoising. 1009-1012

Multi-Modal Spoken Language Processing

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KleinT03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KleinT03
Alexandra Klein, Harald Trost:
Using corpus-based methods for spoken access to news texts on the web. 1037-1040
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BrungartSK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BrungartSK03
Douglas Brungart, Brian D. Simpson, Alexander J. Kordik:
Cross-modal informational masking due to mismatched audio cues in a speechreading task. 1041-1044
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Berthommier03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Berthommier03
Frédéric Berthommier:
Audiovisual speech enhancement based on the association between speech envelope and video features. 1045-1048
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WasingerSK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WasingerSK03
Rainer Wasinger, Christoph Stahl, Antonio Krüger:
Robust speech interaction in a mobile environment through the use of multiple and different media input types. 1049-1052
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WoltjerTC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WoltjerTC03
Rogier Woltjer, Wah Jin Tan, Fang Chen:
Speech-based, manual-visual, and multi-modal interaction with an in-car computer - evaluation of a pilot study. 1053-1056
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ProdanovD03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ProdanovD03
Plamen J. Prodanov, Andrzej Drygajlo:
Bayesian networks for spoken dialogue management in multimodal systems of tour-guide robots. 1057-1060

Speech Coding and Transmission

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChuM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChuM03
Wai C. Chu, Toshio Miki:
Optimization of window and LSF interpolation factor for the ITU-t g.729 speech coding standard. 1061-1064
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChangSK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChangSK03
Joon-Hyuk Chang, Jong Won Shin, Nam Soo Kim:
Likelihood ratio test with complex laplacian model for voice activity detection. 1065-1068
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Nurminen03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Nurminen03
Jani Nurminen:
Multi-mode quantization of adjacent speech parameters using a low-complexity prediction scheme. 1069-1072
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SinervoNHS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SinervoNHS03
Ulpu Sinervo, Jani Nurminen, Ari Heikkinen, Jukka Saarinen:
Multi-mode matrix quantizer for low bit rate LSF quantization. 1073-1076
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MertzTVV03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MertzTVV03
Frank Mertz, Hervé Taddei, Imre Varga, Peter Vary:
Voicing controlled frame loss concealment for adaptive multi-rate (AMR) speech frames in voice-over-IP. 1077-1080
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LahdekorpiNHS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LahdekorpiNHS03
Marja Lahdekorpi, Jani Nurminen, Ari Heikkinen, Jukka Saarinen:
Perceptual irrelevancy removal in narrowband speech coding. 1081-1084
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JeuCC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JeuCC03
Charles du Jeu, Maurice Charbit, Gérard Chollet:
Very-low-rate speech compression by indexation of polyphones. 1085-1088
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SanchezPGP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SanchezPGP03
Victoria E. Sánchez, Antonio M. Peinado, Angel M. Gomez, José L. Pérez-Córdoba:
Entropy-optimized channel error mitigation with application to speech recognition over wireless. 1089-1092
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KrishnanA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KrishnanA03
Venkatesh Krishnan, David V. Anderson:
Robust jointly optimized multistage vector quantization for speech coding. 1093-1096
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PoblothVK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PoblothVK03
Harald Pobloth, Renat Vafin, W. Bastiaan Kleijn:
Polar quantization of sinusoids from speech signal blocks. 1097-1100
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YoonCKY03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YoonCKY03
Sung-Wan Yoon, Jin-Kyu Choi, Hong-Goo Kang, Dae Hee Youn:
Transcoding algorithm for g.723.1 and AMR speech coders: for interoperability between voIP and mobile networks. 1101-1104
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PetrinovicP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PetrinovicP03
Davorka Petrinovic, Davor Petrinovic:
Quality-complexity trade-off in predictive LSF quantization. 1105-1108
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KikuiriNO03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KikuiriNO03
Kei Kikuiri, Nobuhiko Naka, Tomoyuki Ohya:
Variable bit rate control with trellis diagram approximation. 1109-1112
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SrinivasamurthyON03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SrinivasamurthyON03
Naveen Srinivasamurthy, Antonio Ortega, Shrikanth S. Narayanan:
Towards optimal encoding for classification with applications to distributed speech recognition. 1113-1116
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RaadBM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RaadBM03
Mohammed Raad, Ian S. Burnett, Alfred Mertins:
Multi-rate extension of the scalable to lossless PSPIHT audio coder. 1117-1120
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShabestaryHN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShabestaryHN03
Turaj Zakizadeh Shabestary, Per Hedelin, Fredrik Nordén:
Entropy constrained quantization of LSP parameters. 1121-1124

Speech Recognition - Search and Lexicon Modeling

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KobayashiON03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KobayashiON03
Akio Kobayashi, Franz Josef Och, Hermann Ney:
Named entity extraction from Japanese broadcast news. 1125-1128
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ParkAC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ParkAC03
Young-Hee Park, Dong-Hoon Ahn, Minhwa Chung:
Morpheme-based lexical modeling for korean broadcast news transcription. 1129-1132
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WachterDCW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WachterDCW03
Mathias De Wachter, Kris Demuynck, Dirk Van Compernolle, Patrick Wambacq:
Data driven example based continuous speech recognition. 1133-1136
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AstrovA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AstrovA03
Sergey Astrov, Bernt Andrassy:
Large vocabulary speaker independent isolated word recognition for embedded systems. 1137-1140
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Seward03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Seward03
Alexander Seward:
Low-latency incremental speech transcription in the synface project. 1141-1144
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KanthakN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KanthakN03
Stephan Kanthak, Hermann Ney:
Multilingual acoustic modeling using graphemes. 1145-1148
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FujiiIAI03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FujiiIAI03
Atsushi Fujii, Katunobu Itou, Tomoyosi Akiba, Tetsuya Ishikawa:
A cross-media retrieval system for lecture videos. 1149-1152
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FujiiI03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FujiiI03
Atsushi Fujii, Katunobu Itou:
Building a test collection for speech-driven web retrieval. 1153-1156
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NovakR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NovakR03
Miroslav Novak, Diego Ruiz:
Confidence measure driven scalable two-pass recognition strategy for large list grammars. 1157-1160
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AbdouS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AbdouS03
Sherif M. Abdou, Michael S. Scordilis:
An efficient, fast matching approach using posterior probability estimates in speech recognition. 1161-1164
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HaciogluPCOKC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HaciogluPCOKC03
Kadri Hacioglu, Bryan L. Pellom, Tolga Çiloglu, Özlem Öztürk, Mikko Kurimo, Mathias Creutz:
On lexicon creation for turkish LVCSR. 1165-1168
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Chen03b
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Chen03b
Stanley F. Chen:
Compiling large-context phonetic decision trees into finite-state transducers. 1169-1172
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MaskeyH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MaskeyH03
Sameer Maskey, Julia Hirschberg:
Automatic summarization of broadcast news using structural features. 1173-1176
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YanZZPHL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YanZZPHL03
Yonghong Yan, Chengyi Zheng, Jianping Zhang, Jielin Pan, Jiang Han, Jian Liu:
A dynamic cross-reference pruning strategy for multiple feature fusion at decoder run time. 1177-1180
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LamereKWGSRW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LamereKWGSRW03
Paul Lamere, Philip Kwok, William Walker, Evandro B. Gouvêa, Rita Singh, Bhiksha Raj, Peter Wolf:
Design of the CMU sphinx-4 decoder. 1181-1184
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CilingirD03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CilingirD03
Onur Cilingir, Mübeccel Demirekler:
A new decoder design for large vocabulary turkish speech recognition. 1185-1188

Speech Technology Applications

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GreenCHEHP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GreenCHEHP03
Phil D. Green, James Carmichael, Athanassios Hatzis, Pam Enderby, Mark S. Hawley, Mark Parker:
Automatic speech recognition with sparse training data for dysarthric speakers. 1189-1192
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/InoueMY03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/InoueMY03
Akira Inoue, Takayoshi Mikami, Yoichi Yamashita:
Prediction of sentence importance for speech summarization using prosodic parameters. 1193-1196
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangLC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangLC03
Chong-kai Wang, Ren-Yuan Lyu, Yuang-Chin Chiang:
An automatic singing transcription system with multilingual singing lyric recognizer and robust melody tracker. 1197-1200
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GotoOIK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GotoOIK03
Masataka Goto, Yukihiro Omoto, Katunobu Itou, Tetsunori Kobayashi:
Speech shift: direct speech-input-mode switching through intentional control of voice pitch. 1201-1204
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MatsushitaNUKN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MatsushitaNUKN03
Masahiko Matsushita, Hiromitsu Nishizaki, Takehito Utsuro, Yasuhiro Kodama, Seiichi Nakagawa:
Evaluating multiple LVCSR model combination in NTCIR-3 speech-driven web retrieval task. 1205-1208
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Wang03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Wang03
Kuansan Wang:
Semantic object synchronous understanding in SALT for highly interactive user interface. 1209-1212
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KneisslerKK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KneisslerKK03
Jan Kneissler, Anne K. Kienappel, Dietrich Klakow:
Information retrieval based call classification. 1213-1216
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LarsonE03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LarsonE03
Martha A. Larson, Stefan Eickeler:
Using syllable-based indexing features and language models to improve German spoken document retrieval. 1217-1220
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SundaramN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SundaramN03
Shiva Sundaram, Shrikanth S. Narayanan:
An empirical text transformation method for spontaneous speech synthesizers. 1221-1224
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GulAD03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GulAD03
Yilmaz Gul, Aladdin M. Ariyaeeinia, Oliver Dewhirst:
A new approach to reducing alarm noise in speech. 1225-1228
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YuWMMA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YuWMMA03
Dong Yu, Kuansan Wang, Milind Mahajan, Peter Mau, Alex Acero:
Improved name recognition with user modeling. 1229-1232
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BawabLXA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BawabLXA03
Ziad Al Bawab, Ivo Locher, Jianxia Xue, Abeer Alwan:
Speech recognition over bluetooth wireless channels. 1233-1236
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KitayamaGIK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KitayamaGIK03
Koji Kitayama, Masataka Goto, Katunobu Itou, Tetsunori Kobayashi:
Speech starter: noise-robust endpoint detection by using filled pauses. 1237-1240
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BoulianneBCCOD03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BoulianneBCCOD03
Gilles Boulianne, Jean-Francois Beaumont, Patrick Cardinal, Michel Comeau, Pierre Ouellet, Pierre Dumouchel:
Automatic segmentation of film dialogues into phonemes and graphemes. 1241-1244
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BrousseauBBCCCOO03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BrousseauBBCCCOO03
Julie Brousseau, Jean-Francois Beaumont, Gilles Boulianne, Patrick Cardinal, Claude Chapdelaine, Michel Comeau, Frédéric Osterrath, Pierre Ouellet:
Automated closed-captioning of live TV broadcast news in French. 1245-1248
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JanMMZ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JanMMZ03
E. E. Jan, Benoît Maison, Lidia Mangu, Geoffrey Zweig:
Automatic construction of unique signatures and confusable sets for natural language directory assistance applications. 1249-1252
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MengLFHKLLC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MengLFHKLLC03
Helen M. Meng, Yuk-Chi Li, Tien Ying Fung, Man Cheuk Ho, Chi-Kin Keung, Tin Hang Lo, Wai Kit Lo, P. C. Ching:
Recent enhancements in CU VOCAL for Chinese TTS-enabled applications. 1253-1256
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TrancosoNMA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TrancosoNMA03
Isabel Trancoso, João Paulo Neto, Hugo Meinedo, Rui Amaral:
Evaluation of an alert system for selective dissemination of broadcast news. 1257-1260
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MittalAC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MittalAC03
Udar Mittal, James P. Ashley, Edgardo M. Cruz-Zeno:
Low complexity joint optimization of excitation parameters in analysis-by-synthesis speech coding. 1261-1264
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HorlockK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HorlockK03
James Horlock, Simon King:
Named entity extraction from word lattices. 1265-1268
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BelfieldG03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BelfieldG03
William Belfield, Herbert Gish:
A topic classification system based on parametric trajectory mixture models. 1269-1272

Robust Speech Recognition - Front-end Processing

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YaoPN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YaoPN03
Kaisheng Yao, Kuldip K. Paliwal, Satoshi Nakamura:
Model based noisy speech recognition with environment parameters estimated by noise adaptive speech recognition with prior. 1273-1276
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SeltzerDA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SeltzerDA03
Michael L. Seltzer, Jasha Droppo, Alex Acero:
A harmonic-model-based front end for robust speech recognition. 1277-1280
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YapanelH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YapanelH03
Umit H. Yapanel, John H. L. Hansen:
A new perspective on feature extraction for robust in-vehicle speech recognition. 1281-1284
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SekiyaOK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SekiyaOK03
Toshiyuki Sekiya, Tetsuji Ogawa, Tetsunori Kobayashi:
Speech recognition of double talk using SAFIA-based audio segregation. 1285-1288
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangH03
Xianxian Zhang, John H. L. Hansen:
CFA-BF: a novel combined fixed/adaptive beamforming for robust speech recognition in real car environments. 1289-1292
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PotamianosN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PotamianosN03
Gerasimos Potamianos, Chalapathy Neti:
Audio-visual speech recognition in challenging environments. 1293-1296

Spoken Language Processing for e-Inclusion

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KarlssonFS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KarlssonFS03
Inger Karlsson, Andrew Faulkner, Giampiero Salvi:
SYNFACE - a talking face telephone. 1297-1300
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VesnicerZDPM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VesnicerZDPM03
Bostjan Vesnicer, Janez Zibert, Simon Dobrisek, Nikola Pavesic, France Mihelic:
A voice-driven web browser for blind people. 1301-1304
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MullerWB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MullerWB03
Christian A. Müller, Frank Wittig, Jörg Baus:
Exploiting speech for recognizing elderly users to respond to their special needs. 1305-1308
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Newell03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Newell03
Alan F. Newell:
Spoken language and e-inclusion. 1309-1312
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/StemmerHSN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/StemmerHSN03
Georg Stemmer, Christian Hacker, Stefan Steidl, Elmar Nöth:
Acoustic normalization of children's speech. 1313-1316

Language and Accent Identification

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MartinP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MartinP03
Alvin F. Martin, Mark A. Przybocki:
NIST 2003 language recognition evaluation. 1341-1344
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SingerTGCR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SingerTGCR03
Elliot Singer, Pedro A. Torres-Carrasquillo, Terry P. Gleason, William M. Campbell, Douglas A. Reynolds:
Acoustic, phonetic, and discriminative approaches to automatic language identification. 1345-1348
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenM03
Stanley F. Chen, Benoît Maison:
Using place name data to train language identification models. 1349-1352
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AngkititrakulH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AngkititrakulH03
Pongtep Angkititrakul, John H. L. Hansen:
Use of trajectory models for automatic accent classification. 1353-1356
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RamasubramanianJS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RamasubramanianJS03
V. Ramasubramanian, A. K. V. Sai Jayram, T. V. Sreenivas:
Language identification using parallel sub-word recognition - an ergodic HMM equivalence. 1357-1360
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BenZeghibaB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BenZeghibaB03
Mohamed Faouzi BenZeghiba, Hervé Bourlard:
On the combination of speech and speaker recognition. 1361-1364

Speech Recognition - Adaptation 1, 2

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PitzN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PitzN03
Michael Pitz, Hermann Ney:
Vocal tract normalization as linear transformation of MFCC. 1445-1448
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangS03
Zhirong Wang, Tanja Schultz:
Non-native spontaneous speech recognition through polyphone decision tree specialization. 1449-1452
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ArikiSKOF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ArikiSKOF03
Yasuo Ariki, Takeru Shigemori, Tsuyoshi Kaneko, Jun Ogata, Masakiyo Fujimoto:
Live speech recognition in sports games by adaptation of acoustic model and language model. 1453-1456
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OhKRSC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OhKRSC03
Se-Jin Oh, Kwang-Dong Kim, Duk-Gyoo Roh, Woo-Chang Sung, Hyun-Yeol Chung:
Speaker adaptation using regression classes generated by phonetic decision tree-based successive state splitting. 1457-1460
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KimC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimC03
Jiun Kim, Jaeho Chung:
Reduction of dimension of HMM parameters using ICA and PCA in MLLR framework for speaker adaptation. 1461-1464
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangX03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangX03
Huayun Zhang, Bo Xu:
Geometric constrained maximum likelihood linear regression on Mandarin dialect adaptation. 1465-1468
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AkibaIF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AkibaIF03
Tomoyosi Akiba, Katunobu Itou, Atsushi Fujii:
Adapting language models for frequent fixed phrases by emphasizing n-gram subsets. 1469-1472
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Kienappel03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Kienappel03
Anne K. Kienappel:
Learning intra-speaker model parameter correlations from many short speaker segments. 1473-1476
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KamLS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KamLS03
Patgi Kam, Tan Lee, Frank K. Soong:
Modeling Cantonese pronunciation variation by acoustic model refinement. 1477-1480
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ParkSK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ParkSK03
Jong Se Park, Hwa Jeon Song, Hyung Soon Kim:
Performance improvement of rapid speaker adaptation based on eigenvoice and bias compensation. 1481-1484
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FangGLS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FangGLS03
Xiaoshan Fang, Jianfeng Gao, Jianfeng Li, Huanye Sheng:
Training data optimization for language model adaptation. 1485-1488
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AalburgH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AalburgH03
Stefanie Aalburg, Harald Höge:
Approaches to foreign-accented speaker-independent speech recognition. 1489-1492
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YamadeLSS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YamadeLSS03
Shingo Yamade, Akinobu Lee, Hiroshi Saruwatari, Kiyohiro Shikano:
Unsupervised speaker adaptation based on HMM sufficient statistics in various noisy environments. 1493-1496
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LauriIFK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LauriIFK03
Fabrice Lauri, Irina Illina, Dominique Fohr, Filipp Korkmazsky:
Using genetic algorithms for rapid speaker adaptation. 1497-1500
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BarreaudIFK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BarreaudIFK03
Vincent Barreaud, Irina Illina, Dominique Fohr, Filipp Korkmazsky:
Structural state-based frame synchronous compensation. 1501-1504
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LawsonHG03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LawsonHG03
Aaron D. Lawson, David M. Harris, John J. Grieco:
Effect of foreign accent on speech recognition in the NATO n-4 corpus. 1505-1508
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NedelS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NedelS03
Jon P. Nedel, Richard M. Stern:
Duration normalization and hypothesis combination for improved spontaneous speech recognition. 1509-1512
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChouH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChouH03
Wu Chou, Xiaodong He:
Maximum a posteriori linear regression (MAPLR) variance adaptation for continuous density HMMS. 1513-1516
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MyrvollS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MyrvollS03
Tor André Myrvoll, Frank K. Soong:
On divergence based clustering of normal distributions and its application to HMM adaptation. 1517-1520
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Balakrishnan03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Balakrishnan03
Sreeram V. Balakrishnan:
Fast incremental adaptation using maximum likelihood regression and stochastic gradient descent. 1521-1524
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AxelrodGKVG03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AxelrodGKVG03
Scott Axelrod, Vaibhava Goel, Brian Kingsbury, Karthik Visweswariah, Ramesh A. Gopinath:
Large vocabulary conversational speech recognition with a subspace constraint on inverse covariance matrices. 1613-1616
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JangJY03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JangJY03
Gyucheol Jang, Minho Jin, Chang D. Yoo:
Speaker adaptation based on confidence-weighted training. 1617-1620
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AbadNHP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AbadNHP03
Alberto Abad, Climent Nadeu, Javier Hernando, Jaume Padrell:
Jacobian adaptation based on the frequency-filtered spectral energies. 1621-1624
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MatroufBNLB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MatroufBNLB03
Driss Matrouf, Olivier Bellot, Pascal Nocera, Georges Linarès, Jean-François Bonastre:
Structural linear model-space transformations for speaker adaptation. 1625-1628
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HeC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HeC03
Xiaodong He, Wu Chou:
Minimum classification error (MCE) model adaptation of continuous density HMMS. 1629-1632
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GunawardanaA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GunawardanaA03
Asela Gunawardana, Alex Acero:
Adapting acoustic models to new domains and conditions using untranscribed data. 1633-1636

Speech Resources and Standards

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BijankhanSRZGG03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BijankhanSRZGG03
Mahmood Bijankhan, Javad Sheykhzadegan, Mahmood R. Roohani, Rahman Zarrintare, Seyyed Z. Ghasemi, Mohammad E. Ghasedi:
Tfarsdat - the telephone farsi speech database. 1525-1528
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HartikainenMMSZ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HartikainenMMSZ03
Elviira Hartikainen, Giulio Maltese, Asunción Moreno, Shaunie Shammass, Ute Ziegenhain:
Large lexica for speech-to-speech translation: from specification to creation. 1529-1532
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OflazerI03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OflazerI03
Kemal Oflazer, Sharon Inkelas:
A pronunciation lexicon for turkish based on two-level morphology. 1533-1536
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhengL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhengL03
Hong Zheng, Yiqing Lu:
Using both global and local hidden Markov models for automatic speech unit segmentation. 1537-1540
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HeuvelCHMOM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HeuvelCHMOM03
Henk van den Heuvel, Khalid Choukri, Harald Höge, Bente Maegaard, Jan Odijk, Valérie Mapelli:
Quality control of language resources at ELRA. 1541-1544
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BaelBSH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BaelBSH03
Christophe Van Bael, Diana Binnenpoorte, Helmer Strik, Henk van den Heuvel:
Validation of phonetic transcriptions based on recognition performance. 1545-1548
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HernaezLNZGS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HernaezLNZGS03
Inmaculada Hernáez, Iker Luengo, Eva Navas, Maria Luisa Zubizarreta, Iñaki Gaminde, Jon Sánchez:
The basque speech_dat (II) database: a description and first test recognition results. 1549-1552
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MaaseHKWH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MaaseHKWH03
Jens Maase, Diane Hirschfeld, Uwe Koloska, Timo Westfeld, Jörg Helbig:
Towards an evaluation standard for speech control concepts in real-world scenarios. 1553-1556
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Draxler03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Draxler03
Christoph Draxler:
Orientel: recording telephone speech of turkish speakers in Germany. 1557-1560
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BackfriedC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BackfriedC03
Gerhard Backfried, Roser Jaquemot Caldes:
Spanish broadcast news transcription. 1561-1564
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DigalakisOPTVCD03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DigalakisOPTVCD03
Vassilios Digalakis, Dimitris Oikonomidis, Dimitris Pratsolis, Nikos Tsourakis, Christos Vosnidis, Nikos Chatzichrisafis, Vassilios Diakoloukas:
Large vocabulary continuous speech recognition in greek: corpus and an automatic dictation system. 1565-1568
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DaubiasD03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DaubiasD03
Philippe Daubias, Paul Deléglise:
The LIUM-AVS database : a corpus to test lip segmentation and speechreading systems in natural conditions. 1569-1572
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SalorPD03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SalorPD03
Özgül Salor, Bryan L. Pellom, Mübeccel Demirekler:
Implementation and evaluation of a text-to-speech synthesis system for turkish. 1573-1576
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KolarRP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KolarRP03
Jáchym Kolár, Jan Romportl, Josef Psutka:
The czech speech and prosody database both for ASR and TTS purposes. 1577-1580
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KishidaIYMKI03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KishidaIYMKI03
Itsuki Kishida, Yuki Irie, Yukiko Yamaguchi, Shigeki Matsubara, Nobuo Kawaguchi, Yasuyoshi Inagaki:
Construction of an advanced in-car spoken dialogue corpus and its characteristic analysis. 1581-1584
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JonesWGWFRZ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JonesWGWFRZ03
Douglas A. Jones, Florian Wolf, Edward Gibson, Elliott Williams, Evelina Fedorenko, Douglas A. Reynolds, Marc A. Zissman:
Measuring the readability of automatic speech-to-text transcripts. 1585-1588
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ManaBCBMMM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ManaBCBMMM03
Nadia Mana, Susanne Burger, Roldano Cattoni, Laurent Besacier, Victoria MacLaren, John W. McDonough, Florian Metze:
The NESPOLE! voIP multilingual corpora in tourism and medical domains. 1589-1592
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ConejeroGABPCM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ConejeroGABPCM03
David Conejero, Jesús Giménez, Victoria Arranz, Antonio Bonafonte, Neus Pascual, Núria Castell, Asunción Moreno:
Lexica and corpora for speech-to-speech translation: a trilingual approach. 1593-1596
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CieriMW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CieriMW03
Christopher Cieri, David Miller, Kevin Walker:
From switchboard to fisher: telephone collection protocols, their uses and yields. 1597-1600
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MeisterLM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MeisterLM03
Einar Meister, Jürgen Lasn, Lya Meister:
Development of the estonian speechdat-like database. 1601-1604
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SerralheiroTCCCG03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SerralheiroTCCCG03
António Joaquim Serralheiro, Isabel Trancoso, Diamantino Caseiro, Teresa Chambel, Luís Carriço, Nuno Guimarães:
Towards a repository of digital talking books. 1605-1608
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/StrasselMWC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/StrasselMWC03
Stephanie M. Strassel, David Miller, Kevin Walker, Christopher Cieri:
Shared resources for robust speech-to-text technology. 1609-1612

Towards Synthesizing Expressive Speech

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Campbell03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Campbell03
Nick Campbell:
Towards synthesising expressive speech; designing and collecting expressive speech data. 1637-1640
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BanzigerMS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BanzigerMS03
Tanja Bänziger, Michel Morel, Klaus R. Scherer:
Is there an emotion signature in intonational patterns? and can it be used in synthesis? 1641-1644
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/EideBHP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/EideBHP03
Ellen Eide, Raimo Bakis, Wael Hamza, John F. Pitrelli:
Multilayered extensions to the speech synthesis markup language for describing expressiveness. 1645-1648
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Black03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Black03
Alan W. Black:
Unit selection and emotional speech. 1649-1652
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/dAlessandroD03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/dAlessandroD03
Christophe d'Alessandro, Boris Doval:
Voice quality modification for emotional speech synthesis. 1653-1656
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SantenBCKKMVN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SantenBCKKMVN03
Jan P. H. van Santen, Lois M. Black, Gilead Cohen, Alexander Kain, Esther Klabbers, Taniya Mishra, Jacques de Villiers, Xiaochuan Niu:
Applications of computer generated expressive speech for communication disorders. 1657-1660

Speaker Verification

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Leeuwen03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Leeuwen03
David A. van Leeuwen:
Speaker verification systems and security considerations. 1661-1664
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HebertH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HebertH03
Matthieu Hébert, Larry P. Heck:
Phonetic class-based speaker verification. 1665-1668
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SuhadiSFB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SuhadiSFB03
Suhadi Suhadi, Sorel Stan, Tim Fingscheidt, Christophe Beaugeant:
An evaluation of VTS and IMM for speaker verification in noise. 1669-1672
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GanchevTVF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GanchevTVF03
Todor Ganchev, Dimitris K. Tasoulis, Michael N. Vrahatis, Nikos Fakotakis:
Locally recurrent probabilistic neural network for text-independent speaker verification. 1673-1676
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiZMSC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiZMSC03
Stan Z. Li, Dong Zhang, Chengyuan Ma, Heung-Yeung Shum, Eric Chang:
Learning to boost GMM based speaker verification. 1677-1680
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YuMSK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YuMSK03
Eric W. M. Yu, Man-Wai Mak, Chin-Hung Sit, Sun-Yuan Kung:
Speaker verification based on g.729 and g.723.1 coder parameters and handset mismatch compensation. 1681-1684

Dialog System Generation

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WhittakerWM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WhittakerWM03
Stephen Whittaker, Marilyn A. Walker, Preetam Maloor:
Should i tell all?: an experiment on conciseness in spoken dialogue. 1685-1688
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MengYMC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MengYMC03
Helen M. Meng, Wing Lin Yip, Oi Yan Mok, Shuk Fong Chan:
Natural language response generation in mixed-initiative dialogs using task goals and dialog acts. 1689-1692
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HiroseTM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HiroseTM03
Keikichi Hirose, Junji Tago, Nobuaki Minematsu:
Speech generation from concept for realizing conversation with an agent in a virtual room. 1693-1696
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WalkerPS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WalkerPS03
Marilyn A. Walker, Rashmi Prasad, Amanda Stent:
A trainable generator for recommendations in multimodal dialog. 1697-1700
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KawaharaIK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KawaharaIK03
Tatsuya Kawahara, Ryosuke Ito, Kazunori Komatani:
Spoken dialogue system for queries on appliance manuals using hierarchical confirmation strategy. 1701-1704
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Kallulli03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Kallulli03
Dalina Kallulli:
SAG: a procedural tactical generator for dialog systems. 1705-1708

Robust Speech Recognition 1-4

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LuoD03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LuoD03
Yu Luo, Limin Du:
A hidden Markov model-based missing data imputation approach. 1765-1768
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YamadaOTKFKYNMN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YamadaOTKFKYNMN03
Takeshi Yamada, Jiro Okada, Kazuya Takeda, Norihide Kitaoka, Masakiyo Fujimoto, Shingo Kuroiwa, Kazumasa Yamamoto, Takanobu Nishiura, Mitsunori Mizumachi, Satoshi Nakamura:
Integration of noise reduction algorithms for Aurora2 task. 1769-1772
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SinghWRL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SinghWRL03
Rita Singh, Manfred K. Warmuth, Bhiksha Raj, Paul Lamere:
Classification with free energy at raised temperatures. 1773-1776
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DingSFC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DingSFC03
Pei Ding, Bertram E. Shi, Pascale Fung, Zhigang Cao:
Flooring the observation probability for robust ASR in impulsive noise. 1777-1780
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FujimotoA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FujimotoA03
Masakiyo Fujimoto, Yasuo Ariki:
Combination of temporal domain SVD based speech enhancement and GMM based speech estimation for ASR in noise - evaluation on the AURORA2 task -. 1781-1784
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FousekP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FousekP03
Petr Fousek, Petr Pollák:
Additive noise and channel distortion-robust parametrization tool - performance evaluation on Aurora 2 & 3. 1785-1788
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DupontR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DupontR03
Stéphane Dupont, Christophe Ris:
Robust feature extraction and acoustic modeling at multitel: experiments on the Aurora databases. 1789-1792
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KotnikKH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KotnikKH03
Bojan Kotnik, Zdravko Kacic, Bogomir Horvat:
Noise robust speech parameterization based on joint wavelet packet decomposition and autoregressive modeling. 1793-1796
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CouvreurGLSV03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CouvreurGLSV03
Christophe Couvreur, Oren Gedge, Klaus Linhard, Shaunie Shammass, Johan Vantieghem:
Database adaptation for ASR in cross-environmental conditions in the SPEECON project. 1797-1800
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MotlicekC03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MotlicekC03a
Petr Motlícek, Jan Cernocký:
Autoregressive modeling based feature extraction for Aurora3 DSR task. 1801-1804
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TrentinMG03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TrentinMG03
Edmondo Trentin, Marco Matassoni, Marco Gori:
Evaluation on the Aurora 2 database of acoustic models that are less noise-sensitive. 1805-1808
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GuarasaOMFCD03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GuarasaOMFCD03
Javier Macías Guarasa, J. Ordonez, Juan Manuel Montero, Javier Ferreiros, Ricardo de Córdoba, Luis Fernando D'Haro:
Revisiting scenarios and methods for variable frame rate analysis in automatic speech recognition. 1809-1812
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ParveenG03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ParveenG03
Shahla Parveen, Phil D. Green:
Multitask learning in connectionist robust ASR using recurrent neural networks. 1813-1816
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MisraM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MisraM03
Hemant Misra, Andrew C. Morris:
Confusion matrix based entropy correction in multi-stream combination. 1817-1820
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangHX03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangHX03
Huayun Zhang, Zhaobing Han, Bo Xu:
Dynamic channel compensation based on maximum a posteriori estimation. 2137-2140
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FernandezGM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FernandezGM03
Laura Docío Fernández, David Gelbart, Nelson Morgan:
Far-field ASR on inexpensive microphones. 2141-2144
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TsugeKK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TsugeKK03
Satoru Tsuge, Shingo Kuroiwa, Kenji Kita:
Evaluation of ETSI advanced DSR front-end and bias removal method on the Japanese newspaper article sentences speech corpus. 2145-2148
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SoonABR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SoonABR03
Chng Chin Soon, Bernt Andrassy, Josef G. Bauer, Günther Ruske:
Environment adaptive control of noise reduction parameters for improved robustness of ASR. 2149-2152
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DendaNK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DendaNK03
Yuki Denda, Takanobu Nishiura, Hideki Kawahara:
Speech enhancement with microphone array and fourier / wavelet spectral subtraction in real noisy environments. 2153-2156
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NishiuraNMS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NishiuraNMS03
Takanobu Nishiura, Satoshi Nakamura, Kazuhiro Miki, Kiyohiro Shikano:
Environmental sound source identification based on hidden Markov model for robust speech recognition. 2157-2160
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JancovicKM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JancovicKM03
Peter Jancovic, Münevver Köküer, Fionn Murtagh:
High-likelihood model based on reliability statistics for robust combination of features: application to noisy speech recognition. 2161-2164
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DemirogluA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DemirogluA03
Cenk Demiroglu, David V. Anderson:
Noise robust digit recognition with missing frames. 2165-2168
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CuiBA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CuiBA03
Xiaodong Cui, Alexis Bernard, Abeer Alwan:
A noise-robust ASR back-end technique based on weighted viterbi recognition. 2169-2172
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GhulamFN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GhulamFN03
Muhammad Ghulam, Takashi Fukuda, Tsuneo Nitta:
Voice quality normalization in an utterance for robust ASR. 2173-2176
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AkbacakH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AkbacakH03
Murat Akbacak, John H. L. Hansen:
Environmental sniffing: robust digit recognition for an in-vehicle environment. 2177-2180
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Hwang03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Hwang03
Tai-Hwei Hwang:
Energy contour extraction for in-car speech recognition. 2181-2184
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FukudaN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FukudaN03
Takashi Fukuda, Tsuneo Nitta:
Noise-robust ASR by using distinctive phonetic features approximated with logarithmic normal distribution of HMM. 2185-2188
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FukudaN03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FukudaN03a
Takashi Fukuda, Tsuneo Nitta:
Noise-robust automatic speech recognition using orthogonalized distinctive phonetic feature vectors. 2189-2192
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YomaBS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YomaBS03
Néstor Becerra Yoma, Ivan Brito, Jorge F. Silva:
Language model accuracy and uncertainty in noise cancelling in the stochastic weighted viterbi algorithm. 2193-2196
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/EnemanDMCH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/EnemanDMCH03
Koen Eneman, Jacques Duchateau, Marc Moonen, Dirk Van Compernolle, Hugo Van hamme:
Assessment of dereverberation algorithms for large vocabulary speech recognition systems. 2689-2692
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MilnerJ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MilnerJ03
Ben P. Milner, Alastair Bruce James:
Analysis and compensation of packet loss in distributed speech recognition using interleaving. 2693-2696
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Milner03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Milner03
Ben P. Milner:
Non-linear compression of feature vectors using transform coding and non-uniform bit allocation. 2697-2700
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChienF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChienF03
Jen-Tzung Chien, Sadaoki Furui:
Predictive hidden Markov model selection for decision tree state tying. 2701-2704
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NakadaiMOT03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NakadaiMOT03
Kazuhiro Nakadai, Daisuke Matsuura, Hiroshi G. Okuno, Hiroshi Tsujino:
Three simultaneous speech recognition by integration of active audition and face recognition for humanoid. 2705-2708
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FujinagaKYKS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FujinagaKYKS03
Katsuhisa Fujinaga, Hiroaki Kokubo, Hirofumi Yamamoto, Gen-ichiro Kikui, Hiroshi Shimodaira:
Mis-recognized utterance detection using multiple language models generated by clustered sentences. 2709-2712
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SunZZX03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SunZZX03
Hui Sun, Guoliang Zhang, Fang Zheng, Mingxing Xu:
Using word confidence measure for OOV words detection in a spontaneous spoken dialog system. 2713-2716
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ManabeHS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ManabeHS03
Hiroyuki Manabe, Akira Hiraiwa, Toshiaki Sugimura:
Speech recognition using EMG; mime speech recognition. 2717-2720
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JitsuhiroMN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JitsuhiroMN03
Takatoshi Jitsuhiro, Tomoko Matsui, Satoshi Nakamura:
Automatic generation of non-uniform context-dependent HMM topologies based on the MDL criterion. 2721-2724
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KitaokaSN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KitaokaSN03
Norihide Kitaoka, Masahisa Shingu, Seiichi Nakagawa:
Comparison of effects of acoustic and language knowledge on spontaneous speech perception/recognition between human and automatic speech recognizer. 2725-2728
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Gorrell03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Gorrell03
Genevieve Gorrell:
Using statistical language modelling to identify new vocabulary in a grammar-based speech recognition system. 2729-2732
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GomezPSR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GomezPSR03
Angel M. Gomez, Antonio M. Peinado, Victoria E. Sánchez, Antonio J. Rubio:
A source model mitigation technique for distributed speech recognition over lossy packet channels. 2733-2736
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RussellJ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RussellJ03
Martin J. Russell, Philip J. B. Jackson:
The effect of an intermediate articulatory layer on the performance of a segmental HMM. 2737-2740
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuF03
Yi Liu, Pascale Fung:
Automatic phone set extension with confidence measure for spontaneous speech. 2741-2744
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ParedesSVJ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ParedesSVJ03
Roberto Paredes, Alberto Sanchís, Enrique Vidal, Alfons Juan:
Utterance verification using an optimized k-nearest neighbour classifier. 2745-2748
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FuL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FuL03
Guokang Fu, Ta-Hsin Li:
A segment-based algorithm of speech enhancement for robust speech recognition. 3029-3032
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GemelloMAM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GemelloMAM03
Roberto Gemello, Franco Mana, Dario Albesano, Renato de Mori:
Robust multiple resolution analysis for automatic speech recognition. 3033-3036
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Afify03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Afify03
Mohamed Afify:
An accurate noise compensation algorithm in the log-spectral domain for robust speech recognition. 3037-3040
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RamirezSBTR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RamirezSBTR03
Javier Ramírez, José C. Segura, M. Carmen Benítez, Ángel de la Torre, Antonio J. Rubio:
A new adaptive long-term spectral estimation voice activity detector. 3041-3044
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Carey03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Carey03
Michael J. Carey:
Robust speech recognition using non-linear spectral smoothing. 3045-3048
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MiaoW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MiaoW03
Cailian Miao, Yangsheng Wang:
A novel use of residual noise model for modified PMC. 3049-3052
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CerisaraI03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CerisaraI03
Christophe Cerisara, Irina Illina:
Robust speech recognition to non-stationary noise based on model-driven approaches. 3053-3056
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Cerisara03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Cerisara03
Christophe Cerisara:
Towards missing data recognition with cepstral features. 3057-3060
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HaverinenK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HaverinenK03
Hemmo Haverinen, Imre Kiss:
On-line parametric histogram equalization techniques for noise robust embedded speech recognition. 3061-3064
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YuW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YuW03
An-Tze Yu, Hsiao-Chuan Wang:
Compensation of channel distortion in line spectrum frequency domain. 3065-3068
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MartinM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MartinM03
Arnaud Martin, Laurent Mauuary:
Voicing parameter and energy based speech/non-speech detection for speech recognition in adverse conditions. 3069-3072
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Hamme03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Hamme03
Hugo Van hamme:
Two correction models for likelihoods in robust speech recognition using missing feature theory. 3073-3076
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SujathaKRB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SujathaKRB03
J. Sujatha, K. R. Prasanna Kumar, K. R. Ramakrishnan, N. Balakrishnan:
Spectral maxima representation for robust automatic speech recognition. 3077-3080
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/EndoKN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/EndoKN03
Toshiki Endo, Shingo Kuroiwa, Satoshi Nakamura:
Missing feature theory applied to robust speech recognition over IP network. 3081-3084
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TolbaSO03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TolbaSO03
Hesham Tolba, Sid-Ahmed Selouani, Douglas D. O'Shaughnessy:
Comparative experiments to evaluate the use of auditory-based acoustic distinctive features and formant cues for robust automatic speech recognition in low-SNR car environments. 3085-3088
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Hamme03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Hamme03a
Hugo Van hamme:
Robust speech recognition using missing feature theory in the cepstral or LDA domain. 3089-3092
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiaoLT03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiaoLT03
Yuan-Fu Liao, Jeng-Shien Lin, Wei-Ho Tsai:
Bandwidth mismatch compensation for robust speech recognition. 3093-3096
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MorrisAC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MorrisAC03
Robert W. Morris, Jon A. Arrowood, Mark A. Clements:
Markov chain monte carlo methods for noise robust feature extraction using the autoregressive model. 3097-3100
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HilarioC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HilarioC03
Joan Marí Hilario, Fritz Class:
A comparative study of some discriminative feature reduction algorithms on the AURORA 2000 and the daimlerchrysler in-car ASR tasks. 3101-3104

Speech Recognition - Large Vocabulary 1, 2

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PsutkaIPRBHMG03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PsutkaIPRBHMG03
Josef Psutka, Pavel Ircing, Josef V. Psutka, Vlasta Radová, William J. Byrne, Jan Hajic, Jirí Mírovský, Samuel Gustman:
Large vocabulary ASR for spontaneous czech in the MALACH project. 1821-1824
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RiccardiH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RiccardiH03
Giuseppe Riccardi, Dilek Hakkani-Tür:
Active and unsupervised learning for automatic speech recognition. 1825-1828
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YapanelDH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YapanelDH03
Umit H. Yapanel, Satya Dharanipragada, John H. L. Hansen:
Perceptual MVDR-based cepstral coefficients (PMCCs) for high accuracy speech recognition. 1829-1832
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GaoL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GaoL03
Sheng Gao, Chin-Hui Lee:
A discriminative decision tree learning approach to acoustic modeling. 1833-1836
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NguyenRJ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NguyenRJ03
Patrick Nguyen, Luca Rigazio, Jean-Claude Junqua:
Large corpus experiments for broadcast news recognition. 1837-1840
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JitapunkulMAL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JitapunkulMAL03
Somchai Jitapunkul, Ekkarit Maneenoi, Visarut Ahkuputra, Sudaporn Luksaneeyanawin:
Performance evaluation of phonotactic and contextual onset-rhyme models for speech recognition of Thai language. 1841-1844
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/QianLL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/QianLL03
Yao Qian, Tan Lee, Yujia Li:
Overlapped di-tone modeling for tone recognition in continuous Cantonese speech. 1845-1848
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NishidaK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NishidaK03
Masafumi Nishida, Tatsuya Kawahara:
Speaker model selection using Bayesian information criterion for speaker indexing and speaker adaptation. 1849-1852
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SturmKWWSS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SturmKWWSS03
Janienke Sturm, Judith M. Kessens, Mirjam Wester, Febe de Wet, Eric Sanders, Helmer Strik:
Automatic transcription of football commentaries in the MUMIS project. 1853-1856
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Peters03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Peters03
S. Douglas Peters:
On the limits of cluster-based acoustic modeling. 1857-1860
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LyuLCHL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LyuLCHL03
Dau-Cheng Lyu, Min-Siong Liang, Yuang-Chin Chiang, Chun-Nan Hsu, Ren-Yuan Lyu:
Large vocabulary taiwanese (min-nan) speech recognition using tone features and statistical pronunciation modeling. 1861-1864
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DogninE03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DogninE03
Pierre L. Dognin, Amro El-Jaroudi:
A new spectral transformation for speaker normalization. 1865-1868
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YuS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YuS03
Hua Yu, Tanja Schultz:
Enhanced tree clustering with single pronunciation dictionary for conversational speech recognition. 1869-1872
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/IrcingP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/IrcingP03
Pavel Ircing, Josef Psutka:
Fitting class-based language models into weighted finite-state transducer framework. 1873-1876
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LefevreGL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LefevreGL03
Fabrice Lefèvre, Jean-Luc Gauvain, Lori Lamel:
Multi-source training and adaptation for generic speech recognition. 1877-1880
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KingsburyMSZAGVP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KingsburyMSZAGVP03
Brian Kingsbury, Lidia Mangu, George Saon, Geoffrey Zweig, Scott Axelrod, Vaibhava Goel, Karthik Visweswariah, Michael Picheny:
Toward domain-independent conversational speech recognition. 1881-1884
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangR03
Rong Zhang, Alexander I. Rudnicky:
Comparative study of boosting and non-boosting training for constructing ensembles of acoustic models. 1885-1888
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DingCHZX03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DingCHZX03
Peng Ding, Zhenbiao Chen, Sheng Hu, Shuwu Zhang, Bo Xu:
Discriminative optimization of large vocabulary Mandarin conversational speech recognition system. 1965-1968
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SchalkwykHS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SchalkwykHS03
Johan Schalkwyk, I. Lee Hetherington, Ezra Story:
Speech recognition with dynamic grammars using finite-state transducers. 1969-1972
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DemuynckLCH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DemuynckLCH03
Kris Demuynck, Tom Laureys, Dirk Van Compernolle, Hugo Van hamme:
FLavor: a flexible architecture for LVCSR. 1973-1976
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaonZKMC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaonZKMC03
George Saon, Geoffrey Zweig, Brian Kingsbury, Lidia Mangu, Upendra V. Chaudhari:
An architecture for rapid decoding of large vocabulary conversational speech. 1977-1980
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PoveyGKW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PoveyGKW03
Daniel Povey, Mark J. F. Gales, Do Yeong Kim, Philip C. Woodland:
MMI-MAP and MPE-MAP for acoustic model adaptation. 1981-1984
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DoumpiotisTB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DoumpiotisTB03
Vlasios Doumpiotis, Stavros Tsakalidis, William J. Byrne:
Lattice segmentation and minimum Bayes risk discriminative training. 1985-1988

Robust Methods in Processing of Natural Language Dialogues

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Zechner03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Zechner03
Klaus Zechner:
Spoken language condensation in the 21st century. 1989-1992
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Furui03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Furui03
Sadaoki Furui:
Robust methods in automatic speech recognition and understanding. 1993-1998
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Delmonte03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Delmonte03
Rodolfo Delmonte:
Parsing spontaneous speech. 1999-2004

Speaker Identification

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Reynolds03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Reynolds03
Douglas A. Reynolds:
Model compression for GMM based speaker recognition systems. 2005-2008
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NavratilR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NavratilR03
Jirí Navrátil, Ganesh N. Ramaswamy:
The awe and mystery of t-norm. 2009-2012
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BonastreMJ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BonastreMJ03
Jean-François Bonastre, Philippe Morin, Jean-Claude Junqua:
Gaussian dynamic warping (GDW) method applied to text-dependent speaker detection and verification. 2013-2016
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FerrerBGKSSSV03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FerrerBGKSSSV03
Luciana Ferrer, Harry Bratt, Venkata Ramana Rao Gadde, Sachin S. Kajarekar, Elizabeth Shriberg, M. Kemal Sönmez, Andreas Stolcke, Anand Venkataraman:
Modeling duration patterns for speaker recognition. 2017-2020
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LuceyC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LuceyC03
Simon Lucey, Tsuhan Chen:
Improved speaker verification through probabilistic subspace adaptation. 2021-2024
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YuSMC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YuSMC03
Peng Yu, Frank Seide, Chengyuan Ma, Eric Chang:
An improved model-based speaker segmentation system. 2025-2028

Speech Synthesis: Miscellaneous 1, 2

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Bellegarda03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Bellegarda03
Jerome R. Bellegarda:
A latent analogy framework for grapheme-to-phoneme conversion. 2029-2032
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Chen03c
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Chen03c
Stanley F. Chen:
Conditional and joint models for grapheme-to-phoneme conversion. 2033-2036
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PfisterR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PfisterR03
Beat Pfister, Harald Romsdorfer:
Mixed-lingual text analysis for polyglot TTS synthesis. 2037-2040
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangBS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangBS03
Jason Y. Zhang, Alan W. Black, Richard Sproat:
Identifying speakers in children's stories for speech synthesis. 2041-2044
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/StevensLV03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/StevensLV03
Catherine J. Stevens, Nicole Lees, Julie Vonwiller:
Experimental tools to evaluate intelligibility of text-to-speech (TTS) synthesis: effects of voice gender and signal quality. 2045-2048
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TomokiyoBL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TomokiyoBL03
Laura Mayfield Tomokiyo, Alan W. Black, Kevin A. Lenzo:
Arabic in my hand: small-footprint synthesis of egyptian arabic. 2049-2052
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BennettB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BennettB03
Christina L. Bennett, Alan W. Black:
Using acoustic models to choose pronunciation variations for synthetic voices. 2937-2940
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YanVHRT03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YanVHRT03
Qin Yan, Saeed Vaseghi, Ching-Hsiang Ho, Dimitrios Rentzos, Emir Turajlic:
Comparative analysis and synthesis of formant trajectories of british and broad australian accents. 2941-2944
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Ramirez03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Ramirez03
Miguel Arjona Ramírez:
Cycle extraction for perfect reconstruction and rate scalability. 2945-2948
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TeixeiraJM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TeixeiraJM03
António J. S. Teixeira, Luis M. T. Jesus, Roberto Martinez:
Adding fricatives to the portuguese articulatory synthesiser. 2949-2952
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SanzASM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SanzASM03
Ignasi Iriondo Sanz, Francesc Alías, Javier Sanchis, Javier Melenchón:
A hybrid method oriented to concatenative text-to-speech synthesis. 2953-2956
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhaoCPC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhaoCPC03
Yong Zhao, Min Chu, Hu Peng, Eric Chang:
Custom-tailoring TTS voice font - keeping the naturalness when reducing database size. 2957-2960

Speech Perception

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SrinivasanW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SrinivasanW03
Soundararajan Srinivasan, DeLiang Wang:
Schema-based modeling of phonemic restoration. 2053-2056
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Kuwabara03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Kuwabara03
Hisao Kuwabara:
Perception of voice-individuality for distortions of resonance/source characteristics and waveforms. 2057-2060
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Sato03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Sato03
Tsutomu Sato:
The perceptual cues of a high level pitch-accent pattern in Japanese: pitch-accent patterns and duration. 2061-2064
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/IwakiN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/IwakiN03
Mamoru Iwaki, Norio Nakamura:
Illusory continuity of intermittent pure tone in binaural listening and its dependency on interaural time difference. 2065-2068
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MinematsuGH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MinematsuGH03
Nobuaki Minematsu, Changchen Guo, Keikichi Hirose:
CART-based factor analysis of intelligibility reduction in Japanese English. 2069-2072
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TothK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TothK03
László Tóth, András Kocsor:
Harmonic alternatives to sine-wave speech. 2073-2076
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PicoviciM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PicoviciM03
Dorel Picovici, Abdulhussain E. Mahdi:
Non-intrusive assessment of perceptual speech quality using a self-organising map. 2077-2080
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DufourP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DufourP03
Sophie Dufour, Ronald Peereman:
Inhibitory priming effect in auditory word recognition: the role of the phonological mismatch length between primes and targets. 2081-2084
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ScharenborgBB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ScharenborgBB03
Odette Scharenborg, Louis ten Bosch, Lou Boves:
Recognising 'real-life' speech with spem: a speech-based computational model of human speech recognition. 2085-2088
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RosenhouseK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RosenhouseK03
Judith Rosenhouse, Liat Kishon-Rabin:
The effect of speech rate and noise on bilinguals' speech perception: the case of native speakers of arabic in israel. 2089-2092
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TurkA03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TurkA03a
Oytun Türk, Levent M. Arslan:
Subjective evaluations for perception of speaker identity through acoustic feature transplantations. 2093-2096
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ScharenborgMBN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ScharenborgMBN03
Odette Scharenborg, James M. McQueen, Louis ten Bosch, Dennis Norris:
Modelling human speech recognition using automatic speech recognition paradigms in speM. 2097-2100
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SaitoSF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SaitoSF03
Mutsumi Saito, Kimio Shiraishi, Kimitoshi Fukudome:
The effect of amplitude compression on wide band telephone speech for hearing-impaired elderly people. 2101-2104
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OtakeK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OtakeK03
Takashi Otake, Miki Komatsu:
Word activation model by Japanese school children without knowledge of roman alphabet. 2105-2108
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HardingM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HardingM03
Sue Harding, Georg F. Meyer:
Multi-resolution auditory scene analysis: robust speech recognition using pattern-matching from a noisy signal. 2109-2112
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MatsuiK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MatsuiK03
Hisami Matsui, Hideki Kawahara:
Investigation of emotionally morphed speech perception and its structure using a high quality speech manipulation system. 2113-2116
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PaliwalA03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PaliwalA03a
Kuldip K. Paliwal, Leigh D. Alsteris:
Usefulness of phase spectrum in human speech perception. 2117-2120
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Tokuma03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Tokuma03
Shinichi Tokuma:
Perception of English lexical stress by English and Japanese speakers: effect of duration and "realistic" intensity change. 2121-2124
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Welby03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Welby03
Pauline Welby:
French intonational rises and their role in speech seg mentation [sic]. 2125-2128
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Tokuma03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Tokuma03a
Won Tokuma:
Physical and perceptual configurations of Japanese fricatives from multidimensional scaling analyses. 2129-2132
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Au03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Au03
Ching-Pong Au:
An acquisition model of speech perception with considerations of temporal information. 2133-2136

Multi-Modal Processing and Speech Interface Design

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PotamitisGFK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PotamitisGFK03
Ilyas Potamitis, Kallirroi Georgila, Nikos Fakotakis, George K. Kokkinakis:
An integrated system for smart-home control of appliances based on remote speech interaction. 2197-2200
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JinRCCLT03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JinRCCLT03
Jianhong Jin, Martin J. Russell, Michael J. Carey, James Chapman, Harvey Lloyd-Thomas, Graham Tattersall:
A spoken language interface to an electronic programme guide. 2201-2204
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LopesTRGTFSGS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LopesTRGTFSGS03
Luís Seabra Lopes, António J. S. Teixeira, Mário Rodrigues, Diogo Gomes, Cláudio Teixeira, Liliana da Silva Ferreira, Pedro Filipe Soares, João Girão, Nuno Sénica:
Towards a personal robot with language interface. 2205-2208
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WilliamsSPA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WilliamsSPA03
Jason D. Williams, Andrew T. Shaw, Lawrence Piano, Michael Abt:
Preference, perception, and task completion of open, menu-based, and directed prompts for call routing: a case study. 2209-2212
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HatzisGCCPPO03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HatzisGCCPPO03
Athanassios Hatzis, Phil D. Green, James Carmichael, Stuart P. Cunningham, Rebecca Palmer, Mark Parker, Peter O'Neill:
An integrated toolkit deploying speech technology for computer based speech training with application to dysarthric speakers. 2213-2216
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Suhm03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Suhm03
Bernhard Suhm:
Towards best practices for speech user interface design. 2217-2220
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/StallardMCMNSZ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/StallardMCMNSZ03
David Stallard, John Makhoul, Fred Choi, Ehry MacRostie, Premkumar Natarajan, Richard M. Schwartz, Bushra Zawaydeh:
Design and evaluation of a limited two-way speech translator. 2221-2224
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DusanGF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DusanGF03
Sorin Dusan, Gregory J. Gadbois, James L. Flanagan:
Multimodal interaction on PDA's integrating speech and pen inputs. 2225-2228
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GieselmannD03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GieselmannD03
Petra Gieselmann, Matthias Denecke:
Towards multimodal interaction with an intelligent room. 2229-2232
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PieracciniDBDPGP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PieracciniDBDPGP03
Roberto Pieraccini, Krishna Dayanidhi, Jonathan Bloom, Jean-Gui Dahan, Michael Phillips, Bryan R. Goodman, K. Venkatesh Prasad:
A multimodal conversational interface for a concept vehicle. 2233-2236
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MaSM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MaSM03
Ling Ma, Dan J. Smith, Ben P. Milner:
Context awareness using environmental noise classification. 2237-2240
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShiraishiTKSS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShiraishiTKSS03
Tatsuya Shiraishi, Tomoki Toda, Hiromichi Kawanami, Hiroshi Saruwatari, Kiyohiro Shikano:
Simple designing methods of corpus-based visual speech synthesis. 2241-2244
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SturmBCT03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SturmBCT03
Janienke Sturm, Ilse Bakx, Bert Cranen, Jacques M. B. Terken:
Comparing the usability of a user driven and a mixed initiative multimodal dialogue system for train timetable information. 2245-2248
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MassaroL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MassaroL03
Dominic W. Massaro, Joanna Light:
Read my tongue movements: bimodal learning to perceive and produce non-native speech /r/ and /l/. 2249-2252
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PerezLF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PerezLF03
Jesus F. Guitarte Perez, Klaus Lukas, Alejandro F. Frangi:
Low resource lip finding and tracking algorithm for embedded devices. 2253-2256
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AsanoMAYIYKN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AsanoMAYIYKN03
Futoshi Asano, Yoichi Motomura, Hideki Asoh, Takashi Yoshimura, Naoyuki Ichimura, Kiyoshi Yamamoto, Nobuhiko Kitawaki, Satoshi Nakamura:
Detection and separation of speech segment using audio and video information fusion. 2257-2260
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/EngwallB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/EngwallB03
Olov Engwall, Jonas Beskow:
Resynthesis of 3d tongue movements from facial data. 2261-2264
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TrippelSHG03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TrippelSHG03
Thorsten Trippel, Felix Sasaki, Benjamin Hell, Dafydd Gibbon:
Acquiring lexical information from multilevel temporal annotations. 2265-2268
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CosiFT03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CosiFT03
Piero Cosi, Andrea Fusaro, Graziano Tisato:
LUCIA a new italian talking-head based on a modified cohen-massaro's labial coarticulation model. 2269-2272
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MukherjeeR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MukherjeeR03
Niloy Mukherjee, Deb Roy:
A visual context-aware multimodal system for spoken language processing. 2273-2276

Speech Recognition - Language Modeling

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PiantanidaE03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PiantanidaE03
Juan P. Piantanida, Claudio Estienne:
Maximum entropy good-turing estimator for language modeling. 2277-2280
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiZ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiZ03
Xiaolong Li, Yunxin Zhao:
Exploiting order-preserving perfect hashing to speedup n-gram language model lookahead. 2281-2284
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OikonomidisD03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OikonomidisD03
Dimitris Oikonomidis, Vassilios Digalakis:
Stem-based maximum entropy language models for inflectional languages. 2285-2288
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KrbecPH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KrbecPH03
Pavel Krbec, Petr Podveský, Jan Hajic:
Combination of a hidden tag model and a traditional n-gram model: a case study in czech speech recognition. 2289-2292
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SiivolaHCK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SiivolaHCK03
Vesa Siivola, Teemu Hirsimäki, Mathias Creutz, Mikko Kurimo:
Unlimited vocabulary speech recognition based on morphs discovered in an unsupervised manner. 2293-2296
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SzarvasF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SzarvasF03
Máté Szarvas, Sadaoki Furui:
Evaluation of the stochastic morphosyntactic language model on a one million word hungarian dictation task. 2297-2300

Feature Analysis and Cross-Language Processing of Chinese Spoken Language

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeC03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeC03a
Lin-Shan Lee, Shun-Chuan Chen:
Automatic title generation for Chinese spoken documents considering the special structure of the language. 2325-2328
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XuZZ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XuZZ03
Bo Xu, Shuwu Zhang, Chengqing Zong:
Statistical speech-to-speech translation with multilingual speech recognition and bilingual-chunk parsing. 2329-2332
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DuC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DuC03
Limin Du, Boxing Chen:
Automatic extraction of bilingual chunk lexicon for spoken language translation. 2333-2336
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LoLLWM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LoLLWM03
Wai Kit Lo, Yuk-Chi Li, Gina-Anne Levow, Hsin-Min Wang, Helen M. Meng:
Multi-scale document expansion in English-Mandarin cross-language spoken document retrieval. 2337-2340
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Tseng03b
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Tseng03b
Chiu-yu Tseng:
Mandarin speech prosody: issues, pitfalls and directions. 2341-2344
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiW03
Aijun Li, Xia Wang:
A contrastive investigation of standard Mandarin and accented Mandarin. 2345-2348
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Tao03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Tao03
Jianhua Tao:
Emotion control of Chinese speech synthesis in natural environment. 2349-2352

Speech Production and Physiology

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeonovS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeonovS03
Alexander S. Leonov, Victor N. Sorokin:
Optimality criteria in inverse problems for tongue-jaw interaction. 2353-2356
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SasakiMM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SasakiMM03
Koji Sasaki, Nobuhiro Miki, Yoshikazu Miyanaga:
FEM analysis based on 3-d time-varying vocal tract shape. 2357-2360
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DangH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DangH03
Jianwu Dang, Kiyoshi Honda:
Consideration of muscle co-contraction in a physiological articulatory model. 2361-2364
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ManfrediP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ManfrediP03
Claudia Manfredi, Giorgio Peretti:
Robust techniques for pre- and post-surgical voice analysis. 2365-2368
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SchnellL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SchnellL03
Karl Schnell, Arild Lacroix:
Analysis of lossy vocal tract models for speech production. 2369-2372
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Khioe03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Khioe03
Beatrice Fung-Wah Khioe:
Temporal properties of the nasals and nasalization in Cantonese. 2373-2376
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BettensGS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BettensGS03
Frédéric Bettens, Francis Grenez, Jean Schoentgen:
Estimation of vocal noise in running speech by means of bi-directional double linear prediction. 2377-2380
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Mahdi03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Mahdi03
Abdulhussain E. Mahdi:
Visualisation of the vocal tract based on estimation of vocal area functions and formant frequencies. 2381-2384
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Sciamarellad03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Sciamarellad03
Denisse Sciamarella, Christophe d'Alessandro:
Reproducing laryngeal mechanisms with a two-mass model. 2385-2388
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BostikS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BostikS03
Milan Bostik, Milan Sigmund:
Methods for estimation of glottal pulses waveforms exciting voiced speech. 2389-2392
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangET03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangET03
Zhaoyan Zhang, Carol Y. Espy-Wilson, Mark Tiede:
Acoustic modeling of american English lateral approximants. 2393-2396
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TakanoHMSF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TakanoHMSF03
Sayoko Takano, Kiyoshi Honda, Shinobu Masaki, Yasuhiro Shimada, Ichiro Fujimoto:
Translation and rotation of the cricothyroid joint revealed by phonation-synchronized high-resolution MRI. 2397-2400

Speech Synthesis: Voice Conversion and Miscellaneous Topics

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KawanamiITSS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KawanamiITSS03
Hiromichi Kawanami, Yohei Iwami, Tomoki Toda, Hiroshi Saruwatari, Kiyohiro Shikano:
GMM-based voice conversion applied to emotional speech synthesis. 2401-2404
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RentzosVYHT03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RentzosVYHT03
Dimitrios Rentzos, Saeed Vaseghi, Qin Yan, Ching-Hsiang Ho, Emir Turajlic:
Probability models of formant parameters for voice conversion. 2405-2408
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YeY03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YeY03
Hui Ye, Steve J. Young:
Perceptually weighted linear transformations for voice conversion. 2409-2412
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenCCLL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenCCLL03
Yining Chen, Min Chu, Eric Chang, Jia Liu, Runsheng Liu:
Voice conversion with smoothed GMM and MAP adaptation. 2413-2416
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SalorDP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SalorDP03
Özgül Salor, Mübeccel Demirekler, Bryan L. Pellom:
A system for voice conversion based on adaptive filtering and line spectral frequency distance optimization for text-to-speech synthesis. 2417-2420
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MoriK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MoriK03
Hiroki Mori, Hideki Kasuya:
Speaker conversion in ARX-based source-formant type speech synthesis. 2421-2424
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BreenME03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BreenME03
Andrew P. Breen, Steve Minnis, Barry Eggleton:
Implementing an SSML compliant concatenative TTS system. 2425-2428
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GuMK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GuMK03
Zhenglai Gu, Hiroki Mori, Hideki Kasuya:
Acoustic variations of focused disyllabic words in Mandarin Chinese: analysis, synthesis and perception. 2429-2432
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Quintana-MoralesN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Quintana-MoralesN03
Pedro J. Quintana-Morales, Juan L. Navarro-Mesa:
An approach to common acoustical pole and zero modeling of consecutive periods of voiced speech. 2433-2436
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DengBWH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DengBWH03
Huiqun Deng, Michael P. Beddoes, Rabab Kreidieh Ward, Murray Hodgson:
Estimating the vocal-tract area function and the derivative of the glottal wave from a speech signal. 2437-2440
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZolfaghariNIKI03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZolfaghariNIKI03
Parham Zolfaghari, Tomohiro Nakatani, Toshio Irino, Hideki Kawahara, Fumitada Itakura:
Glottal closure instant synchronous sinusoidal model for high quality speech analysis/synthesis. 2441-2444
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Karjalainen03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Karjalainen03
Matti Karjalainen:
Mixed physical modeling techniques applied to speech production. 2445-2448
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FagelS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FagelS03
Sascha Fagel, Walter F. Sendlmeier:
An expandable web-based audiovisual text-to-speech synthesis system. 2449-2452
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NikleczyO03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NikleczyO03
P. Nikleczy, Gábor Olaszy:
A reconstruction of farkas kempelen's speaking machine. 2453-2456
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GuH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GuH03
Wentao Gu, Keikichi Hirose:
Acoustic model selection and voice quality assessment for HMM-based Mandarin speech synthesis. 2457-2460
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YamagishiOMK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YamagishiOMK03
Junichi Yamagishi, Koji Onishi, Takashi Masuko, Takao Kobayashi:
Modeling of various speaking styles and emotions for HMM-based speech synthesis. 2461-2464
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MaiaZTKR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MaiaZTKR03
Ranniery Maia, Heiga Zen, Keiichi Tokuda, Tadashi Kitamura, Fernando Gil Vianna Resende Jr.:
Towards the development of a brazilian portuguese text-to-speech system based on HMM. 2465-2468
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VozilaALT03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VozilaALT03
Paul Vozila, Jeff Adams, Yuliya Lobacheva, Ryan Thomas:
Grapheme to phoneme conversion and dictionary verification using graphonemes. 2469-2472
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FackrellSH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FackrellSH03
Justin Fackrell, Wojciech Skut, Kathrine Hammervold:
Improving the accuracy of pronunciation prediction for unit selection TTS. 2473-2476
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MishraKS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MishraKS03
Taniya Mishra, Esther Klabbers, Jan P. H. van Santen:
Detection of list-type sentences. 2477-2480

Acoustic Modelling 1, 2

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PrietoJC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PrietoJC03
Ramon Prieto, Jing Jiang, Chi-Ho Choi:
A new pitch synchronous time domain phoneme recognizer using component analysis and pitch clustering. 2481-2484
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KojimaT03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KojimaT03
Hiroaki Kojima, Kazuyo Tanaka:
Mixed-lingual spoken word recognition by using VQ codebook sequences of variable length segments. 2485-2488
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LahtiVV03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LahtiVV03
Tommi Lahti, Olli Viikki, Marcel Vasilache:
Low memory acoustic models for HMM based speech recognition. 2489-2492
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Fonollosa03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Fonollosa03
José A. R. Fonollosa:
Nearest-neighbor search algorithms based on subcodebook selection and its application to speech recognition. 2493-2496
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OmarH03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OmarH03a
Mohamed Kamal Omar, Mark Hasegawa-Johnson:
Non-linear maximum likelihood feature transformation for speech recognition. 2497-2500
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SukJC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SukJC03
Soo-Young Suk, Ho-Youl Jung, Hyun-Yeol Chung:
Automatic generation of context-independent variable parameter models using successive state and mixture splitting. 2501-2504
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZgankKH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZgankKH03
Andrej Zgank, Zdravko Kacic, Bogomir Horvat:
Data driven generation of broad classes for decision tree construction in acoustic modeling. 2505-2508
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OlsenD03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OlsenD03
Peder A. Olsen, Satya Dharanipragada:
An efficient integrated gender detection scheme and time mediated averaging of gender dependent acoustic models. 2509-2512
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/OgataA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/OgataA03
Jun Ogata, Yasuo Ariki:
Syllable-based acoustic modeling for Japanese spontaneous speech recognition. 2513-2516
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CetinO03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CetinO03
Özgür Çetin, Mari Ostendorf:
Cross-stream observation dependencies for multi-stream speech recognition. 2517-2520
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MakC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MakC03
Brian Kan-Wing Mak, Kin-Wah Chan:
Pruning transitions in a hidden Markov model with optimal brain surgeon. 2521-2524
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Magimai-DossSB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Magimai-DossSB03
Mathew Magimai-Doss, Todd A. Stephenson, Hervé Bourlard:
Using pitch frequency information in speech recognition. 2525-2528
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LivescuGB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LivescuGB03
Karen Livescu, James R. Glass, Jeff A. Bilmes:
Hidden feature models for speech recognition using dynamic Bayesian networks. 2529-2532
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuZDH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuZDH03
Wei Hu, Yimin Zhang, Qian Diao, Shan Huang:
An efficient viterbi algorithm on DBNs. 2533-2536
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhangE03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhangE03
Li Zhang, William H. Edmondson:
Speech recognition based on syllable recovery. 2537-2540
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Abu-AmerC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Abu-AmerC03
Tarek Abu-Amer, Julie Carson-Berndsen:
HARTFEX: a multi-dimensional system of HMM based recognisers for articulatory features extraction. 2541-2544
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Maison03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Maison03
Benoît Maison:
Automatic baseform generation from acoustic data. 2545-2548
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SpiessWFK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SpiessWFK03
Thurid Spiess, Britta Wrede, Gernot A. Fink, Franz Kummert:
Data-driven pronunciation modeling for ASR using acoustic subword units. 2549-2552
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VanhouckeS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VanhouckeS03
Vincent Vanhoucke, Ananth Sankar:
Variable length mixtures of inverse covariances. 2605-2608
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Neukirchen03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Neukirchen03
Christoph Neukirchen:
Semi-tied full deviation matrices for laplacian density models. 2609-2612
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VisweswariahAG03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VisweswariahAG03
Karthik Visweswariah, Scott Axelrod, Ramesh A. Gopinath:
Acoustic modeling with mixtures of subspace constrained exponential models. 2613-2616
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GoelAGOV03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GoelAGOV03
Vaibhava Goel, Scott Axelrod, Ramesh A. Gopinath, Peder A. Olsen, Karthik Visweswariah:
Discriminative estimation of subspace precision and mean (SPAM) models. 2617-2620
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YoshizawaS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YoshizawaS03
Shinichi Yoshizawa, Kiyohiro Shikano:
Model-integration rapid training based on maximum likelihood for speech recognition. 2621-2624
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LimaZNMTK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LimaZNMTK03
Amaro A. de Lima, Heiga Zen, Yoshihiko Nankaku, Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura:
On the use of kernel PCA for feature extraction in speech recognition. 2625-2628

Time is of the Essence - Dynamic Approaches to Spoken Language

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Greenberg03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Greenberg03a
Steven Greenberg:
Time is of the essence - dynamic approaches to spoken language. 2553-2556
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GrantG03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GrantG03
Ken W. Grant, Steven Greenberg:
Spectro-temporal interactions in auditory and auditory-visual speech processing. 2557-2560
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Poeppel03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Poeppel03
David Poeppel:
Brain imaging correlates of temporal quantization in spoken language. 2561-2564
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Saltzman03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Saltzman03
Elliot Saltzman:
Temporal aspects of articulatory control. 2565-2568
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Keller03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Keller03
Brigitte Zellner Keller:
The temporal organisation of speech as gauged by speech synthesis. 2569-2572
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Kleinschmidt03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Kleinschmidt03
Michael Kleinschmidt:
Localized spectro-temporal features for automatic speech recognition. 2573-2576
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Atlas03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Atlas03
Les E. Atlas:
Modulation spectral filtering of speech. 2577-2580

Topics in Speech Recognition

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Moore03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Moore03
Roger K. Moore:
A comparison of the data requirements of automatic speech recognition systems and human listeners. 2581-2584
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TangSZ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TangSZ03
Min Tang, Stephanie Seneff, Victor W. Zue:
Modeling linguistic features in speech recognition. 2585-2588
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RamabhadranHCIN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RamabhadranHCIN03
Bhuvana Ramabhadran, Jing Huang, Upendra V. Chaudhari, Giridharan Iyengar, Harriet J. Nock:
Impact of audio segmentation and segment clustering on automated transcription accuracy of large spoken archives. 2589-2592
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BeaufaysSWW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BeaufaysSWW03
Françoise Beaufays, Ananth Sankar, Shaun Williams, Mitch Weintraub:
Learning linguistically valid pronunciations from acoustic data. 2593-2596
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MinematsuOH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MinematsuOH03
Nobuaki Minematsu, Koichi Osaki, Keikichi Hirose:
Improvement of non-native speech recognition by effectively modeling frequently observed pronunciation habits. 2597-2600
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NakajimaKSC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NakajimaKSC03
Yoshitaka Nakajima, Hideki Kashioka, Kiyohiro Shikano, Nick Campbell:
Non-audible murmur recognition. 2601-2604

Speaker and Language Recognition

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MamiC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MamiC03
Yassine Mami, Delphine Charlet:
Speaker modeling from selected neighbors applied to speaker recognition. 2629-2632
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZetterholmSGEDC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZetterholmSGEDC03
Elisabeth Zetterholm, Kirk P. H. Sullivan, James Green, Erik J. Eriksson, Jan van Doorn, Peter E. Czigler:
Who knows carl bildt? - and what if you don't? 2633-2636
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Vivaracho-PascualOAM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Vivaracho-PascualOAM03
Carlos Vivaracho-Pascual, Javier Ortega-Garcia, Luis Alonso Romero, Q. Isaac Moro-Sancho:
Improving the competitiveness of discriminant neural networks in speaker verification. 2637-2640
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KinnunenHF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KinnunenHF03
Tomi Kinnunen, Ville Hautamäki, Pasi Fränti:
On the fusion of dissimilarity-based classifiers for speaker identification. 2641-2644
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MingSHCSV03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MingSHCSV03
Ji Ming, Darryl Stewart, Philip Hanna, Pat Corr, Francis Jack Smith, Saeed Vaseghi:
Robust speaker identification using posterior union models. 2645-2648
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZilcaNR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZilcaNR03
Ran D. Zilca, Jirí Navrátil, Ganesh N. Ramaswamy:
"syncpitch": a pseudo pitch synchronous algorithm for speaker recognition. 2649-2652
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KwonN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KwonN03
Soonil Kwon, Shrikanth S. Narayanan:
A method for on-line speaker indexing using generic reference models. 2653-2656
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MihoubiBD03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MihoubiBD03
Mohamed Mihoubi, Gilles Boulianne, Pierre Dumouchel:
Discriminative training and maximum likelihood detector for speaker identification. 2657-2660
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KajarekarAH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KajarekarAH03
Sachin S. Kajarekar, André Gustavo Adami, Hynek Hermansky:
Novel approaches for one- and two-speaker detection. 2661-2664
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CampbellRD03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CampbellRD03
Joseph P. Campbell, Douglas A. Reynolds, Robert B. Dunn:
Fusing high- and low-level features for speaker recognition. 2665-2668
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SivakumaranFA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SivakumaranFA03
P. Sivakumaran, J. Fortuna, Aladdin M. Ariyaeeinia:
Score normalisation applied to open-set, text-independent speaker identification. 2669-2672
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ArcienegaD03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ArcienegaD03
Mijail Arcienega, Andrzej Drygajlo:
On the number of Gaussian components in a mixture: an application to speaker verification tasks. 2673-2676
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Salvi03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Salvi03
Giampiero Salvi:
Using accent information in ASR models for Swedish. 2677-2680
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NakajimaNAA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NakajimaNAA03
Hideharu Nakajima, Masaaki Nagata, Hisako Asano, Masanobu Abe:
Estimating Japanese word accent from syllable sequence using support vector machine. 2681-2684
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CordobaPGMFP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CordobaPGMFP03
Ricardo de Córdoba, G. Prime, Javier Macías Guarasa, Juan Manuel Montero, Javier Ferreiros, José Manuel Pardo:
PPRLM optimization for language identification in air traffic control tasks. 2685-2688

Spoken Language Understanding and Translation

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Chen03d
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Chen03d
Hsin-Hsi Chen:
Spoken cross-language access to image collection via captions. 2749-2752
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JamoussiSH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JamoussiSH03
Salma Jamoussi, Kamel Smaïli, Jean Paul Haton:
Understanding process for speech recognition. 2753-2756
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TakezawaK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TakezawaK03
Toshiyuki Takezawa, Gen-ichiro Kikui:
Collecting machine-translation-aided bilingual dialogues for corpus-based speech translation. 2757-2760
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WutiwiwatchaiF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WutiwiwatchaiF03
Chai Wutiwiwatchai, Sadaoki Furui:
Combination of finite state automata and neural network for spoken language understanding. 2761-2764
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HorlockK03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HorlockK03a
James Horlock, Simon King:
Discriminative methods for improving named entity extraction on speech data. 2765-2768
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GuGP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GuGP03
Liang Gu, Yuqing Gao, Michael Picheny:
Improving statistical natural concept generation in interlingua-based speech-to-speech translation. 2769-2772
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GoulianAP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GoulianAP03
Jérôme Goulian, Jean-Yves Antoine, Franck Poirier:
How NLP techniques can improve speech understanding: ROMUS - a robust chunk based message understanding system using link grammars. 2773-2776
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChelbaA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChelbaA03
Ciprian Chelba, Alex Acero:
Discriminative training of n-gram classifiers for speech and text routing. 2777-2780
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HonalS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HonalS03
Matthias Honal, Tanja Schultz:
Correction of disfluencies in spontaneous speech using a noisy-channel approach. 2781-2784
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KoumpisR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KoumpisR03
Konstantinos Koumpis, Steve Renals:
Multi-class extractive voicemail summarization. 2785-2788
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TurRH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TurRH03
Gökhan Tür, Mazin G. Rahim, Dilek Hakkani-Tür:
Active labeling for spoken language understanding. 2789-2792
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TurH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TurH03
Gökhan Tür, Dilek Hakkani-Tür:
Exploiting unlabeled utterances for spoken language understanding. 2793-2796
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuGGP03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuGGP03
Fu-Hua Liu, Yuqing Gao, Liang Gu, Michael Picheny:
Noise robustness in speech to speech translation. 2797-2800
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SiuMW03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SiuMW03
Kai-Chung Siu, Helen M. Meng, Chin-Chung Wong:
Example-based bi-directional Chinese-English machine translation with semi-automatically induced grammars. 2801-2804
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WredeS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WredeS03
Britta Wrede, Elizabeth Shriberg:
Spotting "hot spots" in meetings: human judgments and prosodic cues. 2805-2808
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangA03
Ye-Yi Wang, Alex Acero:
Combination of CFG and n-gram modeling in semantic grammar learning. 2809-2812
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenL03
Shun-Chuan Chen, Lin-Shan Lee:
Automatic title generation for Chinese spoken documents using an adaptive k nearest-neighbor approach. 2813-2816
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HoriHM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HoriHM03
Takaaki Hori, Chiori Hori, Yasuhiro Minami:
Speech summarization using weighted finite-state transducers. 2817-2820
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeeCL03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeeCL03
Yun-Tien Lee, Shun-Chuan Chen, Lin-Shan Lee:
Cross domain Chinese speech understanding and answering based on named-entity extraction. 2821-2824
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HoriHF03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HoriHF03
Chiori Hori, Takaaki Hori, Sadaoki Furui:
Evaluation method for automatic speech summarization. 2825-2828
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiLC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiLC03
Li Li, Feng Liu, Wu Chou:
An information theoretic approach for using word cluster information in natural language call routing. 2829-2832
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SistaSKS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SistaSKS03
Sreenivasa Sista, Amit Srivastava, Francis Kubala, Richard M. Schwartz:
Unsupervised topic discovery applied to segmentation of news transcriptions. 2833-2836

Towards a Roadmap for Speech Technology

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Heisterkamp03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Heisterkamp03
Paul Heisterkamp:
"do not attempt to light with match!": some thoughts on progress and research goals in spoken dialog systems. 2897-2900
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GranstromH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GranstromH03
Björn Granström, David House:
Multimodality and speech technology: verbal and non-verbal communication in talking agents. 2901-2904
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Cole03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Cole03
Ronald A. Cole:
Roadmaps, journeys and destinations speculations on the future of speech technology research. 2905-2908
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Moore03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Moore03a
Roger K. Moore:
Spoken language output: realising the vision. 2909-2912

Speaker Recognition and Verification

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KennyMD03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KennyMD03
Patrick Kenny, Mohamed Mihoubi, Pierre Dumouchel:
New MAP estimators for speaker recognition. 2961-2964
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MorenoH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MorenoH03
Pedro J. Moreno, Purdy Ho:
A new SVM approach to speaker identification and verification using probabilistic distance kernels. 2965-2968
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CheungMK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CheungMK03
Ming-Cheung Cheung, Man-Wai Mak, Sun-Yuan Kung:
Adaptive decision fusion for multi-sample speaker verification over GSM networks. 2969-2972
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YiuMK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YiuMK03
Kwok-Kwong Yiu, Man-Wai Mak, Sun-Yuan Kung:
Environment adaptation for robust speaker verification. 2973-2976
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZigelC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZigelC03
Yaniv Zigel, Arnon Cohen:
On cohort selection for speaker verification. 2977-2980
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TadjB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TadjB03
Chakib Tadj, A. Benlahouar:
Speaker characterization using principal component analysis and wavelet transform for speaker verification. 2981-2984
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AkitaK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AkitaK03
Yuya Akita, Tatsuya Kawahara:
Unsupervised speaker indexing using anchor models and automatic transcription of discussions. 2985-2988
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SchererGJKB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SchererGJKB03
Klaus R. Scherer, Didier Grandjean, Tom Johnstone, Gudrun Klasmeyer, Tanja Bänziger:
A statistical approach to assessing speech and voice variability in speaker verification. 2989-2992
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TsaiWR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TsaiWR03
Wei-Ho Tsai, Hsin-Min Wang, Dwight Rodgers:
Automatic singer identification of popular music recordings via estimation and modeling of solo vocal signal. 2993-2996
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VescoviCR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VescoviCR03
Michele Vescovi, Mauro Cettolo, Romeo Rizzi:
A DP algorithm for speaker change detection. 2997-3000
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Lapidot03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Lapidot03
Itshak Lapidot:
SOM as likelihood estimator for speaker clustering. 3001-3004
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MinematsuYH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MinematsuYH03
Nobuaki Minematsu, Keita Yamauchi, Keikichi Hirose:
Automatic estimation of perceptual age using speaker modeling techniques. 3005-3008
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Rifkin03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Rifkin03
Ryan Rifkin:
Speaker recognition using local models. 3009-3012
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VogtPS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VogtPS03
Robbie Vogt, Jason W. Pelecanos, Sridha Sridharan:
Dependence of GMM adaptation on feature post-processing for speaker recognition. 3013-3016
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NakagawaZ03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NakagawaZ03
Seiichi Nakagawa, Wei Zhang:
Text-independent speaker recognition by speaker-specific GMM and speaker adapted syllable-based HMM. 3017-3020
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PadrtaR03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PadrtaR03
Ales Padrta, Vlasta Radová:
On the amount of speech data necessary for successful speaker identification. 3021-3024
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TurkS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TurkS03
Ulrich Türk, Florian Schiel:
Speaker verification based on the German veridat database. 3025-3028

Multi-Lingual Spoken Language Processing

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FischerJK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FischerJK03
Volker Fischer, Eric Janke, Siegfried Kunzmann:
Recent progress in the decoding of non-native speech with multilingual acoustic models. 3105-3108
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KuoLWC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KuoLWC03
Wei-Chih Kuo, Li-Feng Lin, Yih-Ru Wang, Sin-Horng Chen:
An NN-based approach to prosodic information generation for synthesizing English words embedded in Chinese text. 3109-3112
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MatsunagaOYI03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MatsunagaOYI03
Shoichi Matsunaga, Atsunori Ogawa, Yoshikazu Yamaguchi, Akihiro Imamura:
Speaker adaptation for non-native speakers using bilingual English lexicon and acoustic models. 3113-3116
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LeBBC03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LeBBC03
Viet Bac Le, Brigitte Bigi, Laurent Besacier, Eric Castelli:
Using the web for fast language model construction in minority languages. 3117-3120
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChengLWMM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChengLWMM03
Yan Ming Cheng, Chen Liu, Yuanjun Wei, Lynette Melnar, Changxue Ma:
An approach to multilingual acoustic modeling for portable devices. 3121-3124
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MartinSS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MartinSS03
Terrence Martin, Torbjørn Svendsen, Sridha Sridharan:
Cross-lingual pronunciation modelling for indonesian speech recognition. 3125-3128
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KimK03a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimK03a
Woosung Kim, Sanjeev Khudanpur:
Language model adaptation using cross-lingual information. 3129-3132
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WongMSS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WongMSS03
Eddie Wong, Terrence Martin, Torbjørn Svendsen, Sridha Sridharan:
Multilingual phone clustering for recognition of spontaneous indonesian speech utilising pronunciation modelling techniques. 3133-3136
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SrinivasamurthyN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SrinivasamurthyN03
Naveen Srinivasamurthy, Shrikanth S. Narayanan:
Language-adaptive persian speech recognition. 3137-3140
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KillerSS03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KillerSS03
Mirjam Killer, Sebastian Stüker, Tanja Schultz:
Grapheme based speech recognition. 3141-3144

Interdisciplinary

- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Petrushin03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Petrushin03
Valery A. Petrushin:
Learning Chinese tones. 3145-3148
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HiroseGM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HiroseGM03
Keikichi Hirose, Frédéric Gendrin, Nobuaki Minematsu:
A pronunciation training system for Japanese lexical accents with corrective feedback in learner's voice. 3149-3152
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MouriHM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MouriHM03
Taro Mouri, Keikichi Hirose, Nobuaki Minematsu:
Considerations on vowel durations for Japanese CALL system. 3153-3156
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KatoNKA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KatoNKA03
Hiroaki Kato, Masumi Nukinay, Hideki Kawahara, Reiko Akahane-Yamada:
Influence of recording equipment on the identification of second language phoneme contrasts. 3157-3160
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TamMBB03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TamMBB03
Yik-Cheung Tam, Jack Mostow, Joseph E. Beck, Satanjeev Banerjee:
Training a confidence measure for a reading tutor that listens. 3161-3164
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BanerjeeBM03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BanerjeeBM03
Satanjeev Banerjee, Joseph E. Beck, Jack Mostow:
Evaluating the effect of predicting oral reading miscues. 3165-3168
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HoladaN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HoladaN03
Miroslav Holada, Jan Nouza:
VISPER II - enhanced version of the educational software for speech processing courses. 3169-3172
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LuTO03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LuTO03
Meirong Lu, Kazuyuki Takagi, Kazuhiko Ozeki:
The use of multiple pause information in dependency structure analysis of spoken Japanese sentences. 3173-3176
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TakagiOOO03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TakagiOOO03
Kazuyuki Takagi, Mamiko Okimoto, Yoshio Ogawa, Kazuhiko Ozeki:
A neural network approach to dependency analysis of Japanese sentences using prosodic information. 3177-3180
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AsanoNA03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AsanoNA03
Hisako Asano, Masaaki Nagata, Masanobu Abe:
Say-as classification for alphabetic words in Japanese texts. 3181-3184
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/IshiharaTO03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/IshiharaTO03
Kazushi Ishihara, Yasushi Tsubota, Hiroshi G. Okuno:
Automatic transformation of environmental sounds into sound-imitation words based on Japanese syllable structure. 3185-3188
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZenTK03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZenTK03
Heiga Zen, Keiichi Tokuda, Tadashi Kitamura:
Decision tree-based simultaneous clustering of phonetic contexts, dimensions, and state positions for acoustic modeling. 3189-3192
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/NakagawaMN03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/NakagawaMN03
Seiichi Nakagawa, Kazumasa Mori, Naoki Nakamura:
A statistical method of evaluating pronunciation proficiency for English words spoken by Japanese. 3193-3196

a service of

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.