


Остановите войну!
for scientists:


default search action
Ian R. Lane
Ian Richard Lane
Person information

- affiliation: Carnegie Mellon University, Department of Electrical and Computer Engineering, Pittsburgh, PA, USA
- affiliation: Capio Inc., Belmont, CA, USA
- affiliation (PhD 2006): Kyoto University, Japan
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2022
- [c84]Yifan Peng
, Siddharth Dalmia, Ian R. Lane, Shinji Watanabe:
Branchformer: Parallel MLP-Attention Architectures to Capture Local and Global Context for Speech Recognition and Understanding. ICML 2022: 17627-17643 - [c83]Muqiao Yang, Ian R. Lane, Shinji Watanabe
:
Online Continual Learning of End-to-End Speech Recognition Models. INTERSPEECH 2022: 2668-2672 - [i25]Yifan Peng, Siddharth Dalmia, Ian R. Lane, Shinji Watanabe
:
Branchformer: Parallel MLP-Attention Architectures to Capture Local and Global Context for Speech Recognition and Understanding. CoRR abs/2207.02971 (2022) - [i24]Muqiao Yang, Ian R. Lane, Shinji Watanabe
:
Online Continual Learning of End-to-End Speech Recognition Models. CoRR abs/2207.05071 (2022) - 2021
- [c82]Guan-Lin Chao, Ian R. Lane:
Human-Agent Collaboration Strategies for Vision-Grounded Instruction Following. ASRU 2021: 877-884 - [c81]Benjamin Elizalde, Radu Revutchi, Samarjit Das, Bhiksha Raj, Ian R. Lane, Laurie M. Heller:
Identifying Actions for Sound Event Classification. WASPAA 2021: 26-30 - [i23]Benjamin Elizalde, Radu Revutchi, Samarjit Das, Bhiksha Raj, Ian R. Lane, Laurie M. Heller:
Identifying Actions for Sound Event Classification. CoRR abs/2104.12693 (2021) - 2020
- [c80]Avneesh Saluja, Ian R. Lane, Ying Zhang:
Machine Translation with Binary Feedback: a Large-Margin Approach. AMTA 2020
2010 – 2019
- 2019
- [c79]Guan-Lin Chao, Chih Chi Hu, Bing Liu, John Paul Shen, Ian R. Lane:
Audio-visual TED corpus: enhancing the TED-LIUM corpus with facial information, contextual text and object recognition. UbiComp/ISWC Adjunct 2019: 468-473 - [c78]Guan-Lin Chao, Ian R. Lane:
BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from Transformer. INTERSPEECH 2019: 1468-1472 - [c77]Guan-Lin Chao, John Paul Shen, Ian R. Lane:
Deep Speaker Embedding for Speaker-Targeted Automatic Speech Recognition. NLPIR 2019: 39-43 - [c76]Guan-Lin Chao, Abhinav Rastogi, Semih Yavuz, Dilek Hakkani-Tür, Jindong Chen, Ian R. Lane:
Learning Question-Guided Video Representation for Multi-Turn Video Question Answering. SIGdial 2019: 215-225 - [i22]Guan-Lin Chao, Abhinav Rastogi, Semih Yavuz, Dilek Hakkani-Tür, Jindong Chen, Ian R. Lane:
Learning Question-Guided Video Representation for Multi-Turn Video Question Answering. ViGIL@NeurIPS 2019 - [i21]Guan-Lin Chao, William Chan, Ian R. Lane:
Speaker-Targeted Audio-Visual Models for Speech Recognition in Cocktail-Party Environments. CoRR abs/1906.05962 (2019) - [i20]Guan-Lin Chao, Ian R. Lane:
BERT-DST: Scalable End-to-End Dialogue State Tracking with Bidirectional Encoder Representations from Transformer. CoRR abs/1907.03040 (2019) - [i19]Guan-Lin Chao, Abhinav Rastogi, Semih Yavuz, Dilek Hakkani-Tür, Jindong Chen, Ian R. Lane:
Learning Question-Guided Video Representation for Multi-Turn Video Question Answering. CoRR abs/1907.13280 (2019) - 2018
- [j7]Sebastian Säger, Benjamin Elizalde, Damian Borth, Christian Schulze, Bhiksha Raj, Ian R. Lane:
AudioPairBank: towards a large-scale tag-pair-based audio content analysis. EURASIP J. Audio Speech Music. Process. 2018: 12 (2018) - [c75]Bing Liu, Tong Yu, Ian R. Lane, Ole J. Mengshoel:
Customized Nonlinear Bandits for Online Response Selection in Neural Conversation Models. AAAI 2018: 5245-5252 - [c74]Ming Zeng, Haoxiang Gao, Tong Yu, Ole J. Mengshoel, Helge Langseth, Ian R. Lane, Xiaobing Liu:
Understanding and improving recurrent networks for human activity recognition by continuous attention. UbiComp 2018: 56-63 - [c73]Kyu J. Han, Akshay Chandrashekaran, Jungsuk Kim, Ian R. Lane:
Densely Connected Networks for Conversational Speech Recognition. INTERSPEECH 2018: 796-800 - [c72]Chih Chi Hu, Bing Liu, John Shen, Ian R. Lane:
Online Incremental Learning for Speaker-Adaptive Language Models. INTERSPEECH 2018: 3363-3367 - [c71]Bing Liu, Ian R. Lane:
End-to-End Learning of Task-Oriented Dialogs. NAACL-HLT (Student Research Workshop) 2018: 67-73 - [c70]Bing Liu, Ian R. Lane:
Adversarial Learning of Task-Oriented Neural Dialog Models. SIGDIAL Conference 2018: 350-359 - [i18]Kyu J. Han, Akshay Chandrashekaran, Jungsuk Kim, Ian R. Lane:
The CAPIO 2017 Conversational Speech Recognition System. CoRR abs/1801.00059 (2018) - [i17]Ming Zeng, Tong Yu, Xiao Wang, Le T. Nguyen, Ole J. Mengshoel, Ian R. Lane:
Semi-Supervised Convolutional Neural Networks for Human Activity Recognition. CoRR abs/1801.07827 (2018) - [i16]Bing Liu, Ian R. Lane:
Adversarial Learning of Task-Oriented Neural Dialog Models. CoRR abs/1805.11762 (2018) - [i15]Ming Zeng, Haoxiang Gao, Tong Yu, Ole J. Mengshoel, Helge Langseth, Ian R. Lane, Xiaobing Liu:
Understanding and Improving Recurrent Networks for Human Activity Recognition by Continuous Attention. CoRR abs/1810.04038 (2018) - 2017
- [c69]Bing Liu, Ian R. Lane:
Iterative policy learning in end-to-end trainable task-oriented neural dialog models. ASRU 2017: 482-489 - [c68]Ming Zeng, Tong Yu, Xiao Wang, Le T. Nguyen, Ole J. Mengshoel, Ian R. Lane:
Semi-supervised convolutional neural networks for human activity recognition. IEEE BigData 2017: 522-529 - [c67]Benjamin Elizalde, Ankit Shah, Siddharth Dalmia, Min Hun Lee, Rohan Badlani, Anurag Kumar, Bhiksha Raj, Ian R. Lane:
An approach for self-training audio event detectors using web data. EUSIPCO 2017: 1863-1867 - [c66]Bing Liu, Ian R. Lane:
Dialog context language modeling with recurrent neural networks. ICASSP 2017: 5715-5719 - [c65]Akshay Chandrashekaran, Ian R. Lane:
Hierarchical Constrained Bayesian Optimization for Feature, Acoustic Model and Decoder Parameter Optimization. INTERSPEECH 2017: 538-542 - [c64]Kyu J. Han, Seongjun Hahm, Byung-Hak Kim, Jungsuk Kim, Ian R. Lane:
Deep Learning-Based Telephony Speech Recognition in the Wild. INTERSPEECH 2017: 1323-1327 - [c63]Bing Liu, Ian R. Lane:
An End-to-End Trainable Neural Network Model with Belief Tracking for Task-Oriented Dialog. INTERSPEECH 2017: 2506-2510 - [c62]Suyoun Kim, Ian R. Lane:
End-to-End Speech Recognition with Auditory Attention for Multi-Microphone Distance Speech Recognition. INTERSPEECH 2017: 3867-3871 - [c61]Akshay Chandrashekaran, Ian R. Lane:
Speeding up Hyper-parameter Optimization by Extrapolation of Learning Curves Using Previous Builds. ECML/PKDD (1) 2017: 477-492 - [i14]Bing Liu, Ian R. Lane:
Dialog Context Language Modeling with Recurrent Neural Networks. CoRR abs/1701.04056 (2017) - [i13]Bing Liu, Ian R. Lane:
An End-to-End Trainable Neural Network Model with Belief Tracking for Task-Oriented Dialog. CoRR abs/1708.05956 (2017) - [i12]Bing Liu, Ian R. Lane:
Iterative Policy Learning in End-to-End Trainable Task-Oriented Neural Dialog Models. CoRR abs/1709.06136 (2017) - [i11]Bing Liu, Tong Yu, Ian R. Lane, Ole J. Mengshoel:
Customized Nonlinear Bandits for Online Response Selection in Neural Conversation Models. CoRR abs/1711.08493 (2017) - [i10]Bing Liu, Ian R. Lane:
Multi-Domain Adversarial Learning for Slot Filling in Spoken Language Understanding. CoRR abs/1711.11310 (2017) - 2016
- [j6]Bo Yu, Ian R. Lane, Fang Chen:
3D Face Detection via Reconstruction Over Hierarchical Features for Single Face Situations. Int. J. Pattern Recognit. Artif. Intell. 30(4): 1655013:1-1655013:11 (2016) - [c60]David Cohen, Ian R. Lane:
An Oral Exam for Measuring a Dialog System's Capabilities. AAAI 2016: 835-841 - [c59]Rahul Rajan, Ted Selker, Ian R. Lane:
Effects of Mediating Notifications Based on Task Load. AutomotiveUI 2016: 145-152 - [c58]Benjamin Elizalde, Guan-Lin Chao, Ming Zeng, Ian R. Lane:
City-Identification of Flickr Videos Using Semantic Acoustic Features. BigMM 2016: 303-306 - [c57]Benjamin Elizalde, Anurag Kumar, Ankit Shah, Rohan Badlani, Emmanuel Vincent, Bhiksha Raj, Ian R. Lane:
Experiments on the DCASE Challenge 2016: Acoustic Scene Classification and Sound Event Detection in Real Life Recording. DCASE 2016: 20-24 - [c56]Jungsuk Kim, Ian R. Lane:
Accelerating multi-user large vocabulary continuous speech recognition on heterogeneous CPU-GPU platforms. ICASSP 2016: 5330-5334 - [c55]Bing Liu, Ian R. Lane:
Attention-Based Recurrent Neural Network Models for Joint Intent Detection and Slot Filling. INTERSPEECH 2016: 685-689 - [c54]Guan-Lin Chao, William Chan, Ian R. Lane:
Speaker-Targeted Audio-Visual Models for Speech Recognition in Cocktail-Party Environments. INTERSPEECH 2016: 2120-2124 - [c53]William Chan, Ian R. Lane:
On Online Attention-Based Speech Recognition and Joint Mandarin Character-Pinyin Training. INTERSPEECH 2016: 3404-3408 - [c52]Suyoun Kim, Ian R. Lane:
Recurrent Models for Auditory Attention in Multi-Microphone Distant Speech Recognition. INTERSPEECH 2016: 3838-3842 - [c51]Wonkyum Lee, Kyu J. Han, Ian R. Lane:
Semi-Supervised Speaker Adaptation for In-Vehicle Speech Recognition with Deep Neural Networks. INTERSPEECH 2016: 3843-3847 - [c50]Rahul Rajan, Ted Selker, Ian R. Lane:
Task Load Estimation and Mediation Using Psycho-physiological Measures. IUI 2016: 48-59 - [c49]Bing Liu, Ian R. Lane:
Joint Online Spoken Language Understanding and Language Modeling With Recurrent Neural Networks. SIGDIAL Conference 2016: 22-30 - [c48]Akshay Chandrashekaran, Ian R. Lane:
Automated optimization of decoder hyper-parameters for online LVCSR. SLT 2016: 454-460 - [e2]Alexander I. Rudnicky, Antoine Raux, Ian R. Lane, Teruhisa Misu:
Situated Dialog in Speech-Based Human-Computer Interaction, 5th International Workshop on Spoken Dialogue Systems, IWSDS 2014, Napa, CA, USA, January 18-20, 2014. Signals and Communication Technology, Springer 2016, ISBN 978-3-319-21833-5 [contents] - [i9]Suyoun Kim, Bhiksha Raj, Ian R. Lane:
Environmental Noise Embeddings for Robust Speech Recognition. CoRR abs/1601.02553 (2016) - [i8]Benjamin Elizalde, Guan-Lin Chao, Ming Zeng, Ian R. Lane:
City-Identification of Flickr Videos Using Semantic Acoustic Features. CoRR abs/1607.03257 (2016) - [i7]Sebastian Säger, Damian Borth, Benjamin Elizalde, Christian Schulze, Bhiksha Raj, Ian R. Lane, Andreas Dengel:
AudioSentibank: Large-scale Semantic Ontology of Acoustic Concepts for Audio Content Analysis. CoRR abs/1607.03766 (2016) - [i6]Benjamin Elizalde, Anurag Kumar, Ankit Shah
, Rohan Badlani, Emmanuel Vincent, Bhiksha Raj, Ian R. Lane:
Experiments on the DCASE Challenge 2016: Acoustic Scene Classification and Sound Event Detection in Real Life Recording. CoRR abs/1607.06706 (2016) - [i5]Bing Liu, Ian R. Lane:
Attention-Based Recurrent Neural Network Models for Joint Intent Detection and Slot Filling. CoRR abs/1609.01454 (2016) - [i4]Bing Liu, Ian R. Lane:
Joint Online Spoken Language Understanding and Language Modeling with Recurrent Neural Networks. CoRR abs/1609.01462 (2016) - 2015
- [j5]Teruhisa Misu, Antoine Raux, Rakesh Gupta, Ian R. Lane:
Situated language understanding for a spoken dialog system within vehicles. Comput. Speech Lang. 34(1): 186-200 (2015) - [c47]William Chan, Ian R. Lane:
Deep convolutional neural networks for acoustic modeling in low resource languages. ICASSP 2015: 2056-2060 - [c46]Florian Metze, Ankur Gandhe, Yajie Miao, Zaid A. W. Sheikh, Yun Wang, Di Xu, Hao Zhang, Jungsuk Kim, Ian R. Lane, Wonkyum Lee, Sebastian Stüker, Markus Müller:
Semi-supervised training in low-resource ASR and KWS. ICASSP 2015: 4699-4703 - [c45]William Chan, Nan Rosemary Ke, Ian R. Lane:
Transferring knowledge from a RNN to a DNN. INTERSPEECH 2015: 3264-3268 - [i3]William Chan, Ian R. Lane:
Deep Recurrent Neural Networks for Acoustic Modelling. CoRR abs/1504.01482 (2015) - [i2]William Chan, Nan Rosemary Ke, Ian R. Lane:
Transferring Knowledge from a RNN to a DNN. CoRR abs/1504.01483 (2015) - [i1]Suyoun Kim, Ian R. Lane:
Recurrent Models for Auditory Attention in Multi-Microphone Distance Speech Recognition. CoRR abs/1511.06407 (2015) - 2014
- [c44]Jungsuk Kim, Ian R. Lane:
Accelerating large vocabulary continuous speech recognition on heterogeneous CPU-GPU platforms. ICASSP 2014: 3291-3295 - [c43]Wonkyum Lee, Jungsuk Kim, Ian R. Lane:
Multi-stream combination for LVCSR and keyword search on GPU-accelerated platforms. ICASSP 2014: 3296-3300 - [c42]Ankur Gandhe, Florian Metze
, Alex Waibel, Ian R. Lane:
Optimization of Neural Network Language Models for keyword search. ICASSP 2014: 4888-4892 - [c41]William Chan, Ian R. Lane:
Distributed asynchronous optimization of convolutional neural networks. INTERSPEECH 2014: 1073-1077 - [c40]Ankur Gandhe, Florian Metze, Ian R. Lane:
Neural network language models for low resource languages. INTERSPEECH 2014: 2615-2619 - [c39]David Cohen, Akshay Chandrashekaran, Ian R. Lane, Antoine Raux:
The HRI-CMU Corpus of Situated In-Car Interactions. IWSDS 2014: 85-95 - [c38]Teruhisa Misu, Antoine Raux, Rakesh Gupta, Ian R. Lane:
Situated Language Understanding at 25 Miles per Hour. SIGDIAL Conference 2014: 22-31 - [c37]Bo Yu, Ian R. Lane:
Multi-task deep learning for image understanding. SoCPaR 2014: 37-42 - 2013
- [c36]Ankur Gandhe, Long Qin, Florian Metze
, Alexander I. Rudnicky, Ian R. Lane, Matthias Eck:
Using web text to improve keyword spotting in speech. ASRU 2013: 428-433 - [c35]Haofeng Kou, Weijia Shang, Ian R. Lane, Jike Chong:
Optimized MFCC feature extraction on GPU. ICASSP 2013: 7130-7134 - [c34]Teruhisa Misu, Antoine Raux, Ian R. Lane, Joan Devassy, Rakesh Gupta:
Situated multi-modal dialog system in vehicles. GazeIn@ICMI 2013: 25-28 - [c33]Jonas Gehring, Wonkyum Lee, Kevin Kilgour, Ian R. Lane, Yajie Miao, Alex Waibel:
Modular combination of deep neural networks for acoustic modeling. INTERSPEECH 2013: 94-98 - 2012
- [c32]Paul Maergner
, Alex Waibel, Ian R. Lane:
Unsupervised vocabulary selection for real-time speech recognition of lectures. ICASSP 2012: 4417-4420 - [c31]Jungsuk Kim, Jike Chong, Ian R. Lane:
Efficient On-The-Fly Hypothesis Rescoring in a Hybrid GPU/CPU-based Large Vocabulary Continuous Speech Recognition Engine. INTERSPEECH 2012: 1035-1038 - [c30]David Cohen, Ian R. Lane:
A Simulation-based Framework for Spoken Language Understanding and Action Selection in Situated Interaction. SDCTD@NAACL-HLT 2012: 33-36 - [c29]Ian R. Lane, Vinay Prasad, Gaurav Sinha, Arlette Umuhoza, Shangyu Luo, Akshay Chandrashekaran, Antoine Raux:
HRItk: The Human-Robot Interaction ToolKit Rapid Development of Speech-Centric Interactive Systems in ROS. SDCTD@NAACL-HLT 2012: 41-44 - 2011
- [c28]Senaka Buthpitiya, Ian R. Lane, Jike Chong:
Rapid Training of Acoustic Models Using Graphics Processing Unit. INTERSPEECH 2011: 793-796 - [c27]Michele Cossalter, Priya Sundararajan, Ian R. Lane:
Ad-Hoc Meeting Transcription on Clusters of Mobile Devices. INTERSPEECH 2011: 2881-2884 - [c26]Paul Maergner, Kevin Kilgour, Ian R. Lane, Alex Waibel:
Unsupervised vocabulary selection for simultaneous lecture translation. IWSLT 2011: 214-221 - [c25]Paul Maergner, Ian R. Lane, Alex Waibel:
Unsupervised Vocabulary Selection for Domain-Independent Simultaneous Lecture Translation. MTSummit 2011 - [c24]Avneesh Saluja, Ian R. Lane, Ying Zhang:
Context-aware Language Modeling for Conversational Speech Translation. MTSummit 2011 - 2010
- [c23]Ian R. Lane, Alex Waibel:
Named-entity projection and data-driven morphological decomposition for field maintainable speech-to-speech translation systems. INTERSPEECH 2010: 2882-2885 - [c22]Daniel Chung Yong Lim, Ian R. Lane, Alex Waibel:
Real-time spoken language identification and recognition for speech-to-speech translation. IWSLT 2010: 307-312 - [c21]Ian R. Lane, Matthias Eck, Kay Rottmann, Alex Waibel:
Tools for Collecting Speech Corpora via Mechanical-Turk. Mturk@HLT-NAACL 2010: 184-187 - [c20]Matthias Eck, Ian R. Lane, Ying Zhang, Alex Waibel:
Jibbigo: Speech-to-speech translation on mobile devices. SLT 2010: 165-166 - [e1]Marcello Federico, Ian R. Lane, Michael Paul, François Yvon, Joseph Mariani:
2010 International Workshop on Spoken Language Translation, IWSLT 2010, Paris, France, December 2-3, 2010. ISCA 2010 [contents]
2000 – 2009
- 2009
- [c19]Hassan Al-Haj, Roger Hsiao, Ian R. Lane, Alan W. Black, Alex Waibel:
Pronunciation modeling for dialectal arabic speech recognition. ASRU 2009: 525-528 - [c18]Daniel Chung Yong Lim, Ian R. Lane:
Language identification for speech-to-speech translation. INTERSPEECH 2009: 204-207 - [c17]Nguyen Bach, Roger Hsiao, Matthias Eck, Paisarn Charoenpornsawat, Stephan Vogel, Tanja Schultz, Ian R. Lane, Alex Waibel, Alan W. Black:
Incremental Adaptation of Speech-to-Speech Translation. HLT-NAACL (Short Papers) 2009: 149-152 - 2008
- [c16]Matthias Paulik, Sharath Rao, Ian R. Lane, Stephan Vogel, Tanja Schultz
:
Sentence segmentation and punctuation recovery for spoken language translation. ICASSP 2008: 5105-5108 - [c15]Ian R. Lane, Alex Waibel:
Class-based statistical machine translation for field maintainable speech-to-speech translation. INTERSPEECH 2008: 2362-2365 - 2007
- [j4]Yik-Cheung Tam, Ian R. Lane, Tanja Schultz
:
Bilingual LSA-based adaptation for statistical machine translation. Mach. Transl. 21(4): 187-207 (2007) - [j3]Ian R. Lane, Tatsuya Kawahara
, Tomoko Matsui, Satoshi Nakamura:
Out-of-Domain Utterance Detection Using Classification Confidences of Multiple Topics. IEEE Trans. Speech Audio Process. 15(1): 150-161 (2007) - [c14]Yik-Cheung Tam, Ian R. Lane, Tanja Schultz:
Bilingual-LSA Based LM Adaptation for Spoken Language Translation. ACL 2007 - [c13]Nguyen Bach, Mohamed Noamany, Ian R. Lane, Tanja Schultz:
Handling OOV words in Arabic ASR via flexible morphological constraints. INTERSPEECH 2007: 2373-2376 - [c12]Sharath Rao, Ian R. Lane, Tanja Schultz:
Optimizing sentence segmentation for spoken language translation. INTERSPEECH 2007: 2845-2848 - [c11]Ian R. Lane, Andreas Zollmann, ThuyLinh Nguyen, Nguyen Bach, Ashish Venugopal, Stephan Vogel, Kay Rottmann, Ying Zhang, Alex Waibel:
The CMU-UKA statistical machine translation systems for IWSLT 2007. IWSLT 2007: 61-68 - [c10]Sharath Rao, Ian R. Lane, Tanja Schultz:
Improving spoken language translation by automatic disfluency removal: evidence from conversational speech transcripts. MTSummit 2007 - [c9]Bing Zhao, Nguyen Bach, Ian R. Lane, Stephan Vogel:
A Log-Linear Block Transliteration Model based on Bi-Stream HMMs. HLT-NAACL 2007: 364-371 - 2006
- [b1]Ian R. Lane:
Flexible spoken language understanding based on topic classification and domain detection. Kyoto University, Japan, 2006 - [j2]Ian R. Lane, Tatsuya Kawahara
:
Verification of Speech Recognition Results Incorporating In-domain Confidence and Discourse Coherence Measures. IEICE Trans. Inf. Syst. 89-D(3): 931-938 (2006) - [c8]Matthias Eck, Ian R. Lane, Nguyen Bach, Sanjika Hewavitharana, Muntsin Kolss, Bing Zhao, Almut Silja Hildebrand, Stephan Vogel, Alex Waibel:
The UKA/CMU statistical machine translation system for IWSLT 2006. IWSLT 2006: 130-137 - 2005
- [j1]Ian R. Lane, Tatsuya Kawahara, Tomoko Matsui, Satoshi Nakamura:
Dialogue Speech Recognition by Combining Hierarchical Topic Classification and Language Model Switching. IEICE Trans. Inf. Syst. 88-D(3): 446-454 (2005) - [c7]Ian R. Lane, Tatsuya Kawahara
:
Incorporating Dialogue Context and Topic Clustering in Out-of-Domain Detection. ICASSP (1) 2005: 1045-1048 - [c6]Ian R. Lane, Tatsuya Kawahara:
Utterance verification incorporating in-domain confidence and discourse coherence measures. INTERSPEECH 2005: 421-424 - 2004
- [c5]Ian R. Lane, Tatsuya Kawahara, Tomoko Matsui, Satoshi Nakamura:
Out-of-domain detection based on confidence measures from multiple topic classification. ICASSP (1) 2004: 757-760 - [c4]Tatsuya Kawahara, Ian Richard Lane, Tomoko Matsui, Satoshi Nakamura:
Topic classification and verification modeling for out-of-domain utterance detection. INTERSPEECH 2004 - [c3]Ian Richard Lane, Tatsuya Kawahara, Shinichi Ueno:
Example-based training of dialogue planning incorporating user and situation models. INTERSPEECH 2004 - 2003
- [c2]Ian R. Lane, Tatsuya Kawahara, Tomoko Matsui:
Language model switching based on topic detection for dialog speech recognition. ICASSP (1) 2003: 616-619 - [c1]Ian R. Lane, Tatsuya Kawahara, Tomoko Matsui, Satoshi Nakamura:
Hierarchical topic classification for dialog speech recognition based on language model switching. INTERSPEECH 2003