


Остановите войну!
for scientists:


default search action
Kai Yu 0004
Person information

- affiliation: Shanghai Jiao Tong University, Computer Science and Engineering Department, China
- affiliation (PhD 2006): Cambridge University, Engineering Department, UK
Other persons with the same name
- Kai Yu — disambiguation page
- Kai Yu 0001 — Baidu Inc., Institute of Deep Learning, Beijing, China (and 3 more)
- Kai Yu 0002 — Royal Institute of Technology, Stockholm, Sweden
- Kai Yu 0003
— University of Minnesota, Department of Biomedical Engineering, Minneapolis, MN, USA
- Kai Yu 0005
— Zhejiang University, State Key Laboratory of Industrial Control Technology, Hangzhou, China
- Kai Yu 0006
— Beijing Normal University, School of Geography, China (and 1 more)
- Kai Yu 0007
— Nanjing University of Information Science and Technology, School of Marine Science, China (and 1 more)
- Kai Yu 0008
— Guangdong University of Technology, School of Information Engineering, School of Integrated Circuits, China (and 1 more)
- Kai Yu 0009
— Soochow University, School of Electronics and Information Engineering, Jiangsu, China (and 1 more)
- Kai Yu 0010
— Nanjing University, School of Electronic Science and Engineering, China
- Kai Yu 0011
— Sun Yat-Sen University, Cancer Center, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, China
- Kai Yu 0012 — Chinese Academy of Sciences, Shanghai Institute of Microsystem and Information Technology, China
- Kai Yu 0013 — Beihang University, School of Computer Science and Engineering, State Key Laboratory of Software Development Environment, China
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [i63]Jieyu Li, Lu Chen, Ruisheng Cao, Su Zhu, Hongshen Xu, Zhi Chen, Hanchong Zhang, Kai Yu:
On the Structural Generalization in Text-to-SQL. CoRR abs/2301.04790 (2023) - 2022
- [j33]Bo Chen
, Zhihang Xu
, Kai Yu
:
Data augmentation based non-parallel voice conversion with frame-level speaker disentangler. Speech Commun. 136: 14-22 (2022) - [j32]Chenpeng Du
, Kai Yu
:
Phone-Level Prosody Modelling With GMM-Based MDN for Diverse and Controllable Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 30: 190-201 (2022) - [j31]Bo Chen
, Chenpeng Du
, Kai Yu
:
Neural Fusion for Voice Cloning. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1993-2001 (2022) - [c173]Guangwei Li, Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
Category-Adapted Sound Event Enhancement with Weakly Labeled Data. ICASSP 2022: 851-855 - [c172]Xuenan Xu, Mengyue Wu, Kai Yu:
Diversity-Controllable and Accurate Audio Captioning Based on Neural Condition. ICASSP 2022: 971-975 - [c171]Guangwei Li, Xuenan Xu, Mengyue Wu, Kai Yu:
Navigating Audio-Visual Event Detection Across Mismatched Modalities. ICASSP 2022: 1975-1979 - [c170]Siyu Lou
, Xuenan Xu, Mengyue Wu, Kai Yu:
Audio-Text Retrieval in Context. ICASSP 2022: 4793-4797 - [c169]Wen Wu
, Mengyue Wu, Kai Yu:
Climate and Weather: Inspecting Depression Detection via Emotion Recognition. ICASSP 2022: 6262-6266 - [c168]Yu Xi, Tian Tan, Wangyou Zhang, Baochen Yang, Kai Yu:
Text Adaptive Detection for Customizable Keyword Spotting. ICASSP 2022: 6652-6656 - [c167]Yiwei Guo, Chenpeng Du
, Kai Yu:
Unsupervised Word-Level Prosody Tagging for Controllable Speech Synthesis. ICASSP 2022: 7597-7601 - [c166]Tao Liu, Shuai Fan, Xu Xiang
, Hongbo Song, Shaoxiong Lin, Jiaqi Sun, Tianyuan Han, Siyuan Chen, Binwei Yao, Sen Liu, Yifei Wu, Yanmin Qian, Kai Yu:
MSDWild: Multi-modal Speaker Diarization Dataset in the Wild. INTERSPEECH 2022: 1476-1480 - [c165]Chenpeng Du, Yiwei Guo, Xie Chen, Kai Yu:
VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature. INTERSPEECH 2022: 1596-1600 - [c164]Qinpei Zhu, Renshou Wu, Guangfeng Liu, Xinyu Zhu, Xingyu Chen, Yang Zhou, Qingliang Miao, Rui Wang, Kai Yu:
The AISP-SJTU Simultaneous Translation System for IWSLT 2022. IWSLT@ACL 2022: 208-215 - [c163]Zihan Zhao, Lu Chen, Ruisheng Cao, Hongshen Xu, Xingyu Chen, Kai Yu:
TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages. NAACL-HLT 2022: 1808-1821 - [c162]Zhi Chen, Lu Chen, Bei Chen, Libo Qin, Yuncong Liu, Su Zhu, Jian-Guang Lou, Kai Yu:
UniDU: Towards A Unified Generative Dialogue Understanding Framework. SIGDIAL 2022: 442-455 - [i62]Yiwei Guo, Chenpeng Du, Kai Yu:
Unsupervised word-level prosody tagging for controllable speech synthesis. CoRR abs/2202.07200 (2022) - [i61]Siyu Lou, Xuenan Xu, Mengyue Wu, Kai Yu:
Audio-text Retrieval in Context. CoRR abs/2203.13645 (2022) - [i60]Chenpeng Du, Yiwei Guo, Xie Chen, Kai Yu:
VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature. CoRR abs/2204.00768 (2022) - [i59]Zhi Chen, Lu Chen, Bei Chen, Libo Qin, Yuncong Liu, Su Zhu, Jian-Guang Lou, Kai Yu:
UniDU: Towards A Unified Generative Dialogue Understanding Framework. CoRR abs/2204.04637 (2022) - [i58]Wen Wu, Mengyue Wu, Kai Yu:
Climate and Weather: Inspecting Depression Detection via Emotion Recognition. CoRR abs/2204.14099 (2022) - [i57]Xuenan Xu, Mengyue Wu, Kai Yu:
A Comprehensive Survey of Automated Audio Captioning. CoRR abs/2205.05357 (2022) - [i56]Zihan Zhao, Lu Chen, Ruisheng Cao, Hongshen Xu, Xingyu Chen, Kai Yu:
TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages. CoRR abs/2205.06435 (2022) - [i55]Liangtai Sun, Xingyu Chen, Lu Chen, Tianle Dai, Zichen Zhu, Kai Yu:
META-GUI: Towards Multi-modal Conversational Agents on Mobile GUI. CoRR abs/2205.11029 (2022) - [i54]Binwei Yao, Chao Shi, Likai Zou, Lingfeng Dai, Mengyue Wu, Lu Chen, Zhen Wang, Kai Yu:
D4: a Chinese Dialogue Dataset for Depression-Diagnosis-Oriented Chat. CoRR abs/2205.11764 (2022) - [i53]Zhi Chen, Jijia Bao, Lu Chen, Yuncong Liu, Da Ma, Bei Chen, Mengyue Wu, Su Zhu, Jian-Guang Lou, Kai Yu:
DialogZoo: Large-Scale Dialog-Oriented Task Learning. CoRR abs/2205.12662 (2022) - [i52]Zhi Chen, Yuncong Liu, Lu Chen, Su Zhu, Mengyue Wu, Kai Yu:
OPAL: Ontology-Aware Pretrained Language Model for End-to-End Task-Oriented Dialogue. CoRR abs/2209.04595 (2022) - [i51]Yiwei Guo, Chenpeng Du, Xie Chen, Kai Yu:
EmoDiff: Intensity Controllable Emotional Text-to-Speech with Soft-Label Guidance. CoRR abs/2211.09496 (2022) - 2021
- [j30]Heinrich Dinkel
, Mengyue Wu, Kai Yu
:
Towards Duration Robust Weakly Supervised Sound Event Detection. IEEE ACM Trans. Audio Speech Lang. Process. 29: 887-900 (2021) - [j29]Heinrich Dinkel
, Shuai Wang
, Xuenan Xu, Mengyue Wu
, Kai Yu
:
Voice Activity Detection in the Wild: A Data-Driven Approach Using Teacher-Student Training. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1542-1555 (2021) - [c161]Boer Lyu, Lu Chen, Su Zhu, Kai Yu:
LET: Linguistic Knowledge Enhanced Graph Transformer for Chinese Short Text Matching. AAAI 2021: 13498-13506 - [c160]Ruisheng Cao
, Lu Chen, Zhi Chen, Yanbin Zhao, Su Zhu, Kai Yu:
LGESQL: Line Graph Enhanced Text-to-SQL Model with Mixed Local and Non-Local Relations. ACL/IJCNLP (1) 2021: 2541-2555 - [c159]Zhi Chen, Lu Chen, Hanqi Li, Ruisheng Cao
, Da Ma, Mengyue Wu, Kai Yu:
Decoupled Dialogue Modeling and Semantic Parsing for Multi-Turn Text-to-SQL. ACL/IJCNLP (Findings) 2021: 3063-3074 - [c158]Xingyu Chen, Zihan Zhao, Lu Chen, Jiabao Ji, Danyang Zhang, Ao Luo, Yuxuan Xiong, Kai Yu:
WebSRC: A Dataset for Web-Based Structural Reading Comprehension. EMNLP (1) 2021: 4173-4185 - [c157]Boer Lyu, Lu Chen, Kai Yu:
Glyph Enhanced Chinese Character Pre-Training for Lexical Sememe Prediction. EMNLP (Findings) 2021: 4549-4555 - [c156]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
Text-to-Audio Grounding: Building Correspondence Between Captions and Sound Events. ICASSP 2021: 606-610 - [c155]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Zeyu Xie, Kai Yu:
Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning. ICASSP 2021: 905-909 - [c154]Chenpeng Du
, Bing Han, Shuai Wang, Yanmin Qian, Kai Yu:
SynAug: Synthesis-Based Data Augmentation for Text-Dependent Speaker Verification. ICASSP 2021: 5844-5848 - [c153]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
A Lightweight Framework for Online Voice Activity Detection in the Wild. Interspeech 2021: 371-375 - [c152]Chenpeng Du
, Kai Yu:
Rich Prosody Diversity Modelling with Phone-Level Mixture Density Network. Interspeech 2021: 3136-3140 - [c151]Shuai Wang, Yexin Yang, Yanmin Qian, Kai Yu:
Revisiting the Statistics Pooling Layer in Deep Speaker Embedding Learning. ISCSLP 2021: 1-5 - [c150]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
Audio Caption in a Car Setting with a Sentence-Level Loss. ISCSLP 2021: 1-5 - [c149]Pingyue Zhang, Mengyue Wu, Heinrich Dinkel, Kai Yu:
DEPA: Self-Supervised Audio Embedding for Depression Detection. ACM Multimedia 2021: 135-143 - [c148]Zhi Chen, Lu Chen, Yanbin Zhao, Ruisheng Cao
, Zihan Xu, Su Zhu, Kai Yu:
ShadowGNN: Graph Projection Neural Network for Text-to-SQL Parser. NAACL-HLT 2021: 5567-5577 - [c147]Su Zhu, Lu Chen, Ruisheng Cao, Zhi Chen, Qingliang Miao, Kai Yu:
Few-Shot NLU with Vector Projection Distance and Abstract Triangular CRF. NLPCC (1) 2021: 505-516 - [c146]Yao Zhao, Lu Chen, Kai Yu:
Relation-Aware Multi-hop Reasoning forVisual Dialog. NLPCC (1) 2021: 810-821 - [i50]Heinrich Dinkel, Mengyue Wu, Kai Yu:
Towards duration robust weakly supervised sound event detection. CoRR abs/2101.07687 (2021) - [i49]Lu Chen, Xingyu Chen, Zihan Zhao, Danyang Zhang, Jiabao Ji, Ao Luo, Yuxuan Xiong, Kai Yu:
WebSRC: A Dataset for Web-Based Structural Reading Comprehension. CoRR abs/2101.09465 (2021) - [i48]Chenpeng Du, Kai Yu:
Mixture Density Network for Phone-Level Prosody Modelling in Speech Synthesis. CoRR abs/2102.00851 (2021) - [i47]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Zeyu Xie, Kai Yu:
Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning. CoRR abs/2102.11457 (2021) - [i46]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
Text-to-Audio Grounding: Building Correspondence Between Captions and Sound Events. CoRR abs/2102.11474 (2021) - [i45]Boer Lyu, Lu Chen, Su Zhu, Kai Yu:
LET: Linguistic Knowledge Enhanced Graph Transformer for Chinese Short Text Matching. CoRR abs/2102.12671 (2021) - [i44]Zhi Chen, Lu Chen, Yanbin Zhao, Ruisheng Cao, Zihan Xu, Su Zhu, Kai Yu:
ShadowGNN: Graph Projection Neural Network for Text-to-SQL Parser. CoRR abs/2104.04689 (2021) - [i43]Heinrich Dinkel, Shuai Wang, Xuenan Xu, Mengyue Wu, Kai Yu:
Voice activity detection in the wild: A data-driven approach using teacher-student training. CoRR abs/2105.04065 (2021) - [i42]Chenpeng Du, Kai Yu:
Diverse and Controllable Speech Synthesis with GMM-Based Phone-Level Prosody Modelling. CoRR abs/2105.13086 (2021) - [i41]Ruisheng Cao, Lu Chen, Zhi Chen, Yanbin Zhao, Su Zhu, Kai Yu:
LGESQL: Line Graph Enhanced Text-to-SQL Model with Mixed Local and Non-Local Relations. CoRR abs/2106.01093 (2021) - [i40]Zhi Chen, Lu Chen, Hanqi Li, Ruisheng Cao, Da Ma, Mengyue Wu, Kai Yu:
Decoupled Dialogue Modeling and Semantic Parsing for Multi-Turn Text-to-SQL. CoRR abs/2106.02282 (2021) - [i39]Su Zhu, Lu Chen, Ruisheng Cao, Zhi Chen, Qingliang Miao, Kai Yu:
Few-Shot NLU with Vector Projection Distance and Abstract Triangular CRF. CoRR abs/2112.04999 (2021) - 2020
- [j28]Fei Wu, Cewu Lu
, Mingjie Zhu, Hao Chen
, Jun Zhu, Kai Yu
, Lei Li, Ming Li, Qianfeng Chen, Xi Li, Xudong Cao, Zhongyuan Wang, Zhengjun Zha
, Yueting Zhuang, Yunhe Pan
:
Towards a new generation of artificial intelligence in China. Nat. Mach. Intell. 2(6): 312-316 (2020) - [j27]Su Zhu
, Zijian Zhao, Rao Ma
, Kai Yu
:
Prior Knowledge Driven Label Embedding for Slot Filling in Natural Language Understanding. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1440-1451 (2020) - [j26]Su Zhu
, Ruisheng Cao
, Kai Yu
:
Dual Learning for Semi-Supervised Natural Language Understanding. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1936-1947 (2020) - [j25]Qi Liu
, Zhehuai Chen
, Hao Li, Mingkun Huang, Yizhou Lu, Kai Yu
:
Modular End-to-End Automatic Speech Recognition Framework for Acoustic-to-Word Model. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2174-2183 (2020) - [j24]Zhi Chen
, Lu Chen, Xiaoyuan Liu, Kai Yu
:
Distributed Structured Actor-Critic Reinforcement Learning for Universal Dialogue Management. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2400-2411 (2020) - [j23]Kai Yu
, Rao Ma
, Kaiyu Shi, Qi Liu
:
Neural Network Language Model Compression With Product Quantization and Soft Binarization. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2438-2449 (2020) - [j22]Shuai Wang
, Yexin Yang
, Zhanghao Wu
, Yanmin Qian
, Kai Yu
:
Data Augmentation Using Deep Generative Models for Embedding Based Speaker Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2598-2609 (2020) - [c145]Lu Chen, Boer Lv, Chi Wang, Su Zhu, Bowen Tan, Kai Yu:
Schema-Guided Multi-Domain Dialogue State Tracking with Graph Attention Neural Networks. AAAI 2020: 7521-7528 - [c144]Yanbin Zhao, Lu Chen, Zhi Chen, Kai Yu:
Semi-Supervised Text Simplification with Back-Translation and Asymmetric Denoising Autoencoders. AAAI 2020: 9668-9675 - [c143]Yanbin Zhao, Lu Chen, Zhi Chen, Ruisheng Cao
, Su Zhu, Kai Yu:
Line Graph Enhanced AMR-to-Text Generation with Mix-Order Graph Attention Networks. ACL 2020: 732-741 - [c142]Lu Chen, Yanbin Zhao, Boer Lyu, Lesheng Jin, Zhi Chen, Su Zhu, Kai Yu:
Neural Graph Matching Networks for Chinese Short Text Matching. ACL 2020: 6152-6158 - [c141]Ruisheng Cao
, Su Zhu, Chenyu Yang, Chen Liu, Rao Ma, Yanbin Zhao, Lu Chen, Kai Yu:
Unsupervised Dual Paraphrasing for Two-stage Semantic Parsing. ACL 2020: 6806-6817 - [c140]Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Kai Yu:
A CRNN-GRU Based Reinforcement Learning Approach to Audio Captioning. DCASE 2020: 225-229 - [c139]Su Zhu, Jieyu Li, Lu Chen, Kai Yu:
Efficient Context and Schema Fusion Networks for Multi-Domain Dialogue State Tracking. EMNLP (Findings) 2020: 766-781 - [c138]Heinrich Dinkel, Kai Yu:
Duration Robust Weakly Supervised Sound Event Detection. ICASSP 2020: 311-315 - [c137]Yexin Yang, Shuai Wang, Xun Gong, Yanmin Qian, Kai Yu:
Text Adaptation for Speaker Verification with Speaker-Text Factorized Embeddings. ICASSP 2020: 6454-6458 - [c136]Zhengyang Chen, Shuai Wang, Yanmin Qian, Kai Yu:
Channel Invariant Speaker Embedding Learning with Joint Multi-Task and Adversarial Training. ICASSP 2020: 6574-6578 - [c135]Shuai Wang, Johan Rohdin
, Oldrich Plchot, Lukás Burget
, Kai Yu, Jan Cernocký
:
Investigation of Specaugment for Deep Speaker Embedding Learning. ICASSP 2020: 7139-7143 - [c134]Chenpeng Du
, Kai Yu:
Speaker Augmentation for Low Resource Speech Recognition. ICASSP 2020: 7719-7723 - [c133]Rao Ma, Hao Li, Qi Liu
, Lu Chen, Kai Yu:
Neural Lattice Search for Speech Recognition. ICASSP 2020: 7794-7798 - [c132]Jieyu Li, Su Zhu, Kai Yu:
A Hierarchical Tracker for Multi-Domain Dialogue State Tracking. ICASSP 2020: 8014-8018 - [c131]Rao Ma, Lesheng Jin, Qi Liu
, Lu Chen, Kai Yu:
Addressing the Polysemy Problem in Language Modeling with Attentional Multi-Sense Embeddings. ICASSP 2020: 8129-8133 - [c130]Han Zhao, Weihao Cui, Quan Chen, Jingwen Leng, Kai Yu, Deze Zeng, Chao Li, Minyi Guo:
CODA: Improving Resource Utilization by Slimming and Co-locating DNN and CPU Jobs. ICDCS 2020: 853-863 - [c129]Chen Liu, Su Zhu, Zijian Zhao, Ruisheng Cao
, Lu Chen, Kai Yu:
Jointly Encoding Word Confusion Network and Dialogue Context with BERT for Spoken Language Understanding. INTERSPEECH 2020: 871-875 - [c128]Hongji Wang, Heinrich Dinkel, Shuai Wang, Yanmin Qian, Kai Yu:
Dual-Adversarial Domain Adaptation for Generalized Replay Attack Detection. INTERSPEECH 2020: 1086-1090 - [c127]Yefei Chen, Heinrich Dinkel, Mengyue Wu, Kai Yu:
Voice Activity Detection in the Wild via Weakly Supervised Sound Event Detection. INTERSPEECH 2020: 3665-3669 - [c126]Zihan Xu, Zhi Chen, Lu Chen, Su Zhu, Kai Yu:
Memory Attention Neural Network for Multi-domain Dialogue State Tracking. NLPCC (1) 2020: 41-52 - [c125]Chen Liu, Su Zhu, Lu Chen, Kai Yu:
Robust Spoken Language Understanding with RL-Based Value Error Recovery. NLPCC (1) 2020: 78-90 - [c124]Zihan Zhao, Yuncong Liu, Lu Chen, Qi Liu, Rao Ma, Kai Yu:
An Investigation on Different Underlying Quantization Schemes for Pre-trained Language Models. NLPCC (1) 2020: 359-371 - [i38]Su Zhu, Zijian Zhao, Rao Ma, Kai Yu:
Prior Knowledge Driven Label Embedding for Slot Filling in Natural Language Understanding. CoRR abs/2003.09831 (2020) - [i37]Heinrich Dinkel, Yefei Chen, Mengyue Wu, Kai Yu:
GPVAD: Towards noise robust voice activity detection via weakly supervised sound event detection. CoRR abs/2003.12222 (2020) - [i36]Su Zhu, Jieyu Li, Lu Chen, Kai Yu:
Efficient Context and Schema Fusion Networks for Multi-Domain Dialogue State Tracking. CoRR abs/2004.03386 (2020) - [i35]Su Zhu, Ruisheng Cao, Kai Yu:
Dual Learning for Semi-Supervised Natural Language Understanding. CoRR abs/2004.12299 (2020) - [i34]Yanbin Zhao, Lu Chen, Zhi Chen, Kai Yu:
Semi-Supervised Text Simplification with Back-Translation and Asymmetric Denoising Autoencoders. CoRR abs/2004.14693 (2020) - [i33]Chen Liu, Su Zhu, Zijian Zhao, Ruisheng Cao, Lu Chen, Kai Yu:
Jointly Encoding Word Confusion Network and Dialogue Context with BERT for Spoken Language Understanding. CoRR abs/2005.11640 (2020) - [i32]Ruisheng Cao, Su Zhu, Chenyu Yang, Chen Liu, Rao Ma, Yanbin Zhao, Lu Chen, Kai Yu:
Unsupervised Dual Paraphrasing for Two-stage Semantic Parsing. CoRR abs/2005.13485 (2020) - [i31]Heinrich Dinkel, Nanxin Chen, Yanmin Qian, Kai Yu:
End-to-end spoofing detection with raw waveform CLDNNs. CoRR abs/2007.13060 (2020) - [i30]Qi Liu, Zhehuai Chen, Hao Li, Mingkun Huang, Yizhou Lu, Kai Yu:
Modular End-to-end Automatic Speech Recognition Framework for Acoustic-to-word Model. CoRR abs/2008.00953 (2020) - [i29]Qi Liu, Tian Tan, Kai Yu:
An Investigation on Deep Learning with Beta Stabilizer. CoRR abs/2008.01173 (2020) - [i28]Qi Liu, Yanmin Qian, Kai Yu:
Future Vector Enhanced LSTM Language Model for LVCSR. CoRR abs/2008.01832 (2020) - [i27]Chen Liu, Su Zhu, Lu Chen, Kai Yu:
Robust Spoken Language Understanding with RL-based Value Error Recovery. CoRR abs/2009.03095 (2020) - [i26]Su Zhu, Ruisheng Cao, Lu Chen, Kai Yu:
Vector Projection Network for Few-shot Slot Tagging in Natural Language Understanding. CoRR abs/2009.09568 (2020) - [i25]Yefei Chen, Shuai Wang, Yanmin Qian, Kai Yu:
End-to-End Speaker-Dependent Voice Activity Detection. CoRR abs/2009.09906 (2020) - [i24]Zhi Chen, Lu Chen, Xiang Zhou, Kai Yu:
Deep Reinforcement Learning for On-line Dialogue State Tracking. CoRR abs/2009.10321 (2020) - [i23]Zhi Chen, Lu Chen, Xiaoyuan Liu, Kai Yu:
Distributed Structured Actor-Critic Reinforcement Learning for Universal Dialogue Management. CoRR abs/2009.10326 (2020) - [i22]Zhi Chen, Xiaoyuan Liu, Lu Chen, Kai Yu:
Structured Hierarchical Dialogue Policy with Graph Neural Networks. CoRR abs/2009.10355 (2020) - [i21]Zhi Chen, Lu Chen, Yanbin Zhao, Su Zhu, Kai Yu:
Dual Learning for Dialogue State Tracking. CoRR abs/2009.10430 (2020) - [i20]Zhi Chen, Lu Chen, Zihan Xu, Yanbin Zhao, Su Zhu, Kai Yu:
CREDIT: Coarse-to-Fine Sequence Generation for Dialogue State Tracking. CoRR abs/2009.10435 (2020) - [i19]Zihan Zhao, Yuncong Liu, Lu Chen, Qi Liu, Rao Ma, Kai Yu:
An Investigation on Different Underlying Quantization Schemes for Pre-trained Language Models. CoRR abs/2010.07109 (2020)
2010 – 2019
- 2019
- [j21]Lu Chen
, Zhi Chen
, Bowen Tan, Sishan Long, Milica Gasic
, Kai Yu
:
AgentGraph: Toward Universal Dialogue Management With Structured Deep Reinforcement Learning. IEEE ACM Trans. Audio Speech Lang. Process. 27(9): 1378-1391 (2019) - [j20]Shuai Wang
, Zili Huang, Yanmin Qian
, Kai Yu
:
Discriminative Neural Embedding Learning for Short-Duration Text-Independent Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 27(11): 1686-1696 (2019) - [c123]Ruisheng Cao
, Su Zhu, Chen Liu, Jieyu Li, Kai Yu:
Semantic Parsing with Dual Learning. ACL (1) 2019: 51-64 - [c122]Xu Xiang
, Shuai Wang, Houjun Huang, Yanmin Qian, Kai Yu:
Margin Matters: Towards More Discriminative Deep Neural Network Embeddings for Speaker Recognition. APSIPA 2019: 1652-1656 - [c121]Rao Ma, Qi Liu
, Kai Yu:
Highly Efficient Neural Network Language Model Compression Using Soft Binarization Training. ASRU 2019: 62-69 - [c120]Mingkun Huang, Yizhou Lu, Lan Wang, Yanmin Qian, Kai Yu:
Exploring Model Units and Training Strategies for End-to-End Speech Recognition. ASRU 2019: 524-531 - [c119]Zijian Zhao, Su Zhu, Kai Yu:
Data Augmentation with Atomic Templates for Spoken Language Understanding. EMNLP/IJCNLP (1) 2019: 3635-3641 - [c118]Mengyue Wu, Heinrich Dinkel, Kai Yu:
Audio Caption: Listen and Tell. ICASSP 2019: 830-834 - [c117]Shuai Wang, Yexin Yang, Tianzhe Wang, Yanmin Qian, Kai Yu:
Knowledge Distillation for Small Foot-print Deep Speaker Embedding. ICASSP 2019: 6021-6025 - [c116]Xuankai Chang, Yanmin Qian, Kai Yu, Shinji Watanabe
:
End-to-end Monaural Multi-speaker ASR System without Pretraining. ICASSP 2019: 6256-6260 - [c115]Zijian Zhao, Su Zhu, Kai Yu:
A Hierarchical Decoding Model for Spoken Language Understanding from Unaligned Data. ICASSP 2019: 7305-7309 - [c114]Su Zhu, Zijian Zhao, Tiejun Zhao, Chengqing Zong, Kai Yu:
CATSLU: The 1st Chinese Audio-Textual Spoken Language Understanding Challenge. ICMI 2019: 521-525 - [c113]