default search action
Ning Cheng 0001
Person information
- affiliation: Ping An Technology (Shenzhen) Co., Ltd., China
- affiliation (former): Chinese Academy of Sciences, Institute of Automation, Beijing, China
- affiliation (former): Chinese Academy of Sciences, Shenzhen Institute of Advanced Technology, China
- affiliation (PhD 2009): University of the Chinese Academy of Sciences (UCAS), Beijing, China
Other persons with the same name
- Ning Cheng — disambiguation page
- Ning Cheng 0002 — Futurewei Technologies, Bridgewater, NJ, USA (and 3 more)
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [c101]Ming Li, Yong Zhang, Shwai He, Zhitao Li, Hongyu Zhao, Jianzong Wang, Ning Cheng, Tianyi Zhou:
Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning. ACL (1) 2024: 14255-14273 - [c100]Haoxiang Shi, Jianzong Wang, Xulong Zhang, Ning Cheng, Jun Yu, Jing Xiao:
RSET: Remapping-Based Sorting Method for Emotion Transfer Speech Synthesis. APWeb/WAIM (1) 2024: 90-104 - [c99]Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
Medical Speech Symptoms Classification via Disentangled Representation. CSCWD 2024: 1110-1115 - [c98]Yimin Deng, Huaizhen Tang, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang:
Learning Disentangled Speech Representations with Contrastive Learning and Time-Invariant Retrieval. ICASSP 2024: 7150-7154 - [c97]Bingyuan Zhang, Xulong Zhang, Ning Cheng, Jun Yu, Jing Xiao, Jianzong Wang:
EmoTalker: Emotionally Editable Talking Face Generation via Diffusion Model. ICASSP 2024: 8276-8280 - [c96]Haobin Tang, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang:
ED-TTS: Multi-Scale Emotion Modeling Using Cross-Domain Emotion Diarization for Emotional Speech Synthesis. ICASSP 2024: 12146-12150 - [c95]Yong Zhang, Hanzhang Li, Zhitao Li, Ning Cheng, Ming Li, Jing Xiao, Jianzong Wang:
Leveraging Biases in Large Language Models: "bias-kNN" for Effective Few-Shot Learning. ICASSP 2024: 12546-12550 - [c94]Jianzong Wang, Haoxiang Shi, Kaiyi Luo, Xulong Zhang, Ning Cheng, Jing Xiao:
RREH: Reconstruction Relations Embedded Hashing for Semi-paired Cross-Modal Retrieval. ICIC (LNAI 5) 2024: 374-385 - [c93]Haoxiang Shi, Xulong Zhang, Ning Cheng, Yong Zhang, Jun Yu, Jing Xiao, Jianzong Wang:
Enhancing Emotion Recognition in Conversation Through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning. ICIC (LNAI 3) 2024: 391-401 - [c92]Yimin Deng, Jianzong Wang, Xulong Zhang, Ning Cheng, Jing Xiao:
Learning Expressive Disentangled Speech Representations with Soft Speech Units and Adversarial Style Augmentation. IJCNN 2024: 1-7 - [c91]Pengcheng Li, Jianzong Wang, Xulong Zhang, Yong Zhang, Jing Xiao, Ning Cheng:
MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion. IJCNN 2024: 1-7 - [c90]Ziqi Liang, Jianzong Wang, Xulong Zhang, Yong Zhang, Ning Cheng, Jing Xiao:
EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with IFUB Estimator and Joint Text-Guided Consistent Learning. IJCNN 2024: 1-7 - [c89]Sheng Ouyang, Jianzong Wang, Yong Zhang, Zhitao Li, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering. IJCNN 2024: 1-7 - [c88]Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
ConTuner: Singing Voice Beautifying with Pitch and Expressiveness Condition. IJCNN 2024: 1-6 - [c87]Jianzong Wang, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization. IJCNN 2024: 1-7 - [c86]Ming Li, Yong Zhang, Zhitao Li, Jiuhai Chen, Lichang Chen, Ning Cheng, Jianzong Wang, Tianyi Zhou, Jing Xiao:
From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning. NAACL-HLT 2024: 7602-7635 - [i78]Bingyuan Zhang, Xulong Zhang, Ning Cheng, Jun Yu, Jing Xiao, Jianzong Wang:
EmoTalker: Emotionally Editable Talking Face Generation via Diffusion Model. CoRR abs/2401.08049 (2024) - [i77]Yimin Deng, Huaizhen Tang, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang:
Learning Disentangled Speech Representations with Contrastive Learning and Time-Invariant Retrieval. CoRR abs/2401.08096 (2024) - [i76]Haobin Tang, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang:
ED-TTS: Multi-Scale Emotion Modeling using Cross-Domain Emotion Diarization for Emotional Speech Synthesis. CoRR abs/2401.08166 (2024) - [i75]Yong Zhang, Hanzhang Li, Zhitao Li, Ning Cheng, Ming Li, Jing Xiao, Jianzong Wang:
Leveraging Biases in Large Language Models: "bias-kNN" for Effective Few-Shot Learning. CoRR abs/2401.09783 (2024) - [i74]Ming Li, Yong Zhang, Shwai He, Zhitao Li, Hongyu Zhao, Jianzong Wang, Ning Cheng, Tianyi Zhou:
Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning. CoRR abs/2402.00530 (2024) - [i73]Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
Medical Speech Symptoms Classification via Disentangled Representation. CoRR abs/2403.05000 (2024) - [i72]Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
CONTUNER: Singing Voice Beautifying with Pitch and Expressiveness Condition. CoRR abs/2404.19187 (2024) - [i71]Ziqi Liang, Jianzong Wang, Xulong Zhang, Yong Zhang, Ning Cheng, Jing Xiao:
EAD-VC: Enhancing Speech Auto-Disentanglement for Voice Conversion with IFUB Estimator and Joint Text-Guided Consistent Learning. CoRR abs/2404.19212 (2024) - [i70]Jianzong Wang, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
EfficientASR: Speech Recognition Network Compression via Attention Redundancy and Chunk-Level FFN Optimization. CoRR abs/2404.19214 (2024) - [i69]Sheng Ouyang, Jianzong Wang, Yong Zhang, Zhitao Li, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering. CoRR abs/2404.19316 (2024) - [i68]Yimin Deng, Jianzong Wang, Xulong Zhang, Ning Cheng, Jing Xiao:
Learning Expressive Disentangled Speech Representations with Soft Speech Units and Adversarial Style Augmentation. CoRR abs/2405.00603 (2024) - [i67]Pengcheng Li, Jianzong Wang, Xulong Zhang, Yong Zhang, Jing Xiao, Ning Cheng:
MAIN-VC: Lightweight Speech Representation Disentanglement for One-shot Voice Conversion. CoRR abs/2405.00930 (2024) - [i66]Haoxiang Shi, Jianzong Wang, Xulong Zhang, Ning Cheng, Jun Yu, Jing Xiao:
RSET: Remapping-based Sorting Method for Emotion Transfer Speech Synthesis. CoRR abs/2405.17028 (2024) - [i65]Jianzong Wang, Haoxiang Shi, Kaiyi Luo, Xulong Zhang, Ning Cheng, Jing Xiao:
RREH: Reconstruction Relations Embedded Hashing for Semi-Paired Cross-Modal Retrieval. CoRR abs/2405.17777 (2024) - [i64]Haoxiang Shi, Xulong Zhang, Ning Cheng, Yong Zhang, Jun Yu, Jing Xiao, Jianzong Wang:
Enhancing Emotion Recognition in Conversation through Emotional Cross-Modal Fusion and Inter-class Contrastive Learning. CoRR abs/2405.17900 (2024) - [i63]Haoyan Yang, Zhitao Li, Yong Zhang, Jianzong Wang, Ning Cheng, Ming Li, Jing Xiao:
PFID: Privacy First Inference Delegation Framework for LLMs. CoRR abs/2406.12238 (2024) - 2023
- [c85]Tong Ye, Shijing Si, Jianzong Wang, Ning Cheng, Zhitao Li, Jing Xiao:
On the Calibration and Uncertainty with Pólya-Gamma Augmentation for Dialog Retrieval Models. AAAI 2023: 13923-13931 - [c84]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Voice Conversion with Denoising Diffusion Probabilistic GAN Models. ADMA (4) 2023: 154-167 - [c83]Kexin Zhu, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Symbolic and Acoustic: Multi-domain Music Emotion Modeling for Instrumental Music. ADMA (4) 2023: 168-181 - [c82]Xulong Zhang, Jianzong Wang, Ning Cheng, Yifu Sun, Chuanyao Zhang, Jing Xiao:
Machine Unlearning Methodology Based on Stochastic Teacher Network. ADMA (5) 2023: 250-261 - [c81]Jianzong Wang, Yimin Deng, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
CP-EB: Talking Face Generation with Controllable Pose and Eye Blinking Embedding. ISPA/BDCloud/SocialCom/SustainCom 2023: 752-757 - [c80]Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation. ISPA/BDCloud/SocialCom/SustainCom 2023: 923-928 - [c79]Yimin Deng, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control and Contrastive Learning with Negative Samples Augmentation. ISPA/BDCloud/SocialCom/SustainCom 2023: 1143-1148 - [c78]Haoyan Yang, Zhitao Li, Yong Zhang, Jianzong Wang, Ning Cheng, Ming Li, Jing Xiao:
PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter. EMNLP 2023: 5364-5375 - [c77]Ganghui Ru, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving Music Genre Classification from multi-modal Properties of Music and Genre Correlations Perspective. ICASSP 2023: 1-5 - [c76]Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Learning Speech Representations with Flexible Hidden Feature Dimensions. ICASSP 2023: 1-5 - [c75]Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
VQ-CL: Learning Disentangled Speech Representations with Contrastive Learning and Vector Quantization. ICASSP 2023: 1-5 - [c74]Haobin Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis. ICASSP 2023: 1-5 - [c73]Tong Ye, Zhitao Li, Jianzong Wang, Ning Cheng, Jing Xiao:
Efficient Uncertainty Estimation with Gaussian Process for Reliable Dialog Response Retrieval. ICASSP 2023: 1-5 - [c72]Xulong Zhang, Haobin Tang, Jianzong Wang, Ning Cheng, Jian Luo, Jing Xiao:
Dynamic Alignment Mask CTC: Improved Mask CTC With Aligned Cross Entropy. ICASSP 2023: 1-5 - [c71]Kexin Zhu, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving EEG-based Emotion Recognition by Fusing Time-Frequency and Spatial Representations. ICASSP 2023: 1-5 - [c70]Yazhong Si, Xulong Zhang, Fan Yang, Jianzong Wang, Ning Cheng, Jing Xiao:
AOSR-Net: All-in-One Sandstorm Removal Network. ICTAI 2023: 641-645 - [c69]Jianzong Wang, Xulong Zhang, Aolan Sun, Ning Cheng, Jing Xiao:
FastGraphTTS: An Ultrafast Syntax-Aware Speech Synthesis Framework. ICTAI 2023: 905-912 - [c68]Kaiyi Luo, Xulong Zhang, Jianzong Wang, Huaxiong Li, Ning Cheng, Jing Xiao:
Contrastive Latent Space Reconstruction Learning for Audio-Text Retrieval. ICTAI 2023: 913-917 - [c67]Jianzong Wang, Xulong Zhang, Haobin Tang, Aolan Sun, Ning Cheng, Jing Xiao:
SAR: Self-Supervised Anti-Distortion Representation for End-To-End Speech Model. IJCNN 2023: 1-7 - [c66]Haobin Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech Synthesis. INTERSPEECH 2023: 12-16 - [c65]Jiaxin Fan, Yong Zhang, Hanzhang Li, Jianzong Wang, Zhitao Li, Sheng Ouyang, Ning Cheng, Jing Xiao:
Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism. INTERSPEECH 2023: 2173-2177 - [c64]Yong Zhang, Zhitao Li, Jianzong Wang, Yiming Gao, Ning Cheng, Fengying Yu, Jing Xiao:
Prompt Guided Copy Mechanism for Conversational Question Answering. INTERSPEECH 2023: 3422-3426 - [c63]Yifu Sun, Xulong Zhang, Jianzong Wang, Ning Cheng, Kaiyu Hu, Jing Xiao:
Investigation of Music Emotion Recognition Based on Segmented Semi-Supervised Learning. INTERSPEECH 2023: 5456-5460 - [c62]Yimin Deng, Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
PMVC: Data Augmentation-Based Prosody Modeling for Expressive Voice Conversion. ACM Multimedia 2023: 184-192 - [i62]Ganghui Ru, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving Music Genre Classification from multi-modal properties of music and genre correlations Perspective. CoRR abs/2303.07667 (2023) - [i61]Haobin Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
QI-TTS: Questioning Intonation Control for Emotional Speech Synthesis. CoRR abs/2303.07682 (2023) - [i60]Xulong Zhang, Haobin Tang, Jianzong Wang, Ning Cheng, Jian Luo, Jing Xiao:
Dynamic Alignment Mask CTC: Improved Mask-CTC with Aligned Cross Entropy. CoRR abs/2303.07687 (2023) - [i59]Tong Ye, Zhitao Li, Jianzong Wang, Ning Cheng, Jing Xiao:
Efficient Uncertainty Estimation with Gaussian Process for Reliable Dialog Response Retrieval. CoRR abs/2303.08599 (2023) - [i58]Tong Ye, Shijing Si, Jianzong Wang, Ning Cheng, Zhitao Li, Jing Xiao:
On the Calibration and Uncertainty with Pólya-Gamma Augmentation for Dialog Retrieval Models. CoRR abs/2303.08606 (2023) - [i57]Kexin Zhu, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving EEG-based Emotion Recognition by Fusing Time-frequency And Spatial Representations. CoRR abs/2303.11421 (2023) - [i56]Jianzong Wang, Xulong Zhang, Haobin Tang, Aolan Sun, Ning Cheng, Jing Xiao:
SAR: Self-Supervised Anti-Distortion Representation for End-To-End Speech Model. CoRR abs/2304.11547 (2023) - [i55]Haobin Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech Synthesis. CoRR abs/2306.00648 (2023) - [i54]Yong Zhang, Zhitao Li, Jianzong Wang, Yiming Gao, Ning Cheng, Fengying Yu, Jing Xiao:
Prompt Guided Copy Mechanism for Conversational Question Answering. CoRR abs/2308.03422 (2023) - [i53]Jiaxin Fan, Yong Zhang, Hanzhang Li, Jianzong Wang, Zhitao Li, Sheng Ouyang, Ning Cheng, Jing Xiao:
Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism. CoRR abs/2308.03423 (2023) - [i52]Yimin Deng, Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
PMVC: Data Augmentation-Based Prosody Modeling for Expressive Voice Conversion. CoRR abs/2308.11084 (2023) - [i51]Ming Li, Yong Zhang, Zhitao Li, Jiuhai Chen, Lichang Chen, Ning Cheng, Jianzong Wang, Tianyi Zhou, Jing Xiao:
From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning. CoRR abs/2308.12032 (2023) - [i50]Kexin Zhu, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Symbolic & Acoustic: Multi-domain Music Emotion Modeling for Instrumental Music. CoRR abs/2308.14317 (2023) - [i49]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Voice Conversion with Denoising Diffusion Probabilistic GAN Models. CoRR abs/2308.14319 (2023) - [i48]Xulong Zhang, Jianzong Wang, Ning Cheng, Yifu Sun, Chuanyao Zhang, Jing Xiao:
Machine Unlearning Methodology base on Stochastic Teacher Network. CoRR abs/2308.14322 (2023) - [i47]Zipeng Qi, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang:
DiffTalker: Co-driven audio-image diffusion for talking faces via intermediate landmarks. CoRR abs/2309.07509 (2023) - [i46]Jianzong Wang, Xulong Zhang, Aolan Sun, Ning Cheng, Jing Xiao:
FastGraphTTS: An Ultrafast Syntax-Aware Speech Synthesis Framework. CoRR abs/2309.08837 (2023) - [i45]Yazhong Si, Xulong Zhang, Fan Yang, Jianzong Wang, Ning Cheng, Jing Xiao:
AOSR-Net: All-in-One Sandstorm Removal Network. CoRR abs/2309.08838 (2023) - [i44]Kaiyi Luo, Xulong Zhang, Jianzong Wang, Huaxiong Li, Ning Cheng, Jing Xiao:
Contrastive Latent Space Reconstruction Learning for Audio-Text Retrieval. CoRR abs/2309.08839 (2023) - [i43]Haoyan Yang, Zhitao Li, Yong Zhang, Jianzong Wang, Ning Cheng, Ming Li, Jing Xiao:
PRCA: Fitting Black-Box Large Language Models for Retrieval Question Answering via Pluggable Reward-Driven Contextual Adapter. CoRR abs/2310.18347 (2023) - [i42]Jianzong Wang, Pengcheng Li, Xulong Zhang, Ning Cheng, Jing Xiao:
DQR-TTS: Semi-supervised Text-to-speech Synthesis with Dynamic Quantized Representation. CoRR abs/2311.07965 (2023) - [i41]Yimin Deng, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
CLN-VC: Text-Free Voice Conversion Based on Fine-Grained Style Control and Contrastive Learning with Negative Samples Augmentation. CoRR abs/2311.08670 (2023) - [i40]Jianzong Wang, Yimin Deng, Ziqi Liang, Xulong Zhang, Ning Cheng, Jing Xiao:
CP-EB: Talking Face Generation with Controllable Pose and Eye Blinking Embedding. CoRR abs/2311.08673 (2023) - 2022
- [c61]Xulong Zhang, Jianzong Wang, Ning Cheng, Edward Xiao, Jing Xiao:
Shallow Diffusion Motion Model for Talking Face Generation from Speech. APWeb/WAIM (2) 2022: 144-157 - [c60]Chuanyao Zhang, Jianzong Wang, Zhangcheng Huang, Lingwei Kong, Xiaoyang Qu, Ning Cheng, Jing Xiao:
Supervised Contrastive Meta-learning for Few-Shot Classification. HPCC/DSS/SmartCity/DependSys 2022: 1736-1742 - [c59]Qiqi Wang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
DRVC: A Framework of Any-to-Any Voice Conversion with Self-Supervised Learning. ICASSP 2022: 3184-3188 - [c58]Botao Zhao, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
nnSpeech: Speaker-Guided Conditional Variational Autoencoder for Zero-Shot Multi-speaker text-to-speech. ICASSP 2022: 4293-4297 - [c57]Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Avqvc: One-Shot Voice Conversion By Vector Quantization With Applying Contrastive Learning. ICASSP 2022: 4613-4617 - [c56]Tong Ye, Shijing Si, Jianzong Wang, Rui Wang, Ning Cheng, Jing Xiao:
VU-BERT: A Unified Framework for Visual Dialog. ICASSP 2022: 6687-6691 - [c55]Yong Zhang, Zhitao Li, Jianzong Wang, Ning Cheng, Jing Xiao:
Self-Attention for Incomplete Utterance Rewriting. ICASSP 2022: 8047-8051 - [c54]Shijing Si, Jianzong Wang, Xulong Zhang, Xiaoyang Qu, Ning Cheng, Jing Xiao:
Boosting StarGANs for Voice Conversion with Contrastive Discriminator. ICONIP (2) 2022: 355-366 - [c53]Denghao Li, Yuqiao Zeng, Jianzong Wang, Lingwei Kong, Zhangcheng Huang, Ning Cheng, Xiaoyang Qu, Jing Xiao:
Blur the Linguistic Boundary: Interpreting Chinese Buddhist Sutra in English via Neural Machine Translation. ICTAI 2022: 228-232 - [c52]Aolan Sun, Xulong Zhang, Tiandong Ling, Jianzong Wang, Ning Cheng, Jing Xiao:
Pre-Avatar: An Automatic Presentation Generation Framework Leveraging Talking Avatar. ICTAI 2022: 1002-1006 - [c51]Jian Luo, Jianzong Wang, Ning Cheng, Haobin Tang, Jing Xiao:
Speech Augmentation Based Unsupervised Learning for Keyword Spotting. IJCNN 2022: 1-7 - [c50]Jian Luo, Jianzong Wang, Ning Cheng, Zhenpeng Zheng, Jing Xiao:
Adaptive Activation Network for Low Resource Multilingual Speech Recognition. IJCNN 2022: 1-7 - [c49]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
SUSing: SU-net for Singing Voice Synthesis. IJCNN 2022: 1-7 - [c48]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
MDCNN-SID: Multi-scale Dilated Convolution Network for Singer Identification. IJCNN 2022: 1-7 - [c47]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTS. IJCNN 2022: 1-7 - [c46]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Singer Identification for Metaverse with Timbral and Middle-Level Perceptual Features. IJCNN 2022: 1-7 - [c45]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
MetaSID: Singer Identification with Domain Adaptation for Metaverse. IJCNN 2022: 1-7 - [c44]Tong Ye, Shijing Si, Jianzong Wang, Ning Cheng, Jing Xiao:
Uncertainty Calibration for Deep Audio Classifiers. INTERSPEECH 2022: 1556-1560 - [c43]Sicheng Yang, Methawee Tantrawenith, Haolin Zhuang, Zhiyong Wu, Aolan Sun, Jianzong Wang, Ning Cheng, Huaizhen Tang, Xintao Zhao, Jie Wang, Helen Meng:
Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion. INTERSPEECH 2022: 2553-2557 - [c42]Jian Luo, Jianzong Wang, Ning Cheng, Edward Xiao, Xulong Zhang, Jing Xiao:
Tiny-Sepformer: A Tiny Time-Domain Transformer Network For Speech Separation. INTERSPEECH 2022: 5313-5317 - [c41]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Adapitch: Adaption Multi-Speaker Text-to-Speech Conditioned on Pitch Disentangling with Untranscribed Data. MSN 2022: 456-460 - [c40]Xulong Zhang, Jianzong Wang, Ning Cheng, Kexin Zhu, Jing Xiao:
Improving Speech Representation Learning via Speech-level and Phoneme-level Masking Approach. MSN 2022: 485-489 - [c39]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
MetaSpeech: Speech Effects Switch Along with Environment for Metaverse. MSN 2022: 841-846 - [c38]Xulong Zhang, Jianzong Wang, Ning Cheng, Mengyuan Zhao, Zhiyong Zhang, Jing Xiao:
Linguistic-Enhanced Transformer with CTC Embedding for Speech Recognition. MSN 2022: 915-920 - [c37]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Semi-Supervised Learning Based on Reference Model for Low-resource TTS. MSN 2022: 966-971 - [c36]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Improving Imbalanced Text Classification with Dynamic Curriculum Learning. MSN 2022: 1031-1036 - [i39]Huaizhen Tang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
AVQVC: One-shot Voice Conversion by Vector Quantization with applying contrastive learning. CoRR abs/2202.10020 (2022) - [i38]Botao Zhao, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
nnSpeech: Speaker-Guided Conditional Variational Autoencoder for Zero-shot Multi-speaker Text-to-Speech. CoRR abs/2202.10712 (2022) - [i37]Tong Ye, Shijing Si, Jianzong Wang, Rui Wang, Ning Cheng, Jing Xiao:
VU-BERT: A Unified framework for Visual Dialog. CoRR abs/2202.10787 (2022) - [i36]Qiqi Wang, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
DRVC: A Framework of Any-to-Any Voice Conversion with Self-Supervised Learning. CoRR abs/2202.10976 (2022) - [i35]Yong Zhang, Zhitao Li, Jianzong Wang, Ning Cheng, Jing Xiao:
Self-Attention for Incomplete Utterance Rewriting. CoRR abs/2202.12160 (2022) - [i34]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
Singer Identification for Metaverse with Timbral and Middle-Level Perceptual Features. CoRR abs/2205.11817 (2022) - [i33]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
MetaSID: Singer Identification with Domain Adaptation for Metaverse. CoRR abs/2205.11821 (2022) - [i32]Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao:
TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTS. CoRR abs/2205.11824 (2022) - [i31]