Jianhua Tao 0001
陶建华
Person information
- unicode name: 陶建华
- affiliation: Tsinghua University, Department of Automation, Beijing, China
- affiliation: University of Chinese Academy of Sciences, School of Artificial Intelligence, Beijing, China
- affiliation (PhD 2001): Tsinghua University, Beijing, China
Other persons with the same name
- Jianhua Tao 0002 — Guangzhou University, School of Mechanical and Electrical Engineering, China
2020 – today
- 2024
- [j89]Tao Wang, Jiangyan Yi, Ruibo Fu, Jianhua Tao, Zhengqi Wen, Chu Yuan Zhang:
Emotion selectable end-to-end text-based speech editing. Artif. Intell. 329: 104076 (2024) - [j88]Cunhang Fan, Heng Xie, Jianhua Tao, Yongwei Li, Guanxiong Pei, Taihao Li, Zhao Lv:
ICaps-ResLSTM: Improved capsule network and residual LSTM for EEG emotion recognition. Biomed. Signal Process. Control. 87(Part B): 105422 (2024) - [j87]Pengpeng Shao, Jianhua Tao:
Multi-level graph contrastive learning. Neurocomputing 570: 127101 (2024) - [j86]Zheng Lian, Licai Sun, Haiyang Sun, Kang Chen, Zhuofan Wen, Hao Gu, Bin Liu, Jianhua Tao:
GPT-4V with emotion: A zero-shot benchmark for Generalized Emotion Recognition. Inf. Fusion 108: 102367 (2024) - [j85]Licai Sun, Zheng Lian, Bin Liu, Jianhua Tao:
HiCMAE: Hierarchical Contrastive Masked Autoencoder for self-supervised Audio-Visual Emotion Recognition. Inf. Fusion 108: 102382 (2024) - [j84]Guofeng Yi, Cunhang Fan, Kang Zhu, Zhao Lv, Shan Liang, Zhengqi Wen, Guanxiong Pei, Taihao Li, Jianhua Tao:
VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis. Knowl. Based Syst. 283: 111136 (2024) - [j83]Pengpeng Shao, Yang Wen, Jianhua Tao:
Bayesian hypernetwork collaborates with time-difference evolutional network for temporal knowledge prediction. Neural Networks 175: 106146 (2024) - [j82]Cunhang Fan, Jun Xue, Jianhua Tao, Jiangyan Yi, Chenglong Wang, Chengshi Zheng, Zhao Lv:
Spatial reconstructed local attention Res2Net with F0 subband for fake speech detection. Neural Networks 175: 106320 (2024) - [j81]Feihu Che, Jianhua Tao:
M2ixKG: Mixing for harder negative samples in knowledge graph. Neural Networks 177: 106358 (2024) - [j80]Cunhang Fan, Hongyu Zhang, Wei Huang, Jun Xue, Jianhua Tao, Jiangyan Yi, Zhao Lv, Xiaopei Wu:
DGSD: Dynamical graph self-distillation for EEG-based auditory spatial attention detection. Neural Networks 179: 106580 (2024) - [j79]Jiangyan Yi, Chenglong Wang, Jianhua Tao, Chuyuan Zhang, Cunhang Fan, Zhengkun Tian, Haoxin Ma, Ruibo Fu:
SceneFake: An initial dataset and benchmarks for scene fake audio detection. Pattern Recognit. 152: 110468 (2024) - [j78]Mingyue Niu, Jianhua Tao, Yongwei Li, Yong Qin, Ya Li:
WavDepressionNet: Automatic Depression Level Prediction via Raw Speech Signals. IEEE Trans. Affect. Comput. 15(1): 285-296 (2024) - [j77]Licai Sun, Zheng Lian, Bin Liu, Jianhua Tao:
Efficient Multimodal Transformer With Dual-Level Feature Restoration for Robust Multimodal Sentiment Analysis. IEEE Trans. Affect. Comput. 15(1): 309-325 (2024) - [j76]Cunhang Fan, Mingming Ding, Jianhua Tao, Ruibo Fu, Jiangyan Yi, Zhengqi Wen, Zhao Lv:
Dual-Branch Knowledge Distillation for Noise-Robust Synthetic Speech Detection. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2453-2466 (2024) - [j75]Zheng Lian, Bin Liu, Jianhua Tao:
PIRNet: Personality-Enhanced Iterative Refinement Network for Emotion Recognition in Conversation. IEEE Trans. Neural Networks Learn. Syst. 35(2): 2863-2874 (2024) - [c274]Cunhang Fan, Yujie Chen, Jun Xue, Yonghui Kong, Jianhua Tao, Zhao Lv:
Progressive Distillation Based on Masked Generation Feature Method for Knowledge Graph Completion. AAAI 2024: 8380-8388 - [c273]Xiaohui Zhang, Jiangyan Yi, Chenglong Wang, Chu Yuan Zhang, Siding Zeng, Jianhua Tao:
What to Remember: Self-Adaptive Continual Learning for Audio Deepfake Detection. AAAI 2024: 19569-19577 - [c272]Sicheng Zhao, Jianhua Tao, Guiguang Ding:
Open-world Domain Adaptation and Generalization. ACM TUR-C 2024 - [c271]Hao Gu, Jiangyan Yi, Zheng Lian, Jianhua Tao, Xinrui Yan:
NLoPT: N-gram Enhanced Low-Rank Task Adaptive Pre-training for Efficient Language Model Adaption. LREC/COLING 2024: 12259-12270 - [c270]Shuaihu Han, Guohua Yang, Dawei Zhang, Jianhua Tao, Feihu Che:
Multi-stage Vs Single-Stage: A Local Information Focused Approach for Overlapping Event Extraction. ICANN (7) 2024: 277-291 - [c269]Chenglong Wang, Jiayi He, Jiangyan Yi, Jianhua Tao, Chu Yuan Zhang, Xiaohui Zhang:
Multi-Scale Permutation Entropy for Audio Deepfake Detection. ICASSP 2024: 1406-1410 - [c268]Mingyu Xu, Zheng Lian, Bin Liu, Zerui Chen, Jianhua Tao:
Pseudo Labels Regularization for Imbalanced Partial-Label Learning. ICASSP 2024: 6305-6309 - [c267]Yong Ren, Tao Wang, Jiangyan Yi, Le Xu, Jianhua Tao, Chu Yuan Zhang, Junzuo Zhou:
Fewer-Token Neural Speech Codec with Time-Invariant Codes. ICASSP 2024: 12737-12741 - [c266]Shuaihu Han, Guohua Yang, Dawei Zhang, Jianhua Tao:
What Comes Next and Why? A Staged Encoder-Decoder Architecture for Script Event Prediction. IJCNN 2024: 1-9 - [c265]Xiaoyang Li, Guohua Yang, Dawei Zhang, Jianhua Tao:
APC: Predict Global Representation From Local Observation In Multi-Agent Reinforcement Learning. IJCNN 2024: 1-8 - [c264]Yonghui Kong, Cunhang Fan, Yujie Chen, Shuai Zhang, Zhao Lv, Jianhua Tao:
Bilateral Masking with prompt for Knowledge Graph Completion. NAACL-HLT (Findings) 2024: 240-249 - [i113]Licai Sun, Zheng Lian, Kexin Wang, Yu He, Mingyu Xu, Haiyang Sun, Bin Liu, Jianhua Tao:
SVFAP: Self-supervised Video Facial Affect Perceiver. CoRR abs/2401.00416 (2024) - [i112]Zheng Lian, Licai Sun, Yong Ren, Hao Gu, Haiyang Sun, Lan Chen, Bin Liu, Jianhua Tao:
MERBench: A Unified Evaluation Benchmark for Multimodal Emotion Recognition. CoRR abs/2401.03429 (2024) - [i111]Licai Sun, Zheng Lian, Bin Liu, Jianhua Tao:
HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition. CoRR abs/2401.05698 (2024) - [i110]Cunhang Fan, Yujie Chen, Jun Xue, Yonghui Kong, Jianhua Tao, Zhao Lv:
Progressive Distillation Based on Masked Generation Feature Method for Knowledge Graph Completion. CoRR abs/2401.12997 (2024) - [i109]Kang Chen, Zheng Lian, Haiyang Sun, Bin Liu, Jianhua Tao:
Can Deception Detection Go Deeper? Dataset, Evaluation, and Benchmark for Deception Reasoning. CoRR abs/2402.11432 (2024) - [i108]Zhuofan Wen, Fengyu Zhang, Siyuan Zhang, Haiyang Sun, Mingyu Xu, Licai Sun, Zheng Lian, Bin Liu, Jianhua Tao:
Multimodal Fusion with Pre-Trained Model Features in Affective Behaviour Analysis In-the-wild. CoRR abs/2403.15044 (2024) - [i107]Xinxin Zheng, Feihu Che, Jinyang Wu, Shuai Zhang, Shuai Nie, Kang Liu, Jianhua Tao:
KS-LLM: Knowledge Selection of Large Language Models with Evidence Document for Question Answering. CoRR abs/2404.15660 (2024) - [i106]Zheng Lian, Haiyang Sun, Licai Sun, Zhuofan Wen, Siyuan Zhang, Shun Chen, Hao Gu, Jinming Zhao, Ziyang Ma, Xie Chen, Jiangyan Yi, Rui Liu, Kele Xu, Bin Liu, Erik Cambria, Guoying Zhao, Björn W. Schuller, Jianhua Tao:
MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition. CoRR abs/2404.17113 (2024) - [i105]Yuankun Xie, Yi Lu, Ruibo Fu, Zhengqi Wen, Zhiyong Wang, Jianhua Tao, Xin Qi, Xiaopeng Wang, Yukun Liu, Haonan Cheng, Long Ye, Yi Sun:
The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio. CoRR abs/2405.04880 (2024) - [i104]Jinyang Wu, Feihu Che, Xinxin Zheng, Shuai Zhang, Ruihan Jin, Shuai Nie, Pengpeng Shao, Jianhua Tao:
Can large language models understand uncommon meanings of common words? CoRR abs/2405.05741 (2024) - [i103]Xiaohui Zhang, Jiangyan Yi, Jianhua Tao:
EVDA: Evolving Deepfake Audio Detection Continual Learning Benchmark. CoRR abs/2405.08596 (2024) - [i102]Zhiyong Wang, Ruibo Fu, Zhengqi Wen, Yuankun Xie, Yukun Liu, Xiaopeng Wang, Xuefei Liu, Yongwei Li, Jianhua Tao, Yi Lu, Xin Qi, Shuchen Shi:
Generalized Fake Audio Detection via Deep Stable Learning. CoRR abs/2406.03237 (2024) - [i101]Yuankun Xie, Ruibo Fu, Zhengqi Wen, Zhiyong Wang, Xiaopeng Wang, Haonan Cheng, Long Ye, Jianhua Tao:
Generalized Source Tracing: Detecting Novel Audio Deepfake Algorithm with Real Emphasis and Fake Dispersion Strategy. CoRR abs/2406.03240 (2024) - [i100]Xiaopeng Wang, Ruibo Fu, Zhengqi Wen, Zhiyong Wang, Yuankun Xie, Yukun Liu, Jianhua Tao, Xuefei Liu, Yongwei Li, Xin Qi, Yi Lu, Shuchen Shi:
Genuine-Focused Learning using Mask AutoEncoder for Generalized Fake Audio Detection. CoRR abs/2406.03247 (2024) - [i99]Shuchen Shi, Ruibo Fu, Zhengqi Wen, Jianhua Tao, Tao Wang, Chunyu Qiang, Yi Lu, Xin Qi, Xuefei Liu, Yukun Liu, Yongwei Li, Zhiyong Wang, Xiaopeng Wang:
PPPR: Portable Plug-in Prompt Refiner for Text to Audio Generation. CoRR abs/2406.04683 (2024) - [i98]Junzuo Zhou, Jiangyan Yi, Tao Wang, Jianhua Tao, Ye Bai, Chu Yuan Zhang, Yong Ren, Zhengqi Wen:
TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking. CoRR abs/2406.04840 (2024) - [i97]Yujie Chen, Jiangyan Yi, Jun Xue, Chenglong Wang, Xiaohui Zhang, Shunbo Dong, Siding Zeng, Jianhua Tao, Zhao Lv, Cunhang Fan:
RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection. CoRR abs/2406.06086 (2024) - [i96]Yi Lu, Yuankun Xie, Ruibo Fu, Zhengqi Wen, Jianhua Tao, Zhiyong Wang, Xin Qi, Xuefei Liu, Yongwei Li, Yukun Liu, Xiaopeng Wang, Shuchen Shi:
Codecfake: An Initial Dataset for Detecting LLM-based Deepfake Audio. CoRR abs/2406.08112 (2024) - [i95]Ruibo Fu, Shuchen Shi, Hongming Guo, Tao Wang, Chunyu Qiang, Zhengqi Wen, Jianhua Tao, Xin Qi, Yi Lu, Xiaopeng Wang, Zhiyong Wang, Yukun Liu, Xuefei Liu, Shuai Zhang, Guanjun Li:
MINT: a Multi-modal Image and Narrative Text Dubbing Dataset for Foley Audio Content Planning and Generation. CoRR abs/2406.10591 (2024) - [i94]Ruihan Jin, Ruibo Fu, Zhengqi Wen, Shuai Zhang, Yukun Liu, Jianhua Tao:
Fake News Detection and Manipulation Reasoning via Large Vision-Language Models. CoRR abs/2407.02042 (2024) - [i93]Ruibo Fu, Xin Qi, Zhengqi Wen, Jianhua Tao, Tao Wang, Chunyu Qiang, Zhiyong Wang, Yi Lu, Xiaopeng Wang, Shuchen Shi, Yukun Liu, Xuefei Liu, Shuai Zhang:
ASRRL-TTS: Agile Speaker Representation Reinforcement Learning for Text-to-Speech Speaker Adaptation. CoRR abs/2407.05421 (2024) - [i92]Zheng Lian, Haiyang Sun, Licai Sun, Jiangyan Yi, Bin Liu, Jianhua Tao:
AffectGPT: Dataset and Framework for Explainable Multimodal Emotion Recognition. CoRR abs/2407.07653 (2024) - [i91]Siding Zeng, Jiangyan Yi, Jianhua Tao, Yujie Chen, Shan Liang, Yong Ren, Xiaohui Zhang:
An Unsupervised Domain Adaptation Method for Locating Manipulated Region in partially fake Audio. CoRR abs/2407.08239 (2024) - [i90]Cong Cai, Shan Liang, Xuefei Liu, Kang Zhu, Zhengqi Wen, Jianhua Tao, Heng Xie, Jizhou Cui, Yiming Ma, Zhenhua Cheng, Hanzhe Xu, Ruibo Fu, Bin Liu, Yongwei Li:
MDPE: A Multimodal Deception Dataset with Personality and Emotional Characteristics. CoRR abs/2407.12274 (2024) - [i89]Jiangyan Yi, Chu Yuan Zhang, Jianhua Tao, Chenglong Wang, Xinrui Yan, Yong Ren, Hao Gu, Junzuo Zhou:
ADD 2023: Towards Audio Deepfake Detection and Analysis in the Wild. CoRR abs/2408.04967 (2024) - [i88]Chunyu Qiang, Wang Geng, Yi Zhao, Ruibo Fu, Tao Wang, Cheng Gong, Tianrui Wang, Qiuyu Liu, Jiangyan Yi, Zhengqi Wen, Chen Zhang, Hao Che, Longbiao Wang, Jiangwu Dang, Jianhua Tao:
VQ-CTAP: Cross-Modal Fine-Grained Sequence Representation Learning for Speech Processing. CoRR abs/2408.05758 (2024) - [i87]Zhiyong Wang, Xiaopeng Wang, Yuankun Xie, Ruibo Fu, Zhengqi Wen, Jianhua Tao, Yukun Liu, Guanjun Li, Xin Qi, Yi Lu, Xuefei Liu, Yongwei Li:
A Noval Feature via Color Quantisation for Fake Audio Detection. CoRR abs/2408.10849 (2024) - [i86]Xin Qi, Ruibo Fu, Zhengqi Wen, Jianhua Tao, Shuchen Shi, Yi Lu, Zhiyong Wang, Xiaopeng Wang, Yuankun Xie, Yukun Liu, Guanjun Li, Xuefei Liu, Yongwei Li:
EELE: Exploring Efficient and Extensible LoRA Integration in Emotional Text-to-Speech. CoRR abs/2408.10852 (2024) - [i85]Yuankun Xie, Chenxu Xiong, Xiaopeng Wang, Zhiyong Wang, Yi Lu, Xin Qi, Ruibo Fu, Yukun Liu, Zhengqi Wen, Jianhua Tao, Guanjun Li, Long Ye:
Does Current Deepfake Audio Detection Model Effectively Detect ALM-based Deepfake Audio? CoRR abs/2408.10853 (2024) - [i84]Moyang Liu, Yukun Liu, Ruibo Fu, Zhengqi Wen, Jianhua Tao, Xuefei Liu, Guanjun Li:
Exploring the Role of Audio in Multimodal Misinformation Detection. CoRR abs/2408.12558 (2024) - [i83]Jinyang Wu, Feihu Che, Chuyuan Zhang, Jianhua Tao, Shuai Zhang, Pengpeng Shao:
Pandora's Box or Aladdin's Lamp: A Comprehensive Analysis Revealing the Role of RAG Noise in Large Language Models. CoRR abs/2408.13533 (2024) - [i82]Hao Gu, Jiangyan Yi, Chenglong Wang, Yong Ren, Jianhua Tao, Xinrui Yan, Yujie Chen, Xiaohui Zhang:
Utilizing Speaker Profiles for Impersonation Audio Detection. CoRR abs/2408.17009 (2024)
- 2023
- [j74]Pengpeng Shao, Jiayi He, Guanjun Li, Dawei Zhang, Jianhua Tao:
Hierarchical graph attention network for temporal knowledge graph reasoning. Neurocomputing 550: 126390 (2023) - [j73]Zepeng Huai, Dawei Zhang, Guohua Yang, Jianhua Tao:
Spatial-temporal knowledge graph network for event prediction. Neurocomputing 553: 126557 (2023) - [j72]Pengpeng Shao, Tong Liu, Feihu Che, Dawei Zhang, Jianhua Tao:
Adaptive pseudo-Siamese policy network for temporal knowledge prediction. Neural Networks 160: 192-201 (2023) - [j71]Zheng Lian, Lan Chen, Licai Sun, Bin Liu, Jianhua Tao:
GCNet: Graph Completion Network for Incomplete Multimodal Learning in Conversation. IEEE Trans. Pattern Anal. Mach. Intell. 45(7): 8419-8432 (2023) - [j70]Andreas Triantafyllopoulos, Björn W. Schuller, Gökçe Iymen, Tevfik Metin Sezgin, Xiangheng He, Zijiang Yang, Panagiotis Tzirakis, Shuo Liu, Silvan Mertes, Elisabeth André, Ruibo Fu, Jianhua Tao:
An Overview of Affective Speech Synthesis and Conversion in the Deep Learning Era. Proc. IEEE 111(10): 1355-1381 (2023) - [j69]Jiangyan Yi, Jianhua Tao, Ye Bai, Zhengkun Tian, Cunhang Fan:
Transfer knowledge for punctuation prediction via adversarial training. Speech Commun. 149: 1-10 (2023) - [j68]Mingyue Niu, Jianhua Tao, Bin Liu, Jian Huang, Zheng Lian:
Multimodal Spatiotemporal Representation for Automatic Depression Level Detection. IEEE Trans. Affect. Comput. 14(1): 294-307 (2023) - [j67]Mingyue Niu, Ziping Zhao, Jianhua Tao, Ya Li, Björn W. Schuller:
Dual Attention and Element Recalibration Networks for Automatic Depression Level Prediction. IEEE Trans. Affect. Comput. 14(3): 1954-1965 (2023) - [j66]Zheng Lian, Bin Liu, Jianhua Tao:
SMIN: Semi-Supervised Multi-Modal Interaction Network for Conversational Emotion Recognition. IEEE Trans. Affect. Comput. 14(3): 2415-2429 (2023) - [j65]Jiangyan Yi, Jianhua Tao, Ruibo Fu, Tao Wang, Chu Yuan Zhang, Chenglong Wang:
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2963-2973 (2023) - [c263]Junjie Chen, Yongwei Li, Ziping Zhao, Xuefei Liu, Zhengqi Wen, Jianhua Tao:
Hybrid Multi-Task Learning for End-To-End Multimodal Emotion Recognition. APSIPA ASC 2023: 1966-1971 - [c262]Yi Lu, Ruibo Fu, Xin Qi, Zhengqi Wen, Jianhua Tao, Jiangyan Yi, Tao Wang, Yong Ren, Chuyuan Zhang, Chenyu Yang, Wenling Shi:
The VIBVG Speech Synthesis System for Blizzard Challenge 2023. Blizzard Challenge 2023 - [c261]Xiaohui Zhang, Mangui Liang, Zhengkun Tian, Jiangyan Yi, Jianhua Tao:
TST: Time-Sparse Transducer for Automatic Speech Recognition. CICAI (2) 2023: 68-80 - [c260]Xiaohui Zhang, Jiangyan Yi, Jianhua Tao, Chenglong Wang, Le Xu, Ruibo Fu:
Adaptive Fake Audio Detection with Low-Rank Model Squeezing. DADA@IJCAI 2023: 95-100 - [c259]Chenglong Wang, Jiangyan Yi, Xiaohui Zhang, Jianhua Tao, Xinrui Yan, Le Xu, Ruibo Fu:
Low-rank Adaptation Method for Wav2vec2-based Fake Audio Detection. DADA@IJCAI 2023: 101-106 - [c258]Jiangyan Yi, Jianhua Tao, Ruibo Fu, Xinrui Yan, Chenglong Wang, Tao Wang, Chu Yuan Zhang, Xiaohui Zhang, Yan Zhao, Yong Ren, Le Xu, Junzuo Zhou, Hao Gu, Zhengqi Wen, Shan Liang, Zheng Lian, Shuai Nie, Haizhou Li:
ADD 2023: the Second Audio Deepfake Detection Challenge. DADA@IJCAI 2023: 125-130 - [c257]Guanjun Li, Wei Xue, Wenju Liu, Jiangyan Yi, Jianhua Tao:
GCC-Speaker: Target Speaker Localization with Optimal Speaker-Dependent Weighting in Multi-Speaker Scenarios. ICASSP 2023: 1-5 - [c256]Jinlong Xue, Yayue Deng, Fengping Wang, Ya Li, Yingming Gao, Jianhua Tao, Jianqing Sun, Jiaen Liang:
M2-CTTS: End-to-End Multi-Scale Multi-Modal Conversational Text-to-Speech Synthesis. ICASSP 2023: 1-5 - [c255]Xiaohui Zhang, Jiangyan Yi, Jianhua Tao, Chenglong Wang, Chu Yuan Zhang:
Do You Remember? Overcoming Catastrophic Forgetting for Fake Audio Detection. ICML 2023: 41819-41831 - [c254]Zepeng Huai, Guohua Yang, Jianhua Tao, Dawei Zhang:
Learning Item Attributes and User Interests for Knowledge Graph Enhanced Recommendation. ICONIP (4) 2023: 284-297 - [c253]Ruiteng Zhang, Jianguo Wei, Xugang Lu, Yongwei Li, Junhai Xu, Di Jin, Jianhua Tao:
SOT: Self-supervised Learning-Assisted Optimal Transport for Unsupervised Adaptive Speech Emotion Recognition. INTERSPEECH 2023: 1858-1862 - [c252]Chenglong Wang, Jiangyan Yi, Jianhua Tao, Chu Yuan Zhang, Shuai Zhang, Ruibo Fu, Xun Chen:
TO-Rawnet: Improving RawNet with TCN and Orthogonal Regularization for Fake Audio Detection. INTERSPEECH 2023: 3137-3141 - [c251]Haiyang Sun, Zheng Lian, Bin Liu, Ying Li, Jianhua Tao, Licai Sun, Cong Cai, Meng Wang, Yuan Cheng:
EmotionNAS: Two-stream Neural Architecture Search for Speech Emotion Recognition. INTERSPEECH 2023: 3597-3601 - [c250]Chenglong Wang, Jiangyan Yi, Jianhua Tao, Chu Yuan Zhang, Shuai Zhang, Xun Chen:
Detection of Cross-Dataset Fake Audio Based on Prosodic and Pronunciation Features. INTERSPEECH 2023: 3844-3848 - [c249]Licai Sun, Zheng Lian, Bin Liu, Jianhua Tao:
MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic Facial Expression Recognition. ACM Multimedia 2023: 6110-6121 - [c248]Ke Xu, Kang Chen, Licai Sun, Zheng Lian, Bin Liu, Gong Chen, Haiyang Sun, Mingyu Xu, Jianhua Tao:
Integrating VideoMAE based model and Optical Flow for Micro- and Macro-expression Spotting. ACM Multimedia 2023: 9576-9580 - [c247]Zheng Lian, Haiyang Sun, Licai Sun, Kang Chen, Mingyu Xu, Kexin Wang, Ke Xu, Yu He, Ying Li, Jinming Zhao, Ye Liu, Bin Liu, Jiangyan Yi, Meng Wang, Erik Cambria, Guoying Zhao, Björn W. Schuller, Jianhua Tao:
MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning. ACM Multimedia 2023: 9610-9614 - [c246]Zheng Lian, Erik Cambria, Guoying Zhao, Björn W. Schuller, Jianhua Tao:
MRAC'23: 1st International Workshop on Multimodal and Responsible Affective Computing. ACM Multimedia 2023: 9713-9714 - [c245]Guofeng Yi, Yuguang Yang, Yu Pan, Yuhang Cao, Jixun Yao, Xiang Lv, Cunhang Fan, Zhao Lv, Jianhua Tao, Shan Liang, Heng Lu:
Exploring the Power of Cross-Contextual Large Language Model in Mimic Emotion Prediction. MuSe@ACM Multimedia 2023: 19-26 - [c244]Heng Xie, Jizhou Cui, Yuhang Cao, Junjie Chen, Jianhua Tao, Cunhang Fan, Xuefei Liu, Zhengqi Wen, Heng Lu, Yuguang Yang, Zhao Lv, Yongwei Li:
Multimodal Cross-Lingual Features and Weight Fusion for Cross-Cultural Humor Detection. MuSe@ACM Multimedia 2023: 51-57 - [c243]Haiyang Sun, Zhuofan Wen, Mingyu Xu, Zheng Lian, Licai Sun, Bin Liu, Jianhua Tao:
Exclusive Modeling for MuSe-Personalisation Challenge. MuSe@ACM Multimedia 2023: 73-80 - [c242]Mingyu Xu, Zheng Lian, Lei Feng, Bin Liu, Jianhua Tao:
ALIM: Adjusting Label Importance Mechanism for Noisy Partial Label Learning. NeurIPS 2023 - [c241]Mingyu Xu, Zheng Lian, Bin Liu, Jianhua Tao:
VRA: Variational Rectified Activation for Out-of-distribution Detection. NeurIPS 2023 - [e6]Jianhua Tao, Haizhou Li, Jiangyan Yi, Cunhang Fan:
Proceedings of the Workshop on Deepfake Audio Detection and Analysis co-located with 32th International Joint Conference on Artificial Intelligence (IJCAI 2023), Macao, China, August 19, 2023. CEUR Workshop Proceedings 3597, CEUR-WS.org 2023 - [i81]Haogeng Liu, Tao Wang, Ruibo Fu, Jiangyan Yi, Zhengqi Wen, Jianhua Tao:
UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion. CoRR abs/2301.03801 (2023) - [i80]Mingyu Xu, Zheng Lian, Lei Feng, Bin Liu, Jianhua Tao:
DALI: Dynamically Adjusted Label Importance for Noisy Partial Label Learning. CoRR abs/2301.12077 (2023) - [i79]Zheng Lian, Haiyang Sun, Licai Sun, Jinming Zhao, Ye Liu, Bin Liu, Jiangyan Yi, Meng Wang, Erik Cambria, Guoying Zhao, Björn W. Schuller, Jianhua Tao:
MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning. CoRR abs/2304.08981 (2023) - [i78]Jinlong Xue, Yayue Deng, Fengping Wang, Ya Li, Yingming Gao, Jianhua Tao, Jianqing Sun, Jiaen Liang:
M2-CTTS: End-to-End Multi-scale Multi-modal Conversational Text-to-Speech Synthesis. CoRR abs/2305.02269 (2023) - [i77]Chenglong Wang, Jiangyan Yi, Jianhua Tao, Chu Yuan Zhang, Shuai Zhang, Xun Chen:
Detection of Cross-Dataset Fake Audio Based on Prosodic and Pronunciation Features. CoRR abs/2305.13700 (2023) - [i76]Chenglong Wang, Jiangyan Yi, Jianhua Tao, Chuyuan Zhang, Shuai Zhang, Ruibo Fu, Xun Chen:
TO-Rawnet: Improving RawNet with TCN and Orthogonal Regularization for Fake Audio Detection. CoRR abs/2305.13701 (2023) - [i75]Jiangyan Yi, Jianhua Tao, Ruibo Fu, Xinrui Yan, Chenglong Wang, Tao Wang, Chu Yuan Zhang, Xiaohui Zhang, Yan Zhao, Yong Ren, Le Xu, Junzuo Zhou, Hao Gu, Zhengqi Wen, Shan Liang, Zheng Lian, Shuai Nie, Haizhou Li:
ADD 2023: the Second Audio Deepfake Detection Challenge. CoRR abs/2305.13774 (2023) - [i74]Xiaohui Zhang, Jiangyan Yi, Jianhua Tao, Chenlong Wang, Le Xu, Ruibo Fu:
Adaptive Fake Audio Detection with Low-Rank Model Squeezing. CoRR abs/2306.04956 (2023) - [i73]Chenglong Wang, Jiangyan Yi, Xiaohui Zhang, Jianhua Tao, Le Xu, Ruibo Fu:
Low-rank Adaptation Method for Wav2vec2-based Fake Audio Detection. CoRR abs/2306.05617 (2023) - [i72]Haogeng Liu, Tao Wang, Jie Cao, Ran He, Jianhua Tao:
Boosting Fast and High-Quality Speech Synthesis with Linear Diffusion. CoRR abs/2306.05708 (2023) - [i71]Zheng Lian, Licai Sun, Mingyu Xu, Haiyang Sun, Ke Xu, Zhuofan Wen, Shun Chen, Bin Liu, Jianhua Tao:
Explainable Multimodal Emotion Reasoning. CoRR abs/2306.15401 (2023) - [i70]Licai Sun, Zheng Lian, Bin Liu, Jianhua Tao:
MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic Facial Expression Recognition. CoRR abs/2307.02227 (2023) - [i69]Xiaohui Zhang, Mangui Liang, Zhengkun Tian, Jiangyan Yi, Jianhua Tao:
TST: Time-Sparse Transducer for Automatic Speech Recognition. CoRR abs/2307.08323 (2023) - [i68]Xiaohui Zhang, Jiangyan Yi, Jianhua Tao, Chenglong Wang, Chuyuan Zhang:
Do You Remember? Overcoming Catastrophic Forgetting for Fake Audio Detection. CoRR abs/2308.03300 (2023) - [i67]Cunhang Fan, Jun Xue, Jianhua Tao, Jiangyan Yi, Chenglong Wang, Chengshi Zheng, Zhao Lv:
Spatial Reconstructed Local Attention Res2Net with F0 Subband for Fake Speech Detection. CoRR abs/2308.09944 (2023) - [i66]Jiangyan Yi, Chenglong Wang, Jianhua Tao, Xiaohui Zhang, Chu Yuan Zhang, Yan Zhao:
Audio Deepfake Detection: A Survey. CoRR abs/2308.14970 (2023) - [i65]Chu Yuan Zhang, Jiangyan Yi, Jianhua Tao, Chenglong Wang, Xinrui Yan:
Distinguishing Neural Speech Synthesis Models Through Fingerprints in Speech Waveforms. CoRR abs/2309.06780 (2023) - [i64]