default search action

combined dblp search
author search
venue search
publication search

ask others

Xubo Liu

> Home > Persons

This is just a disambiguation page, and is not intended to be the bibliography of an actual person. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LiuYLMKTWWWP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiuYLMKTWWWP24
Haohe Liu, Yi Yuan, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Qiao Tian, Yuping Wang, Wenwu Wang, Yuxuan Wang, Mark D. Plumbley:
AudioLDM 2: Learning Holistic Audio Generation With Self-Supervised Pretraining. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2871-2883 (2024)
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/MeiLSPW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/MeiLSPW24
Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
Towards Generating Diverse Audio Captions via Adversarial Training. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3311-3323 (2024)
[c37]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/LiuLK0P24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/LiuLK0P24
Haohe Liu, Xubo Liu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Learning Temporal Resolution in Spectrogram for Audio Classification. AAAI 2024: 13873-13881
[c36]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/HuangLK00ZT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/HuangLK00ZT24
Qiushi Huang, Xubo Liu, Tom Ko, Bo Wu, Wenwu Wang, Yu Zhang, Lilian Tang:
Selective Prompting Tuning for Personalized Conversations with LLMs. ACL (Findings) 2024: 16212-16226
[c35]
- view
  - electronic edition @ aclanthology.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/coling/LiuGZLZYY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/LiuGZLZYY24
Xumeng Liu, Wenya Guo, Ying Zhang, Xubo Liu, Yu Zhao, Shenglong Yu, Xiaojie Yuan:
Look before You Leap: Dual Logical Verification for Knowledge-based Visual Question Generation. LREC/COLING 2024: 10802-10812
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YuanLLHP024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YuanLLHP024
Yi Yuan, Haohe Liu, Xubo Liu, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Retrieval-Augmented Text-to-Audio Generation. ICASSP 2024: 581-585
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangZ0LXTML024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangZ0LXTML024
Hejing Zhang, Qiaoxi Zhu, Jian Guan, Haohe Liu, Feiyang Xiao, Jiantong Tian, Xinhao Mei, Xubo Liu, Wenwu Wang:
First-Shot Unsupervised Anomalous Sound Detection with Unknown Anomalies Estimated by Metadata-Assisted Audio Generation. ICASSP 2024: 1271-1275
[c32]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiuLZWXT024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuLZWXT024
Yuzhuo Liu, Xubo Liu, Yan Zhao, Yuanyuan Wang, Rui Xia, Pingchuan Tain, Yuxuan Wang:
Audio Prompt Tuning for Universal Sound Separation. ICASSP 2024: 1446-1450
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenGLWLL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenGLWLL024
Yaru Chen, Ruohao Guo, Xubo Liu, Peipei Wu, Guangyao Li, Zhenbo Li, Wenwu Wang:
CM-PIE: Cross-Modal Perception for Interactive-Enhanced Audio-Visual Video Parsing. ICASSP 2024: 8421-8425
[i48]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-17806
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-17806
Yi Yuan, Zhuo Chen, Xubo Liu, Haohe Liu, Xuenan Xu, Dongya Jia, Yuanzhe Chen, Mark D. Plumbley, Wenwu Wang:
T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining. CoRR abs/2404.17806 (2024)
[i47]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-18081
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-18081
Qixin Deng, Qikai Yang, Ruibin Yuan, Yipeng Huang, Yi Wang, Xubo Liu, Zeyue Tian, Jiahao Pan, Ge Zhang, Hanfeng Lin, Yizhi Li, Yinghao Ma, Jie Fu, Chenghua Lin, Emmanouil Benetos, Wenwu Wang, Guangyu Xia, Wei Xue, Yike Guo:
ComposerX: Multi-Agent Symbolic Music Composition with LLMs. CoRR abs/2404.18081 (2024)
[i46]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-17800
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-17800
Meng Cui, Xubo Liu, Haohe Liu, Jinzheng Zhao, Daoliang Li, Wenwu Wang:
Fish Tracking, Counting, and Behaviour Analysis in Digital Aquaculture: A Comprehensive Review. CoRR abs/2406.17800 (2024)
[i45]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-18187
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-18187
Qiushi Huang, Xubo Liu, Tom Ko, Bo Wu, Wenwu Wang, Yu Zhang, Lilian Tang:
Selective Prompting Tuning for Personalized Conversations with LLMs. CoRR abs/2406.18187 (2024)
[i44]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-18847
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-18847
Qiushi Huang, Shuai Fu, Xubo Liu, Wenwu Wang, Tom Ko, Yu Zhang, Lilian Tang:
Learning Retrieval Augmentation for Personalized Dialogue Generation. CoRR abs/2406.18847 (2024)
[i43]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-04416
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-04416
Yi Yuan, Dongya Jia, Xiaobin Zhuang, Yuanzhe Chen, Zhengxi Liu, Zhuo Chen, Yuping Wang, Yuxuan Wang, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Improving Audio Generation with Visual Enhanced Caption. CoRR abs/2407.04416 (2024)
[i42]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-04936
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-04936
Feiyang Xiao, Jian Guan, Qiaoxi Zhu, Xubo Liu, Wenbo Wang, Shuhan Qi, Kejia Zhang, Jianyuan Sun, Wenwu Wang:
A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining. CoRR abs/2407.04936 (2024)
[i41]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-11745
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-11745
Junqi Zhao, Xubo Liu, Jinzheng Zhao, Yi Yuan, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang:
Universal Sound Separation with Self-Supervised Audio Masked Autoencoder. CoRR abs/2407.11745 (2024)
2023
[c30]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/Huang0KL00T23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/Huang0KL00T23
Qiushi Huang, Yu Zhang, Tom Ko, Xubo Liu, Bo Wu, Wenwu Wang, H. Lilian Tang:
Personalized Dialogue Generation with Persona-Adaptive Attention. AAAI 2023: 12916-12923
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/LiuLV0CXDMKPPF23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/LiuLV0CXDMKPPF23
Xubo Liu, Egor Lakomkin, Konstantinos Vougioukas, Pingchuan Ma, Honglie Chen, Ruiming Xie, Morrie Doulaty, Niko Moritz, Jáchym Kolár, Stavros Petridis, Maja Pantic, Christian Fuegen:
SynthVSR: Scaling Up Visual Speech RecognitionWith Synthetic Supervision. CVPR 2023: 18806-18815
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/HuangFL0K0T23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/HuangFL0K0T23
Qiushi Huang, Shuai Fu, Xubo Liu, Wenwu Wang, Tom Ko, Yu Zhang, Lilian Tang:
Learning Retrieval Augmentation for Personalized Dialogue Generation. EMNLP 2023: 2523-2540
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/CayliLK023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/CayliLK023
Özkan Çayli, Xubo Liu, Volkan Kiliç, Wenwu Wang:
Knowledge Distillation for Efficient Audio-Visual Video Captioning. EUSIPCO 2023: 745-749
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/eusipco/YuanLLLPW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/YuanLLLPW23
Yi Yuan, Haohe Liu, Jinhua Liang, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Leveraging Pre-Trained AudioLDM for Sound Generation: A Benchmark Study. EUSIPCO 2023: 765-769
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiuLKMPW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuLKMPW23
Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Mark D. Plumbley, Wenwu Wang:
Simple Pooling Front-Ends for Efficient Audio Classification. ICASSP 2023: 1-5
[c24]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/icml/LiuCYMLM0P23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/LiuCYMLM0P23
Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo P. Mandic, Wenwu Wang, Mark D. Plumbley:
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models. ICML 2023: 21450-21474
[c23]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiangLLPBP023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiangLLPBP023
Jinhua Liang, Xubo Liu, Haohe Liu, Huy Phan, Emmanouil Benetos, Mark D. Plumbley, Wenwu Wang:
Adapting Language-Audio Models as Few-Shot Audio Learners. INTERSPEECH 2023: 276-280
[c22]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuHMLKSLK0TPK023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuHMLKSLK0TPK023
Xubo Liu, Qiushi Huang, Xinhao Mei, Haohe Liu, Qiuqiang Kong, Jianyuan Sun, Shengchen Li, Tom Ko, Yu Zhang, H. Lilian Tang, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention. INTERSPEECH 2023: 2838-2842
[c21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuKLM0P23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuKLM0P23
Haohe Liu, Qiuqiang Kong, Xubo Liu, Xinhao Mei, Wenwu Wang, Mark D. Plumbley:
Ontology-aware Learning and Evaluation for Audio Tagging. INTERSPEECH 2023: 3799-3803
[c20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SunLMKP023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SunLMKP023
Jianyuan Sun, Xubo Liu, Xinhao Mei, Volkan Kiliç, Mark D. Plumbley, Wenwu Wang:
Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning. INTERSPEECH 2023: 4164-4168
[i40]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-12503
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-12503
Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo P. Mandic, Wenwu Wang, Mark D. Plumbley:
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models. CoRR abs/2301.12503 (2023)
[i39]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-03857
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-03857
Yi Yuan, Haohe Liu, Jinhua Liang, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Leveraging Pre-trained AudioLDM for Text to Sound Generation: A Benchmark Study. CoRR abs/2303.03857 (2023)
[i38]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-17200
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-17200
Xubo Liu, Egor Lakomkin, Konstantinos Vougioukas, Pingchuan Ma, Honglie Chen, Ruiming Xie, Morrie Doulaty, Niko Moritz, Jáchym Kolár, Stavros Petridis, Maja Pantic, Christian Fuegen:
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision. CoRR abs/2303.17200 (2023)
[i37]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-15905
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-15905
Yi Yuan, Haohe Liu, Xubo Liu, Xiyuan Kang, Mark D. Plumbley, Wenwu Wang:
Latent Diffusion Model Based Foley Sound Generation System For DCASE Challenge 2023 Task 7. CoRR abs/2305.15905 (2023)
[i36]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-17719
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-17719
Jinhua Liang, Xubo Liu, Haohe Liu, Huy Phan, Emmanouil Benetos, Mark D. Plumbley, Wenwu Wang:
Adapting Language-Audio Models as Few-Shot Audio Learners. CoRR abs/2305.17719 (2023)
[i35]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-18753
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-18753
Jianyuan Sun, Xubo Liu, Xinhao Mei, Volkan Kiliç, Mark D. Plumbley, Wenwu Wang:
Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning. CoRR abs/2305.18753 (2023)
[i34]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-10359
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-10359
Yi Yuan, Haohe Liu, Xubo Liu, Xiyuan Kang, Peipei Wu, Mark D. Plumbley, Wenwu Wang:
Text-Driven Foley Sound Generation With Latent Diffusion Model. CoRR abs/2306.10359 (2023)
[i33]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-14335
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-14335
Xubo Liu, Zhongkai Zhu, Haohe Liu, Yi Yuan, Meng Cui, Qiushi Huang, Jinhua Liang, Yin Cao, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang:
WavJourney: Compositional Audio Creation with Large Language Models. CoRR abs/2307.14335 (2023)
[i32]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-05037
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-05037
Xubo Liu, Qiuqiang Kong, Yan Zhao, Haohe Liu, Yi Yuan, Yuzhuo Liu, Rui Xia, Yuxuan Wang, Mark D. Plumbley, Wenwu Wang:
Separate Anything You Describe. CoRR abs/2308.05037 (2023)
[i31]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-05734
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-05734
Haohe Liu, Qiao Tian, Yi Yuan, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Yuping Wang, Wenwu Wang, Yuxuan Wang, Mark D. Plumbley:
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining. CoRR abs/2308.05734 (2023)
[i30]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-05058
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-05058
Meng Cui, Xubo Liu, Haohe Liu, Zhuangzhuang Du, Tao Chen, Guoping Lian, Daoliang Li, Wenwu Wang:
Multimodal Fish Feeding Intensity Assessment in Aquaculture. CoRR abs/2309.05058 (2023)
[i29]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-08051
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-08051
Yi Yuan, Haohe Liu, Xubo Liu, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Retrieval-Augmented Text-to-Audio Generation. CoRR abs/2309.08051 (2023)
[i28]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-09705
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-09705
Feiyang Xiao, Qiaoxi Zhu, Jian Guan, Xubo Liu, Haohe Liu, Kejia Zhang, Wenwu Wang:
Synth-AC: Enhancing Audio Captioning with Synthetic Supervision. CoRR abs/2309.09705 (2023)
[i27]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-07517
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-07517
Yaru Chen, Ruohao Guo, Xubo Liu, Peipei Wu, Guangyao Li, Zhenbo Li, Wenwu Wang:
CM-PIE: Cross-modal perception for interactive-enhanced audio-visual video parsing. CoRR abs/2310.07517 (2023)
[i26]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-14173
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-14173
Hejing Zhang, Qiaoxi Zhu, Jian Guan, Haohe Liu, Feiyang Xiao, Jiantong Tian, Xinhao Mei, Xubo Liu, Wenwu Wang:
First-Shot Unsupervised Anomalous Sound Detection With Unknown Anomalies Estimated by Metadata-Assisted Audio Generation. CoRR abs/2310.14173 (2023)
[i25]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-18399
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-18399
Yuzhuo Liu, Xubo Liu, Yan Zhao, Yuanyuan Wang, Rui Xia, Pingchuan Tain, Yuxuan Wang:
Audio Prompt Tuning for Universal Sound Separation. CoRR abs/2311.18399 (2023)
2022
[j2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/ejasmp/MeiLPW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ejasmp/MeiLPW22
Xinhao Mei, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Automated audio captioning: an overview of recent progress and new challenges. EURASIP J. Audio Speech Music. Process. 2022(1): 26 (2022)
[c19]
- view
  - electronic edition @ dcase.community (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/dcase/LiuLMKWP22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/LiuLMKWP22
Haohe Liu, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Segment-Level Metric Learning for Few-Shot Bioacoustic Event Detection. DCASE 2022
[c18]
- view
  - electronic edition @ dcase.community (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/dcase/XiaoLKSCPW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/XiaoLKSCPW22
Yang Xiao, Xubo Liu, James A. King, Arshdeep Singh, Eng Siong Chng, Mark D. Plumbley, Wenwu Wang:
Continual Learning for On-Ddevice Environmental Sound Classification. DCASE 2022
[c17]
- view
  - electronic edition @ ieee.org
  - no references & citations available
- export record
  dblp key:
  - conf/eusipco/SunLMZPKW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/SunLMZPKW22
Jianyuan Sun, Xubo Liu, Xinhao Mei, Jinzheng Zhao, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Deep Neural Decision Forest for Acoustic Scene Classification. EUSIPCO 2022: 772-776
[c16]
- view
  - electronic edition @ ieee.org
  - no references & citations available
- export record
  dblp key:
  - conf/eusipco/ZhaoWGLSXW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/ZhaoWGLSXW22
Jinzheng Zhao, Peipei Wu, Shidrokh Goudarzi, Xubo Liu, Jianyuan Sun, Yong Xu, Wenwu Wang:
Visually Assisted Self-supervised Audio Speaker Localization and Tracking. EUSIPCO 2022: 787-791
[c15]
- view
  - electronic edition @ ieee.org
  - no references & citations available
- export record
  dblp key:
  - conf/eusipco/LiuMHSZLPKW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eusipco/LiuMHSZLPKW22
Xubo Liu, Xinhao Mei, Qiushi Huang, Jianyuan Sun, Jinzheng Zhao, Haohe Liu, Mark D. Plumbley, Volkan Kilic, Wenwu Wang:
Leveraging Pre-trained BERT for Audio Captioning. EUSIPCO 2022: 1145-1149
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icacs2/LiuZLY22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icacs2/LiuZLY22
Yunxiang Liu, Jianlin Zhu, Xubo Liu, Xinxin Yuan:
Path Planning based on Astar Algorithm in Automatic Driving. ICACS 2022: 7:1-7:4
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhaoWLXMGW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhaoWLXMGW22
Jinzheng Zhao, Peipei Wu, Xubo Liu, Yong Xu, Lyudmila Mihaylova, Simon J. Godsill, Wenwu Wang:
Audio-Visual Tracking of Multiple Speakers Via a PMBM Filter. ICASSP 2022: 5068-5072
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MeiLSPW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MeiLSPW22
Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
Diverse Audio Captioning Via Adversarial Training. ICASSP 2022: 8882-8886
[c11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuLKMZHPW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuLKMZHPW22
Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Separate What You Describe: Language-Queried Audio Source Separation. INTERSPEECH 2022: 1801-1805
[c10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhaoWLGLXW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhaoWLGLXW22
Jinzheng Zhao, Peipei Wu, Xubo Liu, Shidrokh Goudarzi, Haohe Liu, Yong Xu, Wenwu Wang:
Audio Visual Multi-Speaker Tracking with Improved GCF and PMBM Filter. INTERSPEECH 2022: 3704-3708
[c9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MeiLSPW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MeiLSPW22
Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
On Metric Learning for Audio-Text Cross-Modal Retrieval. INTERSPEECH 2022: 4142-4146
[c8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuCLKTW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuCLKTW22
Haohe Liu, Woosung Choi, Xubo Liu, Qiuqiang Kong, Qiao Tian, DeLiang Wang:
Neural Vocoder is All You Need for Speech Super-resolution. INTERSPEECH 2022: 4227-4231
[c7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuLKTZWHW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuLKTZWHW22
Haohe Liu, Xubo Liu, Qiuqiang Kong, Qiao Tian, Yan Zhao, DeLiang Wang, Chuanzeng Huang, Yuxuan Wang:
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration. INTERSPEECH 2022: 4232-4236
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/mlsp/CuiLZSLCPLW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsp/CuiLZSLCPLW22
Meng Cui, Xubo Liu, Jinzheng Zhao, Jianyuan Sun, Guoping Lian, Tao Chen, Mark D. Plumbley, Daoliang Li, Wenwu Wang:
Fish Feeding Intensity Assessment in Aquaculture: A New Audio Dataset AFFIA3K and a Deep Learning Algorithm. MLSP 2022: 1-6
[i24]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-02838
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-02838
Xubo Liu, Xinhao Mei, Qiushi Huang, Jianyuan Sun, Jinzheng Zhao, Haohe Liu, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Leveraging Pre-trained BERT for Audio Captioning. CoRR abs/2203.02838 (2022)
[i23]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-03436
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-03436
Jianyuan Sun, Xubo Liu, Xinhao Mei, Jinzheng Zhao, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Deep Neural Decision Forest for Acoustic Scene Classification. CoRR abs/2203.03436 (2022)
[i22]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-14941
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-14941
Haohe Liu, Woosung Choi, Xubo Liu, Qiuqiang Kong, Qiao Tian, DeLiang Wang:
Neural Vocoder is All You Need for Speech Super-resolution. CoRR abs/2203.14941 (2022)
[i21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-15147
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-15147
Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Separate What You Describe: Language-Queried Audio Source Separation. CoRR abs/2203.15147 (2022)
[i20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-15537
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-15537
Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
On Metric Learning for Audio-Text Cross-Modal Retrieval. CoRR abs/2203.15537 (2022)
[i19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-05841
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-05841
Haohe Liu, Xubo Liu, Qiuqiang Kong, Qiao Tian, Yan Zhao, DeLiang Wang, Chuanzeng Huang, Yuxuan Wang:
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration. CoRR abs/2204.05841 (2022)
[i18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2205-05949
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-05949
Xinhao Mei, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Automated Audio Captioning: an Overview of Recent Progress and New Challenges. CoRR abs/2205.05949 (2022)
[i17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-07429
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-07429
Yang Xiao, Xubo Liu, James A. King, Arshdeep Singh, Eng Siong Chng, Mark D. Plumbley, Wenwu Wang:
Continual Learning For On-Device Environmental Sound Classification. CoRR abs/2207.07429 (2022)
[i16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-07773
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-07773
Haohe Liu, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Segment-level Metric Learning for Few-shot Bioacoustic Event Detection. CoRR abs/2207.07773 (2022)
[i15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2207-10547
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-10547
Haohe Liu, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Surrey System for DCASE 2022 Task 5: Few-shot Bioacoustic Event Detection with Segment-level Metric Learning. CoRR abs/2207.10547 (2022)
[i14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-01555
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-01555
Arshdeep Singh, James A. King, Xubo Liu, Wenwu Wang, Mark D. Plumbley:
Low-complexity CNNs for Acoustic Scene Classification. CoRR abs/2208.01555 (2022)
[i13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-00943
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-00943
Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Mark D. Plumbley, Wenwu Wang:
Simple Pooling Front-ends For Efficient Audio Classification. CoRR abs/2210.00943 (2022)
[i12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-01719
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-01719
Haohe Liu, Xubo Liu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Learning the Spectrogram Temporal Resolution for Audio Classification. CoRR abs/2210.01719 (2022)
[i11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-05037
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-05037
Jianyuan Sun, Xubo Liu, Xinhao Mei, Mark D. Plumbley, Volkan Kilic, Wenwu Wang:
Automated Audio Captioning via Fusion of Low- and High- Dimensional Features. CoRR abs/2210.05037 (2022)
[i10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-15088
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-15088
Qiushi Huang, Yu Zhang, Tom Ko, Xubo Liu, Bo Wu, Wenwu Wang, H. Lilian Tang:
Personalized Dialogue Generation with Persona-Adaptive Attention. CoRR abs/2210.15088 (2022)
[i9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-16428
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-16428
Xubo Liu, Qiushi Huang, Xinhao Mei, Haohe Liu, Qiuqiang Kong, Jianyuan Sun, Shengchen Li, Tom Ko, Yu Zhang, H. Lilian Tang, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention. CoRR abs/2210.16428 (2022)
[i8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-12195
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-12195
Haohe Liu, Qiuqiang Kong, Xubo Liu, Xinhao Mei, Wenwu Wang, Mark D. Plumbley:
Ontology-aware Learning and Evaluation for Audio Tagging. CoRR abs/2211.12195 (2022)
[i7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-02033
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-02033
Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
Towards Generating Diverse Audio Captions via Adversarial Training. CoRR abs/2212.02033 (2022)
2021
[c5]
- view
  - electronic edition @ dcase.community (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/dcase/LiuHMKTPW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/LiuHMKTPW21
Xubo Liu, Qiushi Huang, Xinhao Mei, Tom Ko, H. Lilian Tang, Mark D. Plumbley, Wenwu Wang:
CL4AC: A Contrastive Loss for Audio Captioning. DCASE 2021: 196-200
[c4]
- view
  - electronic edition @ dcase.community (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/dcase/MeiHLCWWZLKTSPW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/MeiHLCWWZLKTSPW21
Xinhao Mei, Qiushi Huang, Xubo Liu, Gengyun Chen, Jingqian Wu, Yusong Wu, Jinzheng Zhao, Shengchen Li, Tom Ko, H. Lilian Tang, Xi Shao, Mark D. Plumbley, Wenwu Wang:
An Encoder-Decoder Based Audio Captioning System with Transfer and Reinforcement Learning. DCASE 2021: 206-210
[c3]
- view
  - electronic edition @ dcase.community (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/dcase/MeiLHPW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/dcase/MeiLHPW21
Xinhao Mei, Xubo Liu, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Audio Captioning Transformer. DCASE 2021: 211-215
[c2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuangKTL021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangKTL021
Qiushi Huang, Tom Ko, H. Lilian Tang, Xubo Liu, Bo Wu:
Token-Level Supervised Contrastive Learning for Punctuation Restoration. Interspeech 2021: 2012-2016
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/mlsp/LiuIZHPW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mlsp/LiuIZHPW21
Xubo Liu, Turab Iqbal, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning. MLSP 2021: 1-6
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-09099
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-09099
Qiushi Huang, Tom Ko, H. Lilian Tang, Xubo Liu, Bo Wu:
Token-Level Supervised Contrastive Learning for Punctuation Restoration. CoRR abs/2107.09099 (2021)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-09817
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-09817
Xinhao Mei, Xubo Liu, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Audio Captioning Transformer. CoRR abs/2107.09817 (2021)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-09990
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-09990
Xubo Liu, Qiushi Huang, Xinhao Mei, Tom Ko, H. Lilian Tang, Mark D. Plumbley, Wenwu Wang:
CL4AC: A Contrastive Loss for Audio Captioning. CoRR abs/2107.09990 (2021)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2107-09998
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-09998
Xubo Liu, Turab Iqbal, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning. CoRR abs/2107.09998 (2021)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2108-02752
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-02752
Xinhao Mei, Qiushi Huang, Xubo Liu, Gengyun Chen, Jingqian Wu, Yusong Wu, Jinzheng Zhao, Shengchen Li, Tom Ko, H. Lilian Tang, Xi Shao, Mark D. Plumbley, Wenwu Wang:
An Encoder-Decoder Based Audio Captioning System With Transfer and Reinforcement Learning. CoRR abs/2108.02752 (2021)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2110-06691
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-06691
Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
Diverse Audio Captioning via Adversarial Training. CoRR abs/2110.06691 (2021)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j1]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/access/LiuLCF19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/access/LiuLCF19
Zhenbao Liu, Xubo Liu, Jie Chen, Chen Fang:
Altitude Control for Variable Load Quadrotor via Learning Rate Based Robust Sliding Mode Controller. IEEE Access 7: 9736-9744 (2019)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.