default search action

combined dblp search
author search
venue search
publication search

ask others

Guangyao Li 0001

> Home > Persons

Person information

affiliation: Tsinghua University, Department of Computer Science and Technology, Beijing, China
affiliation (former): Renmin University of China, Gaoling School of Artificial Intelligence, Beijing, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2026
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/pr/ZhangLLLC26
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pr/ZhangLLLC26
Xiaoqiang Zhang, Guangyao Li, Xiaomeng Li, Buwen Liang, Ying Chen:
Sarcasm detection enhanced by multi-modal topics using denoising diffusion probabilistic models. Pattern Recognit. 171: 112130 (2026)
2025
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/GuoYCNLQQZXY00Z25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/GuoYCNLQQZXY00Z25
Ruohao Guo, Xianghua Ying, Yaru Chen, Dantong Niu, Guangyao Li, Liao Qu, Yanyu Qi, Jinxing Zhou, Bowei Xing, Wenzhen Yue, Ji Shi, Qixun Wang, Peiliang Zhang, Buwen Liang:
Audio-Visual Instance Segmentation. CVPR 2025: 13550-13560
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/DuLZZZ025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/DuLZZZ025
Henghui Du, Guangyao Li, Chang Zhou, Chunjie Zhang, Alan Zhao, Di Hu:
Crab: A Unified Audio-Visual Scene Understanding Model with Explicit Cooperation. CVPR 2025: 18804-18814
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangCLL25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangCLL25
Xiaoqiang Zhang, Ying Chen, Guangyao Li, Buwen Liang:
PEDE: Enhance Multi-modal Sarcasm Detection in Videos via Prompted Emotion Distributions. ICASSP 2025: 1-5
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Wang0FGLZ0025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Wang0FGLZ0025
Ren Wang, Xin Wang, Tongtong Feng, Xinyue Gong, Guangyao Li, Yu-Wei Zhan, Qing Li, Wenwu Zhu:
Improving Compositional Generalization in Cross-Embodiment Learning via Mixture of Disentangled Prototypes. ACM Multimedia 2025: 7162-7171
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-05907
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-05907
Tongtong Feng, Xin Wang, Zekai Zhou, Ren Wang, Yuwei Zhan, Guangyao Li, Qing Li, Wenwu Zhu:
EvoAgent: Agent Autonomous Evolution with Continual World Model for Long-Horizon Tasks. CoRR abs/2502.05907 (2025)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2503-13068
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-13068
Henghui Du, Guangyao Li, Chang Zhou, Chunjie Zhang, Alan Zhao, Di Hu:
Crab: A Unified Audio-Visual Scene Understanding Model with Explicit Cooperation. CoRR abs/2503.13068 (2025)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-23271
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-23271
Jinxing Zhou, Zhihui Li, Yongqiang Yu, Yanghao Zhou, Ruohao Guo, Guangyao Li, Yuxin Mao, Mingfei Han, Xiaojun Chang, Meng Wang:
Mettle: Meta-Token Learning for Memory-Efficient Audio-Visual Adaptation. CoRR abs/2506.23271 (2025)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2512-04532
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-04532
Yuwei Zhan, Xin Wang, Hong Chen, Tongtong Feng, Wei Feng, Ren Wang, Guangyao Li, Qing Li, Wenwu Zhu:
PhyVLLM: Physics-Guided Video Language Model with Motion-Appearance Disentanglement. CoRR abs/2512.04532 (2025)
2024
[c9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/WangLLD0L24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/WangLLD0L24
Yaoting Wang, Weisong Liu, Guangyao Li, Jian Ding, Di Hu, Xi Li:
Prompting Segmentation with Sound Is Generalizable Audio-Visual Source Localizer. AAAI 2024: 5669-5677
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/WangSZLZH24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/WangSZLZH24
Yaoting Wang, Peiwen Sun, Dongzhan Zhou, Guangyao Li, Honggang Zhang, Di Hu:
Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes. ECCV (74) 2024: 196-213
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenGLWLL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenGLWLL024
Yaru Chen, Ruohao Guo, Xubo Liu, Peipei Wu, Guangyao Li, Zhenbo Li, Wenwu Wang:
CM-PIE: Cross-Modal Perception for Interactive-Enhanced Audio-Visual Video Parsing. ICASSP 2024: 8421-8425
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiD024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiD024
Guangyao Li, Henghui Du, Di Hu:
Boosting Audio Visual Question Answering via Key Semantic-Aware Cues. ACM Multimedia 2024: 5997-6005
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/uavm/FengLWWL024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/uavm/FengLWWL024
Tongtong Feng, Qing Li, Xin Wang, Mingzi Wang, Guangyao Li, Wenwu Zhu:
Multi-weather Cross-view Geo-localization Using Denoising Diffusion Models. UAVM 2024: 35-39
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-10957
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-10957
Yaoting Wang, Peiwen Sun, Dongzhan Zhou, Guangyao Li, Honggang Zhang, Di Hu:
Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes. CoRR abs/2407.10957 (2024)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-20693
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-20693
Guangyao Li, Henghui Du, Di Hu:
Boosting Audio Visual Question Answering via Key Semantic-Aware Cues. CoRR abs/2407.20693 (2024)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-02408
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-02408
Tongtong Feng, Qing Li, Xin Wang, Mingzi Wang, Guangyao Li, Wenwu Zhu:
Multi-weather Cross-view Geo-localization Using Denoising Diffusion Models. CoRR abs/2408.02408 (2024)
2023
[j2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/aeog/HeidlerMHJLGWZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/aeog/HeidlerMHJLGWZ23
Konrad Heidler, Lichao Mou, Di Hu, Pu Jin, Guangyao Li, Chuang Gan, Ji-Rong Wen, Xiao Xiang Zhu:
Self-supervised audiovisual representation learning for remote sensing data. Int. J. Appl. Earth Obs. Geoinformation 116: 103130 (2023)
[c4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiX023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiX023
Guangyao Li, Yixin Xu, Di Hu:
Multi-Scale Attention for Audio Question Answering. INTERSPEECH 2023: 3442-3446
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/LiH023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/LiH023
Guangyao Li, Wenxuan Hou, Di Hu:
Progressive Spatio-temporal Perception for Audio-Visual Question Answering. ACM Multimedia 2023: 7808-7816
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-17993
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-17993
Guangyao Li, Yixin Xu, Di Hu:
Multi-Scale Attention for Audio Question Answering. CoRR abs/2305.17993 (2023)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-09431
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-09431
Wenxuan Hou, Guangyao Li, Yapeng Tian, Di Hu:
Towards Long Form Audio-visual Video Understanding. CoRR abs/2306.09431 (2023)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-05421
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-05421
Guangyao Li, Wenxuan Hou, Di Hu:
Progressive Spatio-temporal Perception for Audio-Visual Question Answering. CoRR abs/2308.05421 (2023)
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-07929
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-07929
Yaoting Wang, Weisong Liu, Guangyao Li, Jian Ding, Di Hu, Xi Li:
Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer. CoRR abs/2309.07929 (2023)
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-07517
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-07517
Yaru Chen, Ruohao Guo, Xubo Liu, Peipei Wu, Guangyao Li, Zhenbo Li, Wenwu Wang:
CM-PIE: Cross-modal perception for interactive-enhanced audio-visual video parsing. CoRR abs/2310.07517 (2023)
2022
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/LiWTXW022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/LiWTXW022
Guangyao Li, Yake Wei, Yapeng Tian, Chenliang Xu, Ji-Rong Wen, Di Hu:
Learning to Answer Questions in Dynamic Audio-Visual Scenarios. CVPR 2022: 19086-19096
[i2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-14072
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-14072
Guangyao Li, Yake Wei, Yapeng Tian, Chenliang Xu, Ji-Rong Wen, Di Hu:
Learning to Answer Questions in Dynamic Audio-Visual Scenarios. CoRR abs/2203.14072 (2022)
2021
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2108-00688
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-00688
Konrad Heidler, Lichao Mou, Di Hu, Pu Jin, Guangyao Li, Chuang Gan, Ji-Rong Wen, Xiao Xiang Zhu:
Self-supervised Audiovisual Representation Learning for Remote Sensing Data. CoRR abs/2108.00688 (2021)
2020
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/cea/LiGLCL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/cea/LiGLCL20
Zhenbo Li, Ruohao Guo, Meng Li, Yaru Chen, Guangyao Li:
A review of computer vision technologies for plant phenotyping. Comput. Electron. Agric. 176: 105672 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/prcv/LiLZLY19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/prcv/LiLZLY19
Guangyao Li, Zhenbo Li, Chuyue Zhang, Yaodong Li, Jun Yue:
Shellfish Detection Based on Fusion Attention Mechanism in End-to-End Network. PRCV (3) 2019: 516-527

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.