


Guangyao Li 0001
Person information
- affiliation: Tsinghua University, Department of Computer Science and Technology, Beijing, China
- affiliation (former): Renmin University of China, Gaoling School of Artificial Intelligence, Beijing, China
Other persons with the same name
- Guangyao Li — disambiguation page
2020 – today
- 2026
[j3]Xiaoqiang Zhang, Guangyao Li, Xiaomeng Li, Buwen Liang, Ying Chen:
Sarcasm detection enhanced by multi-modal topics using denoising diffusion probabilistic models. Pattern Recognit. 171: 112130 (2026)
- 2025
[c13]Ruohao Guo, Xianghua Ying, Yaru Chen, Dantong Niu, Guangyao Li, Liao Qu, Yanyu Qi, Jinxing Zhou, Bowei Xing, Wenzhen Yue, Ji Shi, Qixun Wang, Peiliang Zhang, Buwen Liang:
Audio-Visual Instance Segmentation. CVPR 2025: 13550-13560
[c12]Henghui Du, Guangyao Li, Chang Zhou, Chunjie Zhang, Alan Zhao, Di Hu:
Crab: A Unified Audio-Visual Scene Understanding Model with Explicit Cooperation. CVPR 2025: 18804-18814
[c11]Xiaoqiang Zhang, Ying Chen, Guangyao Li, Buwen Liang:
PEDE: Enhance Multi-modal Sarcasm Detection in Videos via Prompted Emotion Distributions. ICASSP 2025: 1-5
[c10]Ren Wang, Xin Wang, Tongtong Feng, Xinyue Gong, Guangyao Li, Yu-Wei Zhan, Qing Li, Wenwu Zhu:
Improving Compositional Generalization in Cross-Embodiment Learning via Mixture of Disentangled Prototypes. ACM Multimedia 2025: 7162-7171
[i14]Tongtong Feng, Xin Wang, Zekai Zhou, Ren Wang, Yuwei Zhan, Guangyao Li, Qing Li, Wenwu Zhu:
EvoAgent: Agent Autonomous Evolution with Continual World Model for Long-Horizon Tasks. CoRR abs/2502.05907 (2025)
[i13]Henghui Du, Guangyao Li, Chang Zhou, Chunjie Zhang, Alan Zhao, Di Hu:
Crab: A Unified Audio-Visual Scene Understanding Model with Explicit Cooperation. CoRR abs/2503.13068 (2025)
[i12]Jinxing Zhou, Zhihui Li, Yongqiang Yu, Yanghao Zhou, Ruohao Guo, Guangyao Li, Yuxin Mao, Mingfei Han, Xiaojun Chang, Meng Wang:
Mettle: Meta-Token Learning for Memory-Efficient Audio-Visual Adaptation. CoRR abs/2506.23271 (2025)
[i11]Yuwei Zhan, Xin Wang, Hong Chen, Tongtong Feng, Wei Feng, Ren Wang, Guangyao Li, Qing Li, Wenwu Zhu:
PhyVLLM: Physics-Guided Video Language Model with Motion-Appearance Disentanglement. CoRR abs/2512.04532 (2025)
- 2024
[c9]Yaoting Wang, Weisong Liu, Guangyao Li, Jian Ding, Di Hu, Xi Li:
Prompting Segmentation with Sound Is Generalizable Audio-Visual Source Localizer. AAAI 2024: 5669-5677
[c8]Yaoting Wang, Peiwen Sun, Dongzhan Zhou, Guangyao Li, Honggang Zhang, Di Hu:
Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes. ECCV (74) 2024: 196-213
[c7]Yaru Chen, Ruohao Guo, Xubo Liu, Peipei Wu, Guangyao Li, Zhenbo Li, Wenwu Wang:
CM-PIE: Cross-Modal Perception for Interactive-Enhanced Audio-Visual Video Parsing. ICASSP 2024: 8421-8425
[c6]Guangyao Li, Henghui Du, Di Hu:
Boosting Audio Visual Question Answering via Key Semantic-Aware Cues. ACM Multimedia 2024: 5997-6005
[c5]Tongtong Feng, Qing Li, Xin Wang, Mingzi Wang, Guangyao Li, Wenwu Zhu:
Multi-weather Cross-view Geo-localization Using Denoising Diffusion Models. UAVM 2024: 35-39
[i10]Yaoting Wang, Peiwen Sun, Dongzhan Zhou, Guangyao Li, Honggang Zhang, Di Hu:
Ref-AVS: Refer and Segment Objects in Audio-Visual Scenes. CoRR abs/2407.10957 (2024)
[i9]Guangyao Li, Henghui Du, Di Hu:
Boosting Audio Visual Question Answering via Key Semantic-Aware Cues. CoRR abs/2407.20693 (2024)
[i8]Tongtong Feng, Qing Li, Xin Wang, Mingzi Wang, Guangyao Li, Wenwu Zhu:
Multi-weather Cross-view Geo-localization Using Denoising Diffusion Models. CoRR abs/2408.02408 (2024)
- 2023
[j2]Konrad Heidler, Lichao Mou, Di Hu, Pu Jin, Guangyao Li, Chuang Gan, Ji-Rong Wen, Xiao Xiang Zhu:
Self-supervised audiovisual representation learning for remote sensing data. Int. J. Appl. Earth Obs. Geoinformation 116: 103130 (2023)
[c4]Guangyao Li, Yixin Xu, Di Hu:
Multi-Scale Attention for Audio Question Answering. INTERSPEECH 2023: 3442-3446
[c3]Guangyao Li, Wenxuan Hou, Di Hu:
Progressive Spatio-temporal Perception for Audio-Visual Question Answering. ACM Multimedia 2023: 7808-7816
[i7]Guangyao Li, Yixin Xu, Di Hu:
Multi-Scale Attention for Audio Question Answering. CoRR abs/2305.17993 (2023)
[i6]Wenxuan Hou, Guangyao Li, Yapeng Tian, Di Hu:
Towards Long Form Audio-visual Video Understanding. CoRR abs/2306.09431 (2023)
[i5]Guangyao Li, Wenxuan Hou, Di Hu:
Progressive Spatio-temporal Perception for Audio-Visual Question Answering. CoRR abs/2308.05421 (2023)
[i4]Yaoting Wang, Weisong Liu, Guangyao Li, Jian Ding, Di Hu, Xi Li:
Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer. CoRR abs/2309.07929 (2023)
[i3]Yaru Chen, Ruohao Guo, Xubo Liu, Peipei Wu, Guangyao Li, Zhenbo Li, Wenwu Wang:
CM-PIE: Cross-modal perception for interactive-enhanced audio-visual video parsing. CoRR abs/2310.07517 (2023)
- 2022
[c2]Guangyao Li, Yake Wei, Yapeng Tian, Chenliang Xu, Ji-Rong Wen, Di Hu:
Learning to Answer Questions in Dynamic Audio-Visual Scenarios. CVPR 2022: 19086-19096
[i2]Guangyao Li, Yake Wei, Yapeng Tian, Chenliang Xu, Ji-Rong Wen, Di Hu:
Learning to Answer Questions in Dynamic Audio-Visual Scenarios. CoRR abs/2203.14072 (2022)
- 2021
[i1]Konrad Heidler, Lichao Mou, Di Hu, Pu Jin, Guangyao Li, Chuang Gan, Ji-Rong Wen, Xiao Xiang Zhu:
Self-supervised Audiovisual Representation Learning for Remote Sensing Data. CoRR abs/2108.00688 (2021)
- 2020
[j1]Zhenbo Li, Ruohao Guo, Meng Li, Yaru Chen, Guangyao Li:
A review of computer vision technologies for plant phenotyping. Comput. Electron. Agric. 176: 105672 (2020)
2010 – 2019
- 2019
[c1]Guangyao Li, Zhenbo Li, Chuyue Zhang, Yaodong Li, Jun Yue:
Shellfish Detection Based on Fusion Attention Mechanism in End-to-End Network. PRCV (3) 2019: 516-527
last updated on 2026-02-13 00:41 CET by the dblp team
all metadata released as open data under CC0 1.0 license







