


Xihua Wang 0002
Person information
- affiliation: Renmin University of China, Beijing, China
Other persons with the same name
- Xihua Wang — disambiguation page
- Xihua Wang 0001 (University of Alberta, Edmonton, Alberta, Canada)
2020 – today
- 2025
[c9]Yihan Wu, Yichen Lu, Yifan Peng, Xihua Wang, Ruihua Song, Shinji Watanabe:
Enhancing Audiovisual Speech Recognition Through Bifocal Preference Optimization. AAAI 2025: 25516-25524
[c8]Xihua Wang, Ruihua Song, Chongxuan Li, Xin Cheng, Boyuan Li, Yihan Wu, Yuyue Wang, Hongteng Xu, Yunfeng Wang:
Animate and Sound an Image. CVPR 2025: 23369-23378
[c7]Xin Cheng, Xihua Wang, Yihan Wu, Yuyue Wang, Ruihua Song:
LoVA: Long-form Video-to-Audio Generation. ICASSP 2025: 1-5
[c6]Boyuan Li, Xihua Wang, Ruihua Song, Wenbing Huang:
Two-in-One: Unified Multi-Person Interactive Motion Generation by Latent Diffusion Transformer. ICASSP 2025: 1-5
[c5]Yuyue Wang, Xin Cheng, Yihan Wu, Xihua Wang, Jinchuan Tian, Ruihua Song:
A Visual Speech Language Model for Visual Text-to-Speech Task. MMAsia 2025: 66:1-66:8
[i7]Xin Cheng, Yuyue Wang, Xihua Wang, Yihan Wu, Kaisi Guan, Yijing Chen, Peng Zhang, Xiaojiang Liu, Meng Cao, Ruihua Song:
VSSFlow: Unifying Video-conditioned Sound and Speech Generation via Joint Learning. CoRR abs/2509.24773 (2025)
[i6]Kaisi Guan, Xihua Wang, Zhengfeng Lai, Xin Cheng, Peng Zhang, Xiaojiang Liu, Ruihua Song, Meng Cao:
Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction. CoRR abs/2510.03117 (2025)
[i5]Yuyue Wang, Xin Cheng, Yihan Wu, Xihua Wang, Jinchuan Tian, Ruihua Song:
VSpeechLM: A Visual Speech Language Model for Visual Text-to-Speech Task. CoRR abs/2511.22229 (2025)
- 2024
[c4]Xihua Wang, Yuyue Wang, Yihan Wu, Ruihua Song, Xu Tan, Zehua Chen, Hongteng Xu, Guodong Sui:
TiVA: Time-Aligned Video-to-Audio Generation. ACM Multimedia 2024: 573-582
[c3]Xu Gu, Xihua Wang, Chuhao Jin, Ruihua Song:
ScaMo: Towards Text to Video Storyboard Generation Using Scale and Movement of Shots. MMAsia 2024: 115:1-115:8
[i4]Yihan Wu, Soumi Maiti, Yifan Peng, Wangyou Zhang, Chenda Li, Yuyue Wang, Xihua Wang, Shinji Watanabe, Ruihua Song:
SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition. CoRR abs/2401.18045 (2024)
[i3]Xin Cheng, Xihua Wang, Yihan Wu, Yuyue Wang, Ruihua Song:
LoVA: Long-form Video-to-Audio Generation. CoRR abs/2409.15157 (2024)
[i2]Boyuan Li, Xihua Wang, Ruihua Song, Wenbing Huang:
Two-in-One: Unified Multi-Person Interactive Motion Generation by Latent Diffusion Transformer. CoRR abs/2412.16670 (2024)
[i1]Yihan Wu, Yichen Lu, Yifan Peng, Xihua Wang, Ruihua Song, Shinji Watanabe:
Enhancing Audiovisual Speech Recognition through Bifocal Preference Optimization. CoRR abs/2412.19005 (2024)
- 2023
[c2]Xu Gu, Yuchong Sun, Feiyue Ni, Shizhe Chen, Xihua Wang, Ruihua Song, Boyuan Li, Xiang Cao:
TeViS: Translating Text Synopses to Video Storyboards. ACM Multimedia 2023: 4968-4979
[c1]Xihua Wang, Lei Ji, Kun Yan, Yuchong Sun, Ruihua Song:
Expanding the Horizons: Exploring Further Steps in Open-Vocabulary Segmentation. PRCV (10) 2023: 407-419
last updated on 2026-02-08 23:19 CET by the dblp team
all metadata released as open data under CC0 1.0 license