default search action

combined dblp search
author search
venue search
publication search

ask others

Xihua Wang 0002

> Home > Persons

Person information

affiliation: Renmin University of China, Beijing, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[c9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/WuLPWS025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/WuLPWS025
Yihan Wu, Yichen Lu, Yifan Peng, Xihua Wang, Ruihua Song, Shinji Watanabe:
Enhancing Audiovisual Speech Recognition Through Bifocal Preference Optimization. AAAI 2025: 25516-25524
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/WangSL0LW0XW25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/WangSL0LW0XW25
Xihua Wang, Ruihua Song, Chongxuan Li, Xin Cheng, Boyuan Li, Yihan Wu, Yuyue Wang, Hongteng Xu, Yunfeng Wang:
Animate and Sound an Image. CVPR 2025: 23369-23378
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0008WW0S25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0008WW0S25
Xin Cheng, Xihua Wang, Yihan Wu, Yuyue Wang, Ruihua Song:
LoVA: Long-form Video-to-Audio Generation. ICASSP 2025: 1-5
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiWS025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiWS025
Boyuan Li, Xihua Wang, Ruihua Song, Wenbing Huang:
Two-in-One: Unified Multi-Person Interactive Motion Generation by Latent Diffusion Transformer. ICASSP 2025: 1-5
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/mmasia/0003000TS25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mmasia/0003000TS25
Yuyue Wang, Xin Cheng, Yihan Wu, Xihua Wang, Jinchuan Tian, Ruihua Song:
A Visual Speech Language Model for Visual Text-to-Speech Task. MMAsia 2025: 66:1-66:8
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-24773
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-24773
Xin Cheng, Yuyue Wang, Xihua Wang, Yihan Wu, Kaisi Guan, Yijing Chen, Peng Zhang, Xiaojiang Liu, Meng Cao, Ruihua Song:
VSSFlow: Unifying Video-conditioned Sound and Speech Generation via Joint Learning. CoRR abs/2509.24773 (2025)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-03117
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-03117
Kaisi Guan, Xihua Wang, Zhengfeng Lai, Xin Cheng, Peng Zhang, Xiaojiang Liu, Ruihua Song, Meng Cao:
Taming Text-to-Sounding Video Generation via Advanced Modality Condition and Interaction. CoRR abs/2510.03117 (2025)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2511-22229
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-22229
Yuyue Wang, Xin Cheng, Yihan Wu, Xihua Wang, Jinchuan Tian, Ruihua Song:
VSpeechLM: A Visual Speech Language Model for Visual Text-to-Speech Task. CoRR abs/2511.22229 (2025)
2024
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/Wang0WS0CXS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/Wang0WS0CXS24
Xihua Wang, Yuyue Wang, Yihan Wu, Ruihua Song, Xu Tan, Zehua Chen, Hongteng Xu, Guodong Sui:
TiVA: Time-Aligned Video-to-Audio Generation. ACM Multimedia 2024: 573-582
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/mmasia/GuWJS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mmasia/GuWJS24
Xu Gu, Xihua Wang, Chuhao Jin, Ruihua Song:
ScaMo: Towards Text to Video Storyboard Generation Using Scale and Movement of Shots. MMAsia 2024: 115:1-115:8
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-18045
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-18045
Yihan Wu, Soumi Maiti, Yifan Peng, Wangyou Zhang, Chenda Li, Yuyue Wang, Xihua Wang, Shinji Watanabe, Ruihua Song:
SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition. CoRR abs/2401.18045 (2024)
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-15157
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-15157
Xin Cheng, Xihua Wang, Yihan Wu, Yuyue Wang, Ruihua Song:
LoVA: Long-form Video-to-Audio Generation. CoRR abs/2409.15157 (2024)
[i2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-16670
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-16670
Boyuan Li, Xihua Wang, Ruihua Song, Wenbing Huang:
Two-in-One: Unified Multi-Person Interactive Motion Generation by Latent Diffusion Transformer. CoRR abs/2412.16670 (2024)
[i1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-19005
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-19005
Yihan Wu, Yichen Lu, Yifan Peng, Xihua Wang, Ruihua Song, Shinji Watanabe:
Enhancing Audiovisual Speech Recognition through Bifocal Preference Optimization. CoRR abs/2412.19005 (2024)
2023
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/GuSNCWSLC23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/GuSNCWSLC23
Xu Gu, Yuchong Sun, Feiyue Ni, Shizhe Chen, Xihua Wang, Ruihua Song, Boyuan Li, Xiang Cao:
TeViS: Translating Text Synopses to Video Storyboards. ACM Multimedia 2023: 4968-4979
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/prcv/WangJYSS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/prcv/WangJYSS23
Xihua Wang, Lei Ji, Kun Yan, Yuchong Sun, Ruihua Song:
Expanding the Horizons: Exploring Further Steps in Open-Vocabulary Segmentation. PRCV (10) 2023: 407-419

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.