


default search action
Qi Wu 0001
Person information
- unicode name: 吴琦
- affiliation: University of Adelaide, School of Computer Science, Australian Centre for Robotic Vision, Adelaide, Australia
- affiliation (PhD 2015): University of Bath, UK
Other persons with the same name
- Qi Wu — disambiguation page
- Qi Wu 0002 — Chinese Academy of Sciences, Institute of Computing Technology, Beijing, China
- Qi Wu 0003
(aka: Edmond Qi Wu) — Shanghai Jiao Tong University, School of Electronic, Information and Electrical Engineering, China (and 2 more)
- Qi Wu 0004
— Technical University of Hamburg-Harburg, Germany (and 2 more)
- Qi Wu 0005
— Beihang University, School of Instrumentation and Optoelectronic Engineering, Beijing, China
- Qi Wu 0006 — ScaleFlux Inc., San Jose, CA, USA (and 2 more)
- Qi Wu 0007
— Shanghai Jiao Tong University, School of Electronic Information and Electrical Engineering, Shanghai Key Laboratory of Navigation and Location-based Services, Shanghai, China
- Qi Wu 0008 — Jiangxi University of Finance and Economics, School of Information Technology, Nanchang, China
- Qi Wu 0009
— City University of Hong Kong, School of Data Science, Hong Kong (and 2 more)
- Qi Wu 0010
— Chongqing University of Posts and Telecommunications, School of Communication and Information Engineering, Chongqing, China
- Qi Wu 0011
— Huazhong University of Science and Technology, School of Electrical and Electronic Engineering, State Key Laboratory of Advanced Electromagnetic Engineering and Technology, Wuhan, China
- Qi Wu 0012
— Huazhong University of Science and Technology, School of Mechanical Science and Engineering, National NC System Engineering Research Center, Wuhan, China
- Qi Wu 0013
— Guangdong Police College, Department of Computer Science, Guangzhou, China
- Qi Wu 0014 (aka: Tony Qi Wu) — Carnegie Mellon University, Electrical and Computer Engineering Department, Pittsburgh, PA, USA (and 1 more)
- Qi Wu 0015
— University of California, Davis, CA, USA (and 1 more)
- Qi Wu 0016
— Xiangtan University, School of Mathematics and Computational Science, Xiangtan, China (and 1 more)
- Qi Wu 0017 — Megvii Technology, Beijing, China
- Qi Wu 0018
— Zhejiang University of Technology, College of Information Engineering, Hangzhou, China
- Qi Wu 0019
— State Grid Changzhou Power Supply Company, 500 kV Substation Operation and Overhaul Center, Changzhou, China (and 1 more)
- Qi Wu 0020
— Wuhan University of Science and Technology, School of Automobile and Traffic Engineering, China
- Qi Wu 0021
— University of Science and Technology of China, Hefei, China
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [j39]Jianpeng Zhang, Xiaomin Chen, Bing Yang, Qingbiao Guan, Qi Chen, Jian Chen, Qi Wu, Yutong Xie
, Yong Xia:
Advances in attention mechanisms for medical image segmentation. Comput. Sci. Rev. 56: 100721 (2025) - [j38]Minkui Tan, Qi Chen, Zixiong Huang, Qi Wu, Yuanqing Li, Jiaqiu Zhou:
Auto-3D-house Design from Structured User Requirements. Int. J. Autom. Comput. 22(2): 368-385 (2025) - [c118]Keke Gai, Dongjue Wang, Jing Yu, Mohan Wang, Liehuang Zhu, Qi Wu:
MFL-Owner: Ownership Protection for Multi-modal Federated Learning via Orthogonal Transform Watermark. AAAI 2025: 3049-3058 - [i129]Haodong Hong, Yanyuan Qiao, Sen Wang, Jiajun Liu, Qi Wu:
General Scene Adaptation for Vision-and-Language Navigation. CoRR abs/2501.17403 (2025) - [i128]Weiren Zhao, Feng Wang, Yanran Wang, Yutong Xie, Qi Wu, Yuyin Zhou:
UD-Mamba: A pixel-level uncertainty-driven Mamba model for medical image segmentation. CoRR abs/2502.02024 (2025) - [i127]Keke Gai, Mohan Wang, Jing Yu, Dongjue Wang, Qi Wu:
Adaptive Prototype Knowledge Transfer for Federated Learning with Mixed Modalities and Heterogeneous Tasks. CoRR abs/2502.04400 (2025) - [i126]Shuo Wang, Keke Gai, Jing Yu, Liehuang Zhu, Qi Wu:
Vertical Federated Continual Learning via Evolving Prototype Knowledge. CoRR abs/2502.09152 (2025) - [i125]Liangqi Lei, Keke Gai, Jing Yu, Liehuang Zhu, Qi Wu:
Secure and Efficient Watermarking for Latent Diffusion Models in Model Distribution Scenarios. CoRR abs/2502.13345 (2025) - [i124]Zerui Li, Gengze Zhou, Haodong Hong, Yanyan Shao, Wenqi Lyu, Yanyuan Qiao, Qi Wu:
Ground-level Viewpoint Vision-and-Language Navigation in Continuous Environments. CoRR abs/2502.19024 (2025) - [i123]Xinyu Wang, Bohan Zhuang, Qi Wu:
Are Large Vision Language Models Good Game Players? CoRR abs/2503.02358 (2025) - [i122]Xiangyan Qu, Jing Yu, Jiamin Zhuang, Gaopeng Gou, Gang Xiong, Qi Wu:
MADS: Multi-Attribute Document Supervision for Zero-Shot Image Classification. CoRR abs/2503.06847 (2025) - [i121]Xiangyu Shi, Zerui Li, Wenqi Lyu, Jiatong Xia, Feras Dayoub, Yanyuan Qiao, Qi Wu:
SmartWay: Enhanced Waypoint Prediction and Backtracking for Zero-Shot Vision-and-Language Navigation. CoRR abs/2503.10069 (2025) - [i120]Yuanmin Tang, Jing Yu, Keke Gai, Jiamin Zhuang, Gang Xiong, Gaopeng Gou, Qi Wu:
Missing Target-Relevant Information Prediction with World Model for Accurate Zero-Shot Composed Image Retrieval. CoRR abs/2503.17109 (2025) - [i119]Chaohan Wang, Yutong Xie, Qi Chen, Yuyin Zhou, Qi Wu:
A Comprehensive Analysis of Mamba for 3D Volumetric Medical Image Segmentation. CoRR abs/2503.19308 (2025) - 2024
- [j37]Mengyang Sun, Wei Suo, Peng Wang
, Kai Niu, Le Liu, Guosheng Lin, Yanning Zhang, Qi Wu:
An Adaptive Correlation Filtering Method for Text-Based Person Search. Int. J. Comput. Vis. 132(10): 4440-4455 (2024) - [j36]Yutong Xie
, Lin Gu
, Tatsuya Harada
, Jianpeng Zhang, Yong Xia
, Qi Wu:
Rethinking masked image modelling for medical image representation. Medical Image Anal. 98: 103304 (2024) - [j35]Ning Ding
, Chaorui Deng
, Mingkui Tan
, Qing Du
, Zhiwei Ge, Qi Wu
:
Image Captioning With Controllable and Adaptive Length Levels. IEEE Trans. Pattern Anal. Mach. Intell. 46(2): 764-779 (2024) - [j34]Chen Gao
, Si Liu
, Jinyu Chen
, Luting Wang
, Qi Wu
, Bo Li
, Qi Tian
:
Room-Object Entity Prompting and Reasoning for Embodied Referring Expression. IEEE Trans. Pattern Anal. Mach. Intell. 46(2): 994-1010 (2024) - [j33]Yutong Xie
, Jianpeng Zhang
, Yong Xia
, Qi Wu
:
UniMiSS+: Universal Medical Self-Supervised Learning From Cross-Dimensional Unpaired Data. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 10021-10035 (2024) - [j32]Zhiquan Wen
, Shuaicheng Niu, Ge Li
, Qingyao Wu
, Mingkui Tan
, Qi Wu
:
Test-Time Model Adaptation for Visual Question Answering With Debiased Self-Supervisions. IEEE Trans. Multim. 26: 2137-2147 (2024) - [j31]Mingkui Tan
, Zhiquan Wen
, Leyuan Fang
, Qi Wu
:
Transformer-Based Relational Inference Network for Complex Visual Relational Reasoning. ACM Trans. Multim. Comput. Commun. Appl. 20(1): 10:1-10:23 (2024) - [c117]Qi Chen, Dileepa Pitawela, Chongyang Zhao, Gengze Zhou, Hsiang-Ting Chen, Qi Wu:
WebVLN: Vision-and-Language Navigation on Websites. AAAI 2024: 1165-1173 - [c116]Bahram Mohammadi, Yicong Hong, Yuankai Qi
, Qi Wu, Shirui Pan, Javen Qinfeng Shi:
Augmented Commonsense Knowledge for Remote Object Grounding. AAAI 2024: 4269-4277 - [c115]Yuanmin Tang, Jing Yu, Keke Gai, Jiamin Zhuang, Gang Xiong, Yue Hu, Qi Wu:
Context-I2W: Mapping Images to Context-Dependent Words for Accurate Zero-Shot Composed Image Retrieval. AAAI 2024: 5180-5188 - [c114]Gengze Zhou, Yicong Hong, Qi Wu:
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models. AAAI 2024: 7641-7649 - [c113]Qi Chen
, Yutong Xie
, Biao Wu, Xiaomin Chen, James Ang, Minh-Son To
, Xiaojun Chang
, Qi Wu:
Act Like a Radiologist: Radiology Report Generation Across Anatomical Regions. ACCV (6) 2024: 36-52 - [c112]Zixiong Huang, Qi Chen, Libo Sun, Yifan Yang, Naizhou Wang, Qi Wu, Mingkui Tan:
G-NeRF: Geometry-enhanced Novel View Synthesis from Single-View Images. CVPR 2024: 10117-10126 - [c111]Yiwen Ye, Yutong Xie, Jianpeng Zhang, Ziyang Chen, Qi Wu, Yong Xia:
Continual Self-Supervised Learning: Towards Universal Multi-Modal Medical Data Representation Learning. CVPR 2024: 11114-11124 - [c110]Vu Minh Hieu Phan, Yutong Xie, Yuankai Qi
, Lingqiao Liu, Liyang Liu, Bowen Zhang, Zhibin Liao, Qi Wu, Minh-Son To
, Johan W. Verjans
:
Decomposing Disease Descriptions for Enhanced Pathology Detection: A Multi-Aspect Vision-Language Pre-Training Framework. CVPR 2024: 11492-11501 - [c109]Yutong Xie, Qi Chen, Sinuo Wang, Minh-Son To
, Iris Lee, Ee Win Khoo, Kerolos Hendy, Daniel Koh, Yong Xia, Qi Wu:
PairAug: What Can Augmented Image-Text Pairs Do for Radiology? CVPR 2024: 11652-11661 - [c108]Xinyu Wang, Bohan Zhuang, Qi Wu:
ModaVerse: Efficiently Transforming Modalities with LLMs. CVPR 2024: 26596-26606 - [c107]Gengze Zhou
, Yicong Hong
, Zun Wang
, Xin Eric Wang
, Qi Wu
:
NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models. ECCV (7) 2024: 260-278 - [c106]Yanyuan Qiao
, Qianyi Liu
, Jiajun Liu
, Jing Liu
, Qi Wu
:
LLM as Copilot for Coarse-Grained Vision-and-Language Navigation. ECCV (5) 2024: 459-476 - [c105]Haodong Hong, Sen Wang, Zi Huang, Qi Wu, Jiajun Liu:
Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts. IJCAI 2024: 839-847 - [c104]Zilin Lu
, Yutong Xie, Qingjie Zeng, Mengkang Lu, Qi Wu, Yong Xia:
Spot the Difference: Difference Visual Question Answering with Residual Alignment. MICCAI (5) 2024: 649-658 - [c103]Yili Li
, Jing Yu
, Keke Gai
, Bang Liu
, Gang Xiong
, Qi Wu
:
T2VIndexer: A Generative Video Indexer for Efficient Text-Video Retrieval. ACM Multimedia 2024: 3955-3963 - [c102]Xiangyan Qu
, Jing Yu
, Keke Gai
, Jiamin Zhuang
, Yuanmin Tang
, Gang Xiong
, Gaopeng Gou
, Qi Wu
:
Visual-Semantic Decomposition and Partial Alignment for Document-based Zero-Shot Learning. ACM Multimedia 2024: 4581-4590 - [c101]Haodong Hong
, Sen Wang
, Zi Huang
, Qi Wu
, Jiajun Liu
:
Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments. ACM Multimedia 2024: 7639-7648 - [c100]Yicheng Wu
, Yutong Xie
, Xiangde Luo
, Qi Wu
, Jianfei Cai
:
Dataset, Challenge, and Evaluation for Tumor Segmentation Variability. ACM Multimedia 2024: 11302-11303 - [c99]Keji He, Kehan Chen, Jiawang Bai, Yan Huang, Qi Wu, Shu-Tao Xia, Liang Wang:
Everyday Object Meets Vision-and-Language Navigation Agent via Backdoor. NeurIPS 2024 - [c98]Jiazhao Zhang, Kunyu Wang, Rongtao Xu, Gengze Zhou, Yicong Hong, Xiaomeng Fang, Qi Wu, Zhizheng Zhang, He Wang:
NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation. Robotics: Science and Systems 2024 - [i118]Xinyu Wang, Bohan Zhuang, Qi Wu:
ModaVerse: Efficiently Transforming Modalities with LLMs. CoRR abs/2401.06395 (2024) - [i117]Jiazhao Zhang, Kunyu Wang, Rongtao Xu, Gengze Zhou, Yicong Hong, Xiaomeng Fang, Qi Wu, Zhizheng Zhang, He Wang:
NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation. CoRR abs/2402.15852 (2024) - [i116]Vu Minh Hieu Phan, Yutong Xie, Yuankai Qi, Lingqiao Liu, Liyang Liu, Bowen Zhang, Zhibin Liao, Qi Wu, Minh-Son To
, Johan W. Verjans
:
Decomposing Disease Descriptions for Enhanced Pathology Detection: A Multi-Aspect Vision-Language Pre-training Framework. CoRR abs/2403.07636 (2024) - [i115]Yanyuan Qiao, Zheng Yu, Longteng Guo, Sihan Chen, Zijia Zhao, Mingzhen Sun, Qi Wu, Jing Liu:
VL-Mamba: Exploring State Space Models for Multimodal Learning. CoRR abs/2403.13600 (2024) - [i114]Yutong Xie, Qi Chen, Sinuo Wang, Minh-Son To
, Iris Lee, Ee Win Khoo, Kerolos Hendy, Daniel Koh, Yong Xia, Qi Wu:
PairAug: What Can Augmented Image-Text Pairs Do for Radiology? CoRR abs/2404.04960 (2024) - [i113]Zixiong Huang, Qi Chen, Libo Sun, Yifan Yang, Naizhou Wang, Mingkui Tan, Qi Wu:
G-NeRF: Geometry-enhanced Novel View Synthesis from Single-View Images. CoRR abs/2404.07474 (2024) - [i112]Feng Chen, Zhen Yang, Bohan Zhuang, Qi Wu:
Streaming Video Diffusion: Online Video Editing with Diffusion Models. CoRR abs/2405.19726 (2024) - [i111]Bahram Mohammadi, Yicong Hong, Yuankai Qi, Qi Wu, Shirui Pan, Javen Qinfeng Shi:
Augmented Commonsense Knowledge for Remote Object Grounding. CoRR abs/2406.01256 (2024) - [i110]Haodong Hong, Sen Wang, Zi Huang, Qi Wu, Jiajun Liu:
Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts. CoRR abs/2406.02208 (2024) - [i109]Yue Zhang, Ziqiao Ma, Jialu Li, Yanyuan Qiao, Zun Wang, Joyce Chai, Qi Wu, Mohit Bansal, Parisa Kordjamshidi:
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models. CoRR abs/2407.07035 (2024) - [i108]Gengze Zhou, Yicong Hong, Zun Wang, Xin Eric Wang, Qi Wu:
NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models. CoRR abs/2407.12366 (2024) - [i107]Xiangyan Qu, Jing Yu, Keke Gai, Jiamin Zhuang, Yuanmin Tang, Gang Xiong, Gaopeng Gou, Qi Wu:
Visual-Semantic Decomposition and Partial Alignment for Document-based Zero-Shot Learning. CoRR abs/2407.15613 (2024) - [i106]Biao Wu, Yutong Xie, Zeyu Zhang, Minh Hieu Phan, Qi Chen, Ling Chen, Qi Wu:
XLIP: Cross-modal Attention Masked Modelling for Medical Language-Image Pre-Training. CoRR abs/2407.19546 (2024) - [i105]Haodong Hong, Sen Wang, Zi Huang, Qi Wu, Jiajun Liu:
Navigating Beyond Instructions: Vision-and-Language Navigation in Obstructed Environments. CoRR abs/2407.21452 (2024) - [i104]Yili Li, Jing Yu, Keke Gai, Bang Liu, Gang Xiong, Qi Wu:
T2VIndexer: A Generative Video Indexer for Efficient Text-Video Retrieval. CoRR abs/2408.11432 (2024) - [i103]Yanyuan Qiao, Wenqi Lyu, Hui Wang, Zixu Wang, Zerui Li, Yuan Zhang, Mingkui Tan, Qi Wu:
Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs. CoRR abs/2409.18794 (2024) - [i102]Yuanmin Tang, Jing Yu, Keke Gai, Jiamin Zhuang, Gaopeng Gou, Gang Xiong, Qi Wu:
Denoise-I2W: Mapping Images to Denoising Words for Accurate Zero-Shot Composed Image Retrieval. CoRR abs/2410.17393 (2024) - [i101]Ruoxi Sun, Jiamin Chang, Hammond Pearce, Chaowei Xiao, Bo Li, Qi Wu, Surya Nepal, Minhui Xue:
SoK: Unifying Cybersecurity and Cybersafety of Multimodal Foundation Models with an Information Theory Approach. CoRR abs/2411.11195 (2024) - [i100]Liangqi Lei, Keke Gai, Jing Yu, Liehuang Zhu, Qi Wu:
Conceptwm: A Diffusion Model Watermark for Concept Protection. CoRR abs/2411.11688 (2024) - [i99]Qi Chen, Ruoshan Zhao, Sinuo Wang, Vu Minh Hieu Phan, Anton van den Hengel, Johan Verjans
, Zhibin Liao, Minh-Son To
, Yong Xia, Jian Chen, Yutong Xie, Qi Wu:
A Survey of Medical Vision-and-Language Applications and Their Techniques. CoRR abs/2411.12195 (2024) - [i98]Feng Chen, Chenhui Gou, Jing Liu, Yang Yang, Zhaoyang Li, Jiyuan Zhang, Zhenbang Sun, Bohan Zhuang, Qi Wu:
Evaluating and Advancing Multimodal Large Language Models in Ability Lens. CoRR abs/2411.14725 (2024) - [i97]Gengze Zhou, Yicong Hong, Zun Wang, Chongyang Zhao, Mohit Bansal, Qi Wu:
SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts. CoRR abs/2412.05552 (2024) - [i96]Yuanmin Tang, Xiaoting Qin, Jue Zhang, Jing Yu, Gaopeng Gou, Gang Xiong, Qingwei Ling, Saravan Rajmohan, Dongmei Zhang, Qi Wu:
Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval. CoRR abs/2412.11077 (2024) - 2023
- [j30]Zhihong Lin
, Donghao Zhang, Qingyi Tao, Danli Shi
, Gholamreza Haffari, Qi Wu, Mingguang He, Zongyuan Ge
:
Medical visual question answering: A survey. Artif. Intell. Medicine 143: 102611 (2023) - [j29]Yanyuan Qiao
, Yuankai Qi
, Yicong Hong
, Zheng Yu
, Peng Wang
, Qi Wu
:
HOP+: History-Enhanced and Order-Aware Pre-Training for Vision-and-Language Navigation. IEEE Trans. Pattern Anal. Mach. Intell. 45(7): 8524-8537 (2023) - [j28]Zihan Wang
, Olivia Byrnes, Hu Wang
, Ruoxi Sun
, Congbo Ma, Huaming Chen, Qi Wu
, Minhui Xue
:
Data Hiding With Deep Learning: A Survey Unifying Digital Watermarking and Steganography. IEEE Trans. Comput. Soc. Syst. 10(6): 2985-2999 (2023) - [j27]Mengge He
, Wenjing Du
, Zhiquan Wen
, Qing Du
, Yutong Xie, Qi Wu
:
Multi-Granularity Aggregation Transformer for Joint Video-Audio-Text Representation Learning. IEEE Trans. Circuits Syst. Video Technol. 33(6): 2990-3002 (2023) - [j26]Wei Suo
, Mengyang Sun
, Peng Wang
, Yanning Zhang
, Qi Wu
:
Rethinking and Improving Feature Pyramids for One-Stage Referring Expression Comprehension. IEEE Trans. Image Process. 32: 854-864 (2023) - [j25]Hao Li
, Jinfa Huang, Peng Jin
, Guoli Song
, Qi Wu
, Jie Chen:
Weakly-Supervised 3D Spatial Reasoning for Text-Based Visual Question Answering. IEEE Trans. Image Process. 32: 3367-3382 (2023) - [j24]Mengyang Sun
, Wei Suo
, Peng Wang
, Yanning Zhang
, Qi Wu
:
A Proposal-Free One-Stage Framework for Referring Expression Comprehension and Generation via Dense Cross-Attention. IEEE Trans. Multim. 25: 2446-2458 (2023) - [c97]Zhiquan Wen, Yaowei Wang, Mingkui Tan, Qingyao Wu, Qi Wu:
Digging out Discrimination Information from Generated Samples for Robust Visual Question Answering. ACL (Findings) 2023: 6910-6928 - [c96]Wei Suo, Mengyang Sun, Weisong Liu, Yiqi Gao, Peng Wang, Yanning Zhang, Qi Wu:
S3C: Semi-Supervised VQA Natural Language Explanation via Self-Critical Learning. CVPR 2023: 2646-2656 - [c95]Gaoxiang Cong, Liang Li, Yuankai Qi
, Zheng-Jun Zha, Qi Wu, Wenyu Wang, Bin Jiang, Ming-Hsuan Yang, Qingming Huang:
Learning to Dub Movies via Hierarchical Prosody Models. CVPR 2023: 14687-14697 - [c94]Cristian Rodriguez Opazo, Edison Marrese-Taylor, Basura Fernando, Hiroya Takamura, Qi Wu:
Memory-efficient Temporal Moment Localization in Long Videos. EACL 2023: 1901-1916 - [c93]Xi Tian, Yong-Liang Yang, Qi Wu:
ShapeScaffolder: Structure-Aware 3D Shape Generation from Text. ICCV 2023: 2715-2724 - [c92]Zun Wang, Jialu Li, Yicong Hong, Yi Wang, Qi Wu, Mohit Bansal, Stephen Gould, Hao Tan, Yu Qiao:
Scaling Data Generation in Vision-and-Language Navigation. ICCV 2023: 11975-11986 - [c91]Chaorui Deng, Da Chen
, Qi Wu:
Identity-Consistent Aggregation for Video Object Detection. ICCV 2023: 13388-13398 - [c90]Shubo Liu, Hongsheng Zhang, Yuankai Qi
, Peng Wang, Yanning Zhang, Qi Wu:
AerialVLN: Vision-and-Language Navigation for UAVs. ICCV 2023: 15338-15348 - [c89]Yanyuan Qiao, Zheng Yu, Qi Wu:
VLN-PETL: Parameter-Efficient Transfer Learning for Vision-and-Language Navigation. ICCV 2023: 15397-15406 - [c88]Chaorui Deng, Qi Chen, Pengda Qin, Da Chen, Qi Wu:
Prompt Switch: Efficient CLIP Adaptation for Text-Video Retrieval. ICCV 2023: 15602-15612 - [c87]Yanyuan Qiao, Yuankai Qi
, Zheng Yu, Jing Liu, Qi Wu:
March in Chat: Interactive Prompting for Remote Embodied Referring Expression. ICCV 2023: 15712-15721 - [c86]Yutong Xie, Lin Gu, Tatsuya Harada, Jianpeng Zhang, Yong Xia, Qi Wu:
MedIM: Boost Medical Image Representation via Radiology Report-Guided Masking. MICCAI (1) 2023: 13-23 - [c85]Zheng Yu, Yutong Xie, Yong Xia, Qi Wu:
PLMVQA: Applying Pseudo Labels for Medical Visual Question Answering with Limited Data. MTSAIL/LEAF/AI4Treat/MMMI/REMIA@MICCAI 2023: 357-367 - [c84]Qingbiao Guan, Yutong Xie, Bing Yang, Jianpeng Zhang, Zhibin Liao, Qi Wu, Yong Xia:
Unpaired Cross-Modal Interaction Learning for COVID-19 Segmentation on Limited CT Images. MICCAI (3) 2023: 603-613 - [c83]Biao Wu, Yutong Xie, Zeyu Zhang, Jinchao Ge, Kaspar Yaxley, Suzan Bahadir, Qi Wu, Yifan Liu, Minh-Son To
:
BHSD: A 3D Multi-class Brain Hemorrhage Segmentation Dataset. MLMI@MICCAI (1) 2023: 147-156 - [c82]Zheng Yu, Yanyuan Qiao, Yutong Xie, Qi Wu:
Multi-modal Adapter for Medical Vision-and-Language Learning. MLMI@MICCAI (1) 2023: 393-402 - [c81]Chongyang Zhao
, Yuankai Qi
, Qi Wu
:
Mind the Gap: Improving Success Rate of Vision-and-Language Navigation by Revisiting Oracle Success Routes. ACM Multimedia 2023: 4349-4358 - [c80]Jingying Gao, Qi Wu, Alan Blair, Maurice Pagnucco:
LoRA: A Logical Reasoning Augmented Dataset for Visual Question Answering. NeurIPS 2023 - [i95]Anthony Manchin, Jamie Sherrah, Qi Wu, Anton van den Hengel:
Program Generation from Diverse Video Demonstrations. CoRR abs/2302.00178 (2023) - [i94]Qi Chen, Yutong Xie, Biao Wu, Minh-Son To
, James Ang, Qi Wu:
S4M: Generating Radiology Reports by A Single Model for Multiple Body Parts. CoRR abs/2305.16685 (2023) - [i93]Gengze Zhou, Yicong Hong, Qi Wu:
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models. CoRR abs/2305.16986 (2023) - [i92]Yutong Xie, Bing Yang, Qingbiao Guan, Jianpeng Zhang, Qi Wu, Yong Xia:
Attention Mechanisms in Medical Image Segmentation: A Survey. CoRR abs/2305.17937 (2023) - [i91]Zun Wang, Jialu Li, Yicong Hong, Yi Wang, Qi Wu, Mohit Bansal, Stephen Gould, Hao Tan, Yu Qiao:
Scaling Data Generation in Vision-and-Language Navigation. CoRR abs/2307.15644 (2023) - [i90]Chongyang Zhao, Yuankai Qi, Qi Wu:
Mind the Gap: Improving Success Rate of Vision-and-Language Navigation by Revisiting Oracle Success Routes. CoRR abs/2308.03244 (2023) - [i89]Shubo Liu, Hongsheng Zhang, Yuankai Qi, Peng Wang, Yaning Zhang, Qi Wu:
AerialVLN: Vision-and-Language Navigation for UAVs. CoRR abs/2308.06735 (2023) - [i88]Chaorui Deng, Qi Chen, Pengda Qin, Da Chen
, Qi Wu:
Prompt Switch: Efficient CLIP Adaptation for Text-Video Retrieval. CoRR abs/2308.07648 (2023) - [i87]Chaorui Deng, Da Chen
, Qi Wu:
Identity-Consistent Aggregation for Video Object Detection. CoRR abs/2308.07737 (2023) - [i86]Qi Chen, Chaorui Deng, Zixiong Huang, Bowen Zhang, Mingkui Tan, Qi Wu:
Likelihood-Based Text-to-Image Evaluation with Patch-Level Perceptual and Semantic Credit Assignment. CoRR abs/2308.08525 (2023) - [i85]Yanyuan Qiao, Yuankai Qi, Zheng Yu, Jing Liu, Qi Wu:
March in Chat: Interactive Prompting for Remote Embodied Referring Expression. CoRR abs/2308.10141 (2023) - [i84]Yanyuan Qiao, Zheng Yu, Qi Wu:
VLN-PETL: Parameter-Efficient Transfer Learning for Vision-and-Language Navigation. CoRR abs/2308.10172 (2023) - [i83]Biao Wu, Yutong Xie, Zeyu Zhang, Jinchao Ge, Kaspar Yaxley, Suzan Bahadir, Qi Wu, Yifan Liu, Minh-Son To
:
BHSD: A 3D Multi-Class Brain Hemorrhage Segmentation Dataset. CoRR abs/2308.11298 (2023) - [i82]Wei Suo, Mengyang Sun, Weisong Liu, Yiqi Gao, Peng Wang, Yanning Zhang, Qi Wu:
S3C: Semi-Supervised VQA Natural Language Explanation via Self-Critical Learning. CoRR abs/2309.02155 (2023) - [i81]Xinyu Wang, Bohan Zhuang, Qi Wu:
SwitchGPT: Adapting Large Language Models for Non-Text Outputs. CoRR abs/2309.07623 (2023) - [i80]Yuanmin Tang, Jing Yu, Keke Gai, Jiamin Zhuang, Gang Xiong
, Yue Hu, Qi Wu:
Context-I2W: Mapping Images to Context-dependent Words for Accurate Zero-Shot Composed Image Retrieval. CoRR abs/2309.16137 (2023) - [i79]Yuanmin Tang, Jing Yu, Keke Gai, Yujing Wang, Yue Hu, Gang Xiong
, Qi Wu:
Align before Search: Aligning Ads Image to Text for Accurate Cross-Modal Sponsored Search. CoRR abs/2309.16141 (2023) - [i78]Xiangyu Shi, Yanyuan Qiao, Qi Wu, Lingqiao Liu, Feras Dayoub:
Improving Online Source-free Domain Adaptation for Object Detection by Unsupervised Data Acquisition. CoRR abs/2310.19258 (2023) - [i77]Yuanmin Tang, Jing Yu, Keke Gai, Xiangyan Qu, Yue Hu, Gang Xiong
, Qi Wu:
Watermarking Vision-Language Pre-trained Models for Multi-modal Embedding as a Service. CoRR abs/2311.05863 (2023) - [i76]Yiwen Ye, Yutong Xie, Jianpeng Zhang, Ziyang Chen, Qi Wu, Yong Xia:
Continual Self-supervised Learning: Towards Universal Multi-modal Medical Data Representation Learning. CoRR abs/2311.17597 (2023) - [i75]Yunchuan Ma, Chang Teng, Yuankai Qi, Guorong Li, Laiyun Qing, Qi Wu, Qingming Huang:
Subject-Oriented Video Captioning. CoRR abs/2312.13330 (2023) - [i74]Qi Chen, Dileepa Pitawela, Chongyang Zhao, Gengze Zhou, Hsiang-Ting Chen, Qi Wu:
WebVLN: Vision-and-Language Navigation on Websites. CoRR abs/2312.15820 (2023) - 2022
- [b2]Qi Wu
, Peng Wang
, Xin Wang
, Xiaodong He, Wenwu Zhu:
Visual Question Answering - From Theory to Application. Advances in Computer Vision and Pattern Recognition, Springer 2022, ISBN 978-981-19-0963-4, pp. 1-236 - [j23]Chaorui Deng
, Qi Wu
, Qingyao Wu
, Fuyuan Hu
, Fan Lyu
, Mingkui Tan
:
Visual Grounding Via Accumulated Attention. IEEE Trans. Pattern Anal. Mach. Intell. 44(3): 1670-1684 (2022) - [j22]Chenyu Gao
, Qi Zhu
, Peng Wang
, Hui Li
, Yuliang Liu
, Anton van den Hengel
, Qi Wu
:
Structured Multimodal Attentions for TextVQA. IEEE Trans. Pattern Anal. Mach. Intell. 44(12): 9603-9614 (2022) - [j21]Zeren Sun
, Huafeng Liu, Qiong Wang
, Tianfei Zhou
, Qi Wu
, Zhenmin Tang:
Co-LDL: A Co-Training-Based Label Distribution Learning Method for Tackling Label Noise. IEEE Trans. Multim. 24: 1093-1104 (2022) - [j20]Chuanyi Zhang
, Qiong Wang
, Guo-Sen Xie
, Qi Wu
, Fumin Shen
, Zhenmin Tang:
Robust Learning From Noisy Web Images Via Data Purification for Fine-Grained Recognition. IEEE Trans. Multim. 24: 1198-1209 (2022) - [j19]Amin Parvaneh
, Ehsan Abbasnejad, Qi Wu
, Qinfeng (Javen) Shi
, Anton van den Hengel
:
Show, Price and Negotiate: A Negotiator With Online Value Look-Ahead. IEEE Trans. Multim. 24: 1426-1434 (2022) - [c79]Chenchen Jing, Yunde Jia, Yuwei Wu, Chuanhao Li
, Qi Wu:
Learning the Dynamics of Visual Relational Reasoning via Reinforced Path Routing. AAAI 2022: 1122-1130 - [c78]Jing Gu, Eliana Stefani, Qi Wu, Jesse Thomason, Xin Wang:
Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions. ACL (1) 2022: 7606-7623 - [c77]Xi Tian, Yongliang Yang, Qi Wu:
Enhancing Person Synthesis in Complex Scenes via Intrinsic and Contextual Structure Modeling. BMVC 2022: 491 - [c76]Anthony Manchin, Jamie Sherrah, Qi Wu, Anton van den Hengel:
Program Generation from Diverse Video Demonstrations. BMVC 2022: 1039 - [c75]Yang Ding, Jing Yu, Bang Liu, Yue Hu, Mingxin Cui, Qi Wu:
MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering. CVPR 2022: 5079-5088 - [c74]Chenchen Jing, Yunde Jia, Yuwei Wu, Xinyu Liu, Qi Wu:
Maintaining Reasoning Consistency in Compositional Visual Question Answering. CVPR 2022: 5089-5098 - [c73]Yanyuan Qiao, Yuankai Qi
, Yicong Hong, Zheng Yu, Peng Wang, Qi Wu:
HOP: History-and-Order Aware Pretraining for Vision-and-Language Navigation. CVPR 2022: 15397-15406 - [c72]Yicong Hong, Zun Wang, Qi Wu, Stephen Gould:
Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation. CVPR 2022: 15418-15428 - [c71]Qi Chen, Mingkui Tan, Yuankai Qi
, Jiaqiu Zhou, Yuanqing Li, Qi Wu:
V2C: Visual Voice Cloning. CVPR 2022: 21210-21219 - [c70]Yutong Xie
, Jianpeng Zhang, Yong Xia, Qi Wu:
UniMiSS: Universal Medical Self-supervised Learning via Breaking Dimensionality Barrier. ECCV (21) 2022: 558-575 - [c69]Wei Suo, Mengyang Sun, Kai Niu, Yiqi Gao, Peng Wang, Yanning Zhang, Qi Wu:
A Simple and Robust Correlation Filtering Method for Text-Based Person Search. ECCV (35) 2022: 726-742 - [c68]Wanrong Zhu
, Yuankai Qi
, Pradyumna Narayana, Kazoo Sone, Sugato Basu, Xin Wang, Qi Wu, Miguel P. Eckstein, William Yang Wang:
Diagnosing Vision-and-Language Navigation: What Really Matters. NAACL-HLT 2022: 5981-5993 - [c67]Qi Chen, Chaorui Deng, Qi Wu:
Learning Distinct and Representative Modes for Image Captioning. NeurIPS 2022 - [c66]Mohammad Mahdi Kazemi Moghaddam, Ehsan Abbasnejad, Qi Wu, Qinfeng (Javen) Shi
, Anton van den Hengel:
ForeSI: Success-Aware Visual Navigation Agent. WACV 2022: 3401-3410 - [i73]Yicong Hong, Zun Wang, Qi Wu, Stephen Gould:
Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation. CoRR abs/2203.02764 (2022) - [i72]Yang Ding, Jing Yu, Bang Liu, Yue Hu, Mingxin Cui, Qi Wu:
MuKEA: Multimodal Knowledge Extraction and Accumulation for Knowledge-based Visual Question Answering. CoRR abs/2203.09138 (2022) - [i71]Yanyuan Qiao, Yuankai Qi, Yicong Hong, Zheng Yu, Peng Wang, Qi Wu:
HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation. CoRR abs/2203.11591 (2022) - [i70]Jing Gu, Eliana Stefani, Qi Wu, Jesse Thomason, Xin Eric Wang:
Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions. CoRR abs/2203.12667 (2022) - [i69]Zhipeng Zhang, Xinglin Hou, Kai Niu, Zhongzhen Huang, Tiezheng Ge, Yuning Jiang, Qi Wu, Peng Wang:
Attract me to Buy: Advertisement Copywriting Generation with Multimodal Multi-structured Information. CoRR abs/2205.03534 (2022) - [i68]Yutong Xie, Jianpeng Zhang, Yong Xia, Anton van den Hengel, Qi Wu:
ClusTR: Exploring Efficient Self-attention via Clustering for Vision Transformers. CoRR abs/2208.13138 (2022) - [i67]Qi Chen, Chaorui Deng, Qi Wu:
Learning Distinct and Representative Modes for Image Captioning. CoRR abs/2209.08231 (2022) - [i66]Hao Li, Jinfa Huang, Peng Jin, Guoli Song, Qi Wu, Jie Chen:
Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering. CoRR abs/2209.10326 (2022) - [i65]Gaoxiang Cong, Liang Li, Yuankai Qi, Zhengjun Zha, Qi Wu, Wenyu Wang, Bin Jiang, Ming-Hsuan Yang, Qingming Huang:
Learning to Dub Movies via Hierarchical Prosody Models. CoRR abs/2212.04054 (2022) - 2021
- [j18]Yasi Wang, Yuankai Qi
, Hongxun Yao, Dong Gong
, Qi Wu:
Image editing with varying intensities of processing. Comput. Vis. Image Underst. 211: 103260 (2021) - [j17]Weixia Zhang
, Chao Ma
, Qi Wu
, Xiaokang Yang
:
Language-Guided Navigation via Cross-Modal Grounding and Alternate Adversarial Learning. IEEE Trans. Circuits Syst. Video Technol. 31(9): 3469-3481 (2021) - [j16]Jing Yu
, Xiaoze Jiang
, Zengchang Qin, Weifeng Zhang
, Yue Hu, Qi Wu
:
Learning Dual Encoding Model for Adaptive Visual Understanding in Visual Dialogue. IEEE Trans. Image Process. 30: 220-233 (2021) - [j15]Yanyuan Qiao, Chaorui Deng
, Qi Wu
:
Referring Expression Comprehension: A Survey of Methods and Datasets. IEEE Trans. Multim. 23: 4426-4440 (2021) - [c65]Zhaokai Wang, Renda Bao, Qi Wu, Si Liu:
Confidence-aware Non-repetitive Multimodal Transformers for TextCaps. AAAI 2021: 2835-2843 - [c64]Qi Zhu, Chenyu Gao, Peng Wang, Qi Wu:
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps. AAAI 2021: 3608-3615 - [c63]Li Liu, Mengge He, Guanghui Xu, Mingkui Tan, Qi Wu:
How to Train Your Agent to Read and Write. AAAI 2021: 13397-13405 - [c62]Chaorui Deng, Shizhe Chen, Da Chen
, Yuan He, Qi Wu:
Sketch, Ground, and Refine: Top-Down Dense Video Captioning. CVPR 2021: 234-243 - [c61]Yicong Hong, Qi Wu, Yuankai Qi
, Cristian Rodriguez Opazo, Stephen Gould:
VLN BERT: A Recurrent Vision-and-Language BERT for Navigation. CVPR 2021: 1643-1653 - [c60]Yazhou Yao, Tao Chen, Guo-Sen Xie, Chuanyi Zhang, Fumin Shen, Qi Wu, Zhenmin Tang, Jian Zhang
:
Non-Salient Region Object Mining for Weakly Supervised Semantic Segmentation. CVPR 2021: 2623-2632 - [c59]Chen Gao, Jinyu Chen, Si Liu, Luting Wang
, Qiong Zhang, Qi Wu:
Room-and-Object Aware Knowledge Reasoning for Remote Embodied Referring Expression. CVPR 2021: 3064-3073 - [c58]Yazhou Yao, Zeren Sun, Chuanyi Zhang, Fumin Shen, Qi Wu, Jian Zhang
, Zhenmin Tang:
Jo-SRC: A Contrastive Approach for Combating Noisy Labels. CVPR 2021: 5192-5201 - [c57]Guanghui Xu, Shuaicheng Niu, Mingkui Tan, Yucheng Luo, Qing Du, Qi Wu:
Towards Accurate Text-Based Image Captioning With Content Diversity Exploration. CVPR 2021: 12637-12646 - [c56]Yuankai Qi
, Zizheng Pan, Yicong Hong, Ming-Hsuan Yang, Anton van den Hengel, Qi Wu:
The Road to Know-Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation. ICCV 2021: 1635-1644 - [c55]Chenyu Gao, Qi Zhu, Peng Wang, Qi Wu:
Chop Chop BERT: Visual Question Answering by Chopping VisualBERT's Heads. IJCAI 2021: 664-670 - [c54]Wei Suo, Mengyang Sun, Peng Wang, Qi Wu:
Proposal-free One-stage Referring Expression via Grid-Word Cross-Attention. IJCAI 2021: 1032-1038 - [c53]Jing Yu, Yuan Chai, Yujing Wang, Yue Hu, Qi Wu:
CogTree: Cognition Tree Loss for Unbiased Scene Graph Generation. IJCAI 2021: 1274-1280 - [c52]Yanyuan Qiao, Qi Chen, Chaorui Deng, Ning Ding, Yuankai Qi
, Mingkui Tan, Xincheng Ren, Qi Wu:
R-GAN: Exploring Human-like Way for Reasonable Text-to-Image Synthesis via Generative Adversarial Networks. ACM Multimedia 2021: 2085-2093 - [c51]Dong An
, Yuankai Qi
, Yan Huang, Qi Wu, Liang Wang, Tieniu Tan:
Neighbor-view Enhanced Model for Vision and Language Navigation. ACM Multimedia 2021: 5101-5109 - [c50]Keji He, Yan Huang, Qi Wu, Jianhua Yang, Dong An, Shuanglin Sima, Liang Wang:
Landmark-RxR: Solving Vision-and-Language Navigation with Fine-Grained Alignment Supervision. NeurIPS 2021: 652-663 - [c49]Zhiquan Wen, Guanghui Xu, Mingkui Tan, Qingyao Wu, Qi Wu:
Debiased Visual Question Answering from Feature and Sample Perspectives. NeurIPS 2021: 3784-3796 - [c48]Mohammad Mahdi Kazemi Moghaddam, Qi Wu, Ehsan Abbasnejad, Javen Shi
:
Optimistic Agent: Accurate Graph-Based Value Estimation for More Successful Visual Navigation. WACV 2021: 3732-3741 - [i64]Sourav Garg, Niko Sünderhauf, Feras Dayoub, Douglas Morrison, Akansel Cosgun, Gustavo Carneiro, Qi Wu, Tat-Jun Chin, Ian D. Reid, Stephen Gould, Peter Corke, Michael Milford:
Semantics for Robotic Mapping, Perception and Interaction: A Survey. CoRR abs/2101.00443 (2021) - [i63]Li Liu, Mengge He, Guanghui Xu, Mingkui Tan, Qi Wu:
How to Train Your Agent to Read and Write. CoRR abs/2101.00916 (2021) - [i62]Hu Wang, Hao Chen, Qi Wu, Congbo Ma, Yidong Li, Chunhua Shen:
Multi-intersection Traffic Optimisation: A Benchmark Dataset and a Strong Baseline. CoRR abs/2101.09640 (2021) - [i61]Mohammad Mahdi Kazemi Moghaddam, Ehsan Abbasnejad, Qi Wu, Javen Shi, Anton van den Hengel:
Learning for Visual Navigation by Imagining the Success. CoRR abs/2103.00446 (2021) - [i60]Yazhou Yao, Zeren Sun, Chuanyi Zhang, Fumin Shen, Qi Wu, Jian Zhang, Zhenmin Tang:
Jo-SRC: A Contrastive Approach for Combating Noisy Labels. CoRR abs/2103.13029 (2021) - [i59]Yazhou Yao, Tao Chen, Guo-Sen Xie, Chuanyi Zhang, Fumin Shen, Qi Wu, Zhenmin Tang, Jian Zhang:
Non-Salient Region Object Mining for Weakly Supervised Semantic Segmentation. CoRR abs/2103.14581 (2021) - [i58]Wanrong Zhu, Yuankai Qi, Pradyumna Narayana, Kazoo Sone, Sugato Basu, Xin Eric Wang, Qi Wu, Miguel P. Eckstein, William Yang Wang:
Diagnosing Vision-and-Language Navigation: What Really Matters. CoRR abs/2103.16561 (2021) - [i57]Yuankai Qi, Zizheng Pan, Yicong Hong, Ming-Hsuan Yang, Anton van den Hengel, Qi Wu:
Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation. CoRR abs/2104.04167 (2021) - [i56]Chenyu Gao
, Qi Zhu, Peng Wang, Qi Wu:
Chop Chop BERT: Visual Question Answering by Chopping VisualBERT's Heads. CoRR abs/2104.14741 (2021) - [i55]Wei Suo, Mengyang Sun, Peng Wang, Qi Wu:
Proposal-free One-stage Referring Expression via Grid-Word Cross-Attention. CoRR abs/2105.02061 (2021) - [i54]Guanghui Xu, Shuaicheng Niu, Mingkui Tan, Yucheng Luo, Qing Du, Qi Wu:
Towards Accurate Text-based Image Captioning with Content Diversity Exploration. CoRR abs/2105.03236 (2021) - [i53]Dong An, Yuankai Qi, Yan Huang, Qi Wu, Liang Wang, Tieniu Tan:
Neighbor-view Enhanced Model for Vision and Language Navigation. CoRR abs/2107.07201 (2021) - [i52]Olivia Byrnes, Wendy La, Hu Wang, Congbo Ma, Minhui Xue, Qi Wu:
Data Hiding with Deep Learning: A Survey Unifying Digital Watermarking and Steganography. CoRR abs/2107.09287 (2021) - [i51]Feng Chen, Fei Wu, Qi Wu, Zhiguo Wan:
Memory Regulation and Alignment toward Generalizer RGB-Infrared Person. CoRR abs/2109.08843 (2021) - [i50]Zhihong Lin, Donghao Zhang, Qingyi Tao, Danli Shi, Gholamreza Haffari, Qi Wu, Mingguang He, Zongyuan Ge:
Medical Visual Question Answering: A Survey. CoRR abs/2111.10056 (2021) - [i49]Qi Chen, Yuanqing Li, Yuankai Qi, Jiaqiu Zhou, Mingkui Tan, Qi Wu:
V2C: Visual Voice Cloning. CoRR abs/2111.12890 (2021) - [i48]Yutong Xie, Jianpeng Zhang, Yong Xia, Qi Wu:
Unified 2D and 3D Pre-training for Medical Image classification and Segmentation. CoRR abs/2112.09356 (2021) - [i47]Cristian Rodriguez Opazo, Edison Marrese-Taylor, Basura Fernando, Hiroya Takamura, Qi Wu:
LocFormer: Enabling Transformers to Perform Temporal Moment Localization on Long Untrimmed Videos With a Feature Sampling Approach. CoRR abs/2112.10066 (2021) - 2020
- [j14]Sourav Garg
, Niko Sünderhauf
, Feras Dayoub
, Douglas Morrison, Akansel Cosgun, Gustavo Carneiro, Qi Wu, Tat-Jun Chin, Ian D. Reid, Stephen Gould, Peter Corke
, Michael Milford
:
Semantics for Robotic Mapping, Perception and Interaction: A Survey. Found. Trends Robotics 8(1-2): 1-224 (2020) - [j13]Yan Huang
, Qi Wu
, Wei Wang, Liang Wang:
Image and Sentence Matching via Semantic Concepts and Order Learning. IEEE Trans. Pattern Anal. Mach. Intell. 42(3): 636-650 (2020) - [j12]Qi Chen
, Qi Wu
, Jian Chen
, Qingyao Wu
, Anton van den Hengel
, Mingkui Tan
:
Scripted Video Generation With a Bottom-Up Generative Adversarial Network. IEEE Trans. Image Process. 29: 7454-7467 (2020) - [j11]Jing Yu
, Weifeng Zhang
, Yuhang Lu, Zengchang Qin, Yue Hu, Jianlong Tan, Qi Wu
:
Reasoning on the Relation: Enhancing Visual Representation for Visual Question Answering and Cross-Modal Retrieval. IEEE Trans. Multim. 22(12): 3196-3209 (2020) - [c47]Xiaoze Jiang, Jing Yu, Zengchang Qin, Yingying Zhuang, Xingxing Zhang, Yue Hu, Qi Wu:
DualVD: An Adaptive Dual Encoding Model for Deep Visual Understanding in Visual Dialogue. AAAI 2020: 11125-11132 - [c46]Chenchen Jing, Yuwei Wu, Xiaoxun Zhang, Yunde Jia, Qi Wu:
Overcoming Language Priors in VQA via Decomposed Linguistic Representations. AAAI 2020: 11181-11188 - [c45]Yihan Zheng, Zhiquan Wen, Mingkui Tan, Runhao Zeng, Qi Chen, Yaowei Wang, Qi Wu:
Modular Graph Attention Network for Complex Visual Relational Reasoning. ACCV (6) 2020: 137-153 - [c44]Zhibin Liao, Qi Wu, Chunhua Shen, Anton van den Hengel, Johan Verjans:
AIML at VQA-Med 2020: Knowledge Inference via a Skeleton-based Sentence Mapping Approach for Medical Domain Visual Question Answering. CLEF (Working Notes) 2020 - [c43]Shizhe Chen, Qin Jin, Peng Wang, Qi Wu:
Say As You Wish: Fine-Grained Control of Image Caption Generation With Abstract Scene Graphs. CVPR 2020: 9959-9968 - [c42]Yuankai Qi
, Qi Wu, Peter Anderson, Xin Wang, William Yang Wang, Chunhua Shen, Anton van den Hengel
:
REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments. CVPR 2020: 9979-9988 - [c41]Zhenfang Chen, Peng Wang
, Lin Ma, Kwan-Yee K. Wong
, Qi Wu:
Cops-Ref: A New Dataset and Task on Compositional Referring Expression Comprehension. CVPR 2020: 10083-10092 - [c40]Shizhe Chen, Yida Zhao, Qin Jin, Qi Wu:
Fine-Grained Video-Text Retrieval With Hierarchical Graph Reasoning. CVPR 2020: 10635-10644 - [c39]Qi Chen, Qi Wu, Rui Tang, Yuhan Wang, Shuai Wang, Mingkui Tan:
Intelligent Home 3D: Automatic 3D-House Design From Linguistic Descriptions Only. CVPR 2020: 12622-12631 - [c38]Ehsan Abbasnejad, Iman Abbasnejad, Qi Wu, Javen Shi
, Anton van den Hengel
:
Gold Seeker: Information Gain From Policy Distributions for Goal-Oriented Vision-and-Langauge Reasoning. CVPR 2020: 13447-13456 - [c37]Hu Wang, Qi Wu, Chunhua Shen:
Soft Expert Reward Learning for Vision-and-Language Navigation. ECCV (9) 2020: 126-141 - [c36]Yuankai Qi
, Zizheng Pan, Shengping Zhang, Anton van den Hengel
, Qi Wu:
Object-and-Action Aware Model for Visual Language Navigation. ECCV (10) 2020: 303-317 - [c35]Ruixue Tang, Chao Ma, Wei Emma Zhang
, Qi Wu, Xiaokang Yang:
Semantic Equivalent Adversarial Data Augmentation for Visual Question Answering. ECCV (19) 2020: 437-453 - [c34]Chaorui Deng, Ning Ding, Mingkui Tan, Qi Wu:
Length-Controllable Image Captioning. ECCV (13) 2020: 712-729 - [c33]Yicong Hong, Cristian Rodriguez Opazo, Qi Wu, Stephen Gould:
Sub-Instruction Aware Vision-and-Language Navigation. EMNLP (1) 2020: 3360-3376 - [c32]Xiaoze Jiang, Jing Yu, Yajing Sun, Zengchang Qin, Zihao Zhu, Yue Hu, Qi Wu:
DAM: Deliberation, Abandon and Memory Networks for Generating Detailed and Non-repetitive Responses in Visual Dialogue. IJCAI 2020: 687-693 - [c31]Zihao Zhu, Jing Yu, Yujing Wang, Yajing Sun, Yue Hu, Qi Wu:
Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering. IJCAI 2020: 1097-1103 - [c30]Zhibin Liao, Lingqiao Liu
, Qi Wu, Damien Teney, Chunhua Shen, Anton van den Hengel
, Johan Verjans
:
Medical Data Inquiry Using a Question Answering Model. ISBI 2020: 1490-1493 - [c29]Peng Wang, Dongyang Liu, Hui Li, Qi Wu:
Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge. ACM Multimedia 2020: 28-36 - [c28]Chuanyi Zhang, Yazhou Yao, Xiangbo Shu, Zechao Li, Zhenmin Tang, Qi Wu:
Data-driven Meta-set Based Fine-Grained Visual Recognition. ACM Multimedia 2020: 2372-2381 - [c27]Chenchen Jing, Yuwei Wu, Mingtao Pei, Yao Hu, Yunde Jia, Qi Wu:
Visual-Semantic Graph Matching for Visual Grounding. ACM Multimedia 2020: 4041-4050 - [c26]Fen Liu, Guanghui Xu, Qi Wu, Qing Du, Wei Jia, Mingkui Tan:
Cascade Reasoning Network for Text-based Visual Question Answering. ACM Multimedia 2020: 4060-4069 - [c25]Yicong Hong, Cristian Rodriguez Opazo, Yuankai Qi, Qi Wu, Stephen Gould:
Language and Visual Entity Relationship Graph for Agent Navigation. NeurIPS 2020 - [i46]Shizhe Chen, Qin Jin, Peng Wang, Qi Wu:
Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs. CoRR abs/2003.00387 (2020) - [i45]Shizhe Chen, Yida Zhao, Qin Jin, Qi Wu:
Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning. CoRR abs/2003.00392 (2020) - [i44]Qi Chen, Qi Wu, Rui Tang, Yuhan Wang, Shuai Wang, Mingkui Tan:
Intelligent Home 3D: Automatic 3D-House Design from Linguistic Descriptions Only. CoRR abs/2003.00397 (2020) - [i43]Zhenfang Chen, Peng Wang, Lin Ma, Kwan-Yee K. Wong, Qi Wu:
Cops-Ref: A new Dataset and Task on Compositional Referring Expression Comprehension. CoRR abs/2003.00403 (2020) - [i42]Yicong Hong, Cristian Rodriguez Opazo, Qi Wu, Stephen Gould:
Sub-Instruction Aware Vision-and-Language Navigation. CoRR abs/2004.02707 (2020) - [i41]Mohammad Mahdi Kazemi Moghaddam, Qi Wu, Ehsan Abbasnejad, Javen Shi:
Utilising Prior Knowledge for Visual Navigation: Distil and Adapt. CoRR abs/2004.03222 (2020) - [i40]Chenyu Gao
, Qi Zhu, Peng Wang, Hui Li, Yuliang Liu, Anton van den Hengel, Qi Wu:
Structured Multimodal Attentions for TextVQA. CoRR abs/2006.00753 (2020) - [i39]Peng Wang, Dongyang Liu, Hui Li, Qi Wu:
Give Me Something to Eat: Referring Expression Comprehension with Commonsense Knowledge. CoRR abs/2006.01629 (2020) - [i38]Zihao Zhu, Jing Yu, Yujing Wang, Yajing Sun, Yue Hu, Qi Wu:
Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering. CoRR abs/2006.09073 (2020) - [i37]Xiaoze Jiang, Jing Yu, Yajing Sun, Zengchang Qin, Zihao Zhu, Yue Hu, Qi Wu:
DAM: Deliberation, Abandon and Memory Networks for Generating Detailed and Non-repetitive Responses in Visual Dialogue. CoRR abs/2007.03310 (2020) - [i36]Yanyuan Qiao, Chaorui Deng, Qi Wu:
Referring Expression Comprehension: A Survey of Methods and Datasets. CoRR abs/2007.09554 (2020) - [i35]Chaorui Deng, Ning Ding, Mingkui Tan, Qi Wu:
Length-Controllable Image Captioning. CoRR abs/2007.09580 (2020) - [i34]Ruixue Tang, Chao Ma, Wei Emma Zhang, Qi Wu, Xiaokang Yang:
Semantic Equivalent Adversarial Data Augmentation for Visual Question Answering. CoRR abs/2007.09592 (2020) - [i33]Hu Wang, Qi Wu, Chunhua Shen:
Soft Expert Reward Learning for Vision-and-Language Navigation. CoRR abs/2007.10835 (2020) - [i32]Yuankai Qi, Zizheng Pan, Shengping Zhang, Anton van den Hengel, Qi Wu:
Object-and-Action Aware Model for Visual Language Navigation. CoRR abs/2007.14626 (2020) - [i31]Chuanyi Zhang, Yazhou Yao, Xiangbo Shu, Zechao Li, Zhenmin Tang, Qi Wu:
Data-driven Meta-set Based Fine-Grained Visual Classification. CoRR abs/2008.02438 (2020) - [i30]Jing Yu, Yuan Chai, Yue Hu, Qi Wu:
CogTree: Cognition Tree Loss for Unbiased Scene Graph Generation. CoRR abs/2009.07526 (2020) - [i29]Yicong Hong, Cristian Rodriguez Opazo, Yuankai Qi, Qi Wu, Stephen Gould:
Language and Visual Entity Relationship Graph for Agent Navigation. CoRR abs/2010.09304 (2020) - [i28]Weixia Zhang, Chao Ma, Qi Wu, Xiaokang Yang:
Language-guided Navigation via Cross-Modal Grounding and Alternate Adversarial Learning. CoRR abs/2011.10972 (2020) - [i27]Yicong Hong, Qi Wu, Yuankai Qi, Cristian Rodriguez Opazo, Stephen Gould:
A Recurrent Vision-and-Language BERT for Navigation. CoRR abs/2011.13922 (2020) - [i26]Zhaokai Wang, Renda Bao, Qi Wu, Si Liu:
Confidence-aware Non-repetitive Multimodal Transformers for TextCaps. CoRR abs/2012.03662 (2020) - [i25]Qi Zhu, Chenyu Gao
, Peng Wang, Qi Wu:
Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps. CoRR abs/2012.05153 (2020)
2010 – 2019
- 2019
- [j10]Wenhua Liu
, Yidong Li
, Qi Wu:
An Attribute-Based High-Level Image Representation for Scene Classification. IEEE Access 7: 4629-4640 (2019) - [j9]Jianpeng Zhang, Yutong Xie
, Qi Wu
, Yong Xia
:
Medical image classification using synergic deep learning. Medical Image Anal. 54: 10-19 (2019) - [j8]Junjie Zhang
, Qi Wu
, Jian Zhang
, Chunhua Shen, Jianfeng Lu
, Qiang Wu
:
Heritage image annotation via collective knowledge. Pattern Recognit. 93: 204-214 (2019) - [j7]Fan Lyu
, Qi Wu
, Fuyuan Hu
, Qingyao Wu
, Mingkui Tan
:
Attend and Imagine: Multi-Label Image Classification With Visual Attention and Recurrent Neural Networks. IEEE Trans. Multim. 21(8): 1971-1981 (2019) - [c24]Peng Wang
, Qi Wu, Jiewei Cao, Chunhua Shen, Lianli Gao, Anton van den Hengel:
Neighbourhood Watch: Referring Expression Comprehension via Language-Guided Graph Attention Networks. CVPR 2019: 1960-1968 - [c23]Junjie Zhang, Qi Wu, Jian Zhang
, Chunhua Shen, Jianfeng Lu:
Mind Your Neighbours: Image Annotation With Metadata Neighbourhood Graph Co-Attention Networks. CVPR 2019: 2956-2964 - [c22]Ehsan Abbasnejad, Qi Wu, Qinfeng Shi
, Anton van den Hengel
:
What's to Know? Uncertainty as a Guide to Asking Goal-Oriented Questions. CVPR 2019: 4155-4164 - [c21]Xuguang Duan, Qi Wu, Chuang Gan, Yiwei Zhang
, Wenbing Huang, Anton van den Hengel
, Wenwu Zhu:
Watch, Reason and Code: Learning to Represent Videos Using Program. ACM Multimedia 2019: 1543-1551 - [i24]Yuankai Qi, Qi Wu, Peter Anderson, Marco Liu, Chunhua Shen, Anton van den Hengel:
RERERE: Remote Embodied Referring Expressions in Real indoor Environments. CoRR abs/1904.10151 (2019) - [i23]Amin Parvaneh, Ehsan Abbasnejad, Qi Wu, Javen Shi:
Show, Price and Negotiate: A Hierarchical Attention Recurrent Visual Negotiator. CoRR abs/1905.03721 (2019) - [i22]Shizhe Chen, Yida Zhao, Yuqing Song, Qin Jin, Qi Wu:
Integrating Temporal and Spatial Attentions for VATEX Video Captioning Challenge 2019. CoRR abs/1910.06737 (2019) - [i21]Xiaoze Jiang, Jing Yu, Zengchang Qin, Yingying Zhuang, Xingxing Zhang, Yue Hu, Qi Wu:
DualVD: An Adaptive Dual Encoding Model for Deep Visual Understanding in Visual Dialogue. CoRR abs/1911.07251 (2019) - 2018
- [j6]Qi Wu
, Chunhua Shen
, Peng Wang, Anthony R. Dick
, Anton van den Hengel
:
Image Captioning and Visual Question Answering Based on Attributes and External Knowledge. IEEE Trans. Pattern Anal. Mach. Intell. 40(6): 1367-1381 (2018) - [j5]Peng Wang
, Qi Wu
, Chunhua Shen
, Anthony R. Dick
, Anton van den Hengel
:
FVQA: Fact-Based Visual Question Answering. IEEE Trans. Pattern Anal. Mach. Intell. 40(10): 2413-2427 (2018) - [j4]Junjie Zhang
, Qi Wu
, Chunhua Shen
, Jian Zhang
, Jianfeng Lu
:
Multilabel Image Classification With Regional Latent Semantic Dependencies. IEEE Trans. Multim. 20(10): 2801-2813 (2018) - [c20]Junjie Zhang, Qi Wu, Jian Zhang, Chunhua Shen, Jianfeng Lu:
Kill Two Birds With One Stone: Weakly-Supervised Neural Network for Image Annotation and Tag Refinement. AAAI 2018: 7550-7557 - [c19]Bohan Zhuang, Qi Wu, Chunhua Shen, Ian D. Reid, Anton van den Hengel:
HCVRD: A Benchmark for Large-Scale Human-Centered Visual Relationship Detection. AAAI 2018: 7631-7638 - [c18]Peter Anderson, Abhishek Das, Qi Wu:
Connecting Language and Vision to Actions. ACL (5) 2018: 10-14 - [c17]Peter Anderson
, Qi Wu, Damien Teney, Jake Bruce, Mark Johnson
, Niko Sünderhauf
, Ian D. Reid
, Stephen Gould, Anton van den Hengel
:
Vision-and-Language Navigation: Interpreting Visually-Grounded Navigation Instructions in Real Environments. CVPR 2018: 3674-3683 - [c16]Bohan Zhuang, Qi Wu, Chunhua Shen, Ian D. Reid
, Anton van den Hengel:
Parallel Attention: A Unified Framework for Visual Object Discovery Through Dialogs and Queries. CVPR 2018: 4252-4261 - [c15]Qi Wu, Peng Wang, Chunhua Shen, Ian D. Reid
, Anton van den Hengel:
Are You Talking to Me? Reasoned Visual Dialog Generation Through Adversarial Learning. CVPR 2018: 6106-6115 - [c14]Yan Huang, Qi Wu, Chunfeng Song, Liang Wang:
Learning Semantic Concepts and Order for Image and Sentence Matching. CVPR 2018: 6163-6171 - [c13]Chao Ma, Chunhua Shen, Anthony R. Dick
, Qi Wu, Peng Wang
, Anton van den Hengel, Ian D. Reid
:
Visual Question Answering With Memory-Augmented Networks. CVPR 2018: 6975-6984 - [c12]Chaorui Deng, Qi Wu, Qingyao Wu, Fuyuan Hu, Fan Lyu, Mingkui Tan:
Visual Grounding via Accumulated Attention. CVPR 2018: 7746-7755 - [c11]Junjie Zhang
, Qi Wu
, Chunhua Shen
, Jian Zhang
, Jianfeng Lu
, Anton van den Hengel
:
Goal-Oriented Visual Question Generation via Intermediate Rewards. ECCV (5) 2018: 189-204 - [c10]Jianpeng Zhang, Yutong Xie
, Qi Wu, Yong Xia:
Skin Lesion Classification in Dermoscopy Images Using Synergic Deep Learning. MICCAI (2) 2018: 12-20 - [i20]Peng Wang, Qi Wu, Jiewei Cao, Chunhua Shen, Lianli Gao, Anton van den Hengel:
Neighbourhood Watch: Referring Expression Comprehension via Language-guided Graph Attention Networks. CoRR abs/1812.04794 (2018) - [i19]Ehsan Abbasnejad, Qi Wu, Iman Abbasnejad, Javen Shi, Anton van den Hengel:
An Active Information Seeking Model for Goal-oriented Vision-and-Language Tasks. CoRR abs/1812.06398 (2018) - [i18]Ehsan Abbasnejad, Qi Wu, Javen Shi, Anton van den Hengel:
What's to know? Uncertainty as a Guide to Asking Goal-oriented Questions. CoRR abs/1812.06401 (2018) - 2017
- [j3]Qi Wu, Damien Teney, Peng Wang, Chunhua Shen, Anthony R. Dick
, Anton van den Hengel
:
Visual question answering: A survey of methods and datasets. Comput. Vis. Image Underst. 163: 21-40 (2017) - [j2]Damien Teney
, Qi Wu, Anton van den Hengel
:
Visual Question Answering: A Tutorial. IEEE Signal Process. Mag. 34(6): 63-75 (2017) - [c9]Junjie Zhang, Jian Zhang
, Qi Wu, Qiang Wu
, Jinsong Xu
, Jianfeng Lu, Robin Phua, Kate Curr, Zhenmin Tang
:
Historical Image Annotation by Exploring the Tag Relevance. ACPR 2017: 640-645 - [c8]Peng Wang, Qi Wu, Chunhua Shen, Anton van den Hengel
:
The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions. CVPR 2017: 3909-3918 - [c7]Peng Wang, Qi Wu, Chunhua Shen, Anthony R. Dick, Anton van den Hengel
:
Explicit Knowledge-based Reasoning for Visual Question Answering. IJCAI 2017: 1290-1296 - [i17]Bohan Zhuang, Qi Wu, Chunhua Shen, Ian D. Reid, Anton van den Hengel:
Care about you: towards large-scale human-centric visual relationship detection. CoRR abs/1705.09892 (2017) - [i16]Jianpeng Zhang, Yong Xia, Qi Wu, Yutong Xie:
Classification of Medical Images and Illustrations in the Biomedical Literature Using Synergic Deep Learning. CoRR abs/1706.09092 (2017) - [i15]Bohan Zhuang, Qi Wu, Chunhua Shen, Ian D. Reid, Anton van den Hengel:
Parallel Attention: A Unified Framework for Visual Object Discovery through Dialogs and Queries. CoRR abs/1711.06370 (2017) - [i14]Junjie Zhang, Qi Wu, Jian Zhang, Chunhua Shen, Jianfeng Lu:
Kill Two Birds with One Stone: Weakly-Supervised Neural Network for Image Annotation and Tag Refinement. CoRR abs/1711.06998 (2017) - [i13]Peter Anderson, Qi Wu, Damien Teney, Jake Bruce, Mark Johnson, Niko Sünderhauf, Ian D. Reid, Stephen Gould, Anton van den Hengel:
Vision-and-Language Navigation: Interpreting visually-grounded navigation instructions in real environments. CoRR abs/1711.07280 (2017) - [i12]Qi Wu, Peng Wang, Chunhua Shen, Ian D. Reid, Anton van den Hengel:
Are You Talking to Me? Reasoned Visual Dialog Generation through Adversarial Learning. CoRR abs/1711.07613 (2017) - [i11]Junjie Zhang, Qi Wu, Chunhua Shen, Jian Zhang, Jianfeng Lu, Anton van den Hengel:
Asking the Difficult Questions: Goal-Oriented Visual Question Generation via Intermediate Rewards. CoRR abs/1711.07614 (2017) - [i10]Yan Huang, Qi Wu, Liang Wang:
Learning Semantic Concepts and Order for Image and Sentence Matching. CoRR abs/1712.02036 (2017) - 2016
- [c6]Qi Wu, Chunhua Shen, Lingqiao Liu
, Anthony R. Dick
, Anton van den Hengel
:
What Value Do Explicit High Level Concepts Have in Vision to Language Problems? CVPR 2016: 203-212 - [c5]Qi Wu, Peng Wang, Chunhua Shen, Anthony R. Dick
, Anton van den Hengel
:
Ask Me Anything: Free-Form Visual Question Answering Based on Knowledge from External Sources. CVPR 2016: 4622-4630 - [i9]Qi Wu, Chunhua Shen, Anton van den Hengel, Peng Wang, Anthony R. Dick:
Image Captioning and Visual Question Answering Based on Attributes and Their Related External Knowledge. CoRR abs/1603.02814 (2016) - [i8]Peng Wang, Qi Wu, Chunhua Shen, Anton van den Hengel, Anthony R. Dick:
FVQA: Fact-based Visual Question Answering. CoRR abs/1606.05433 (2016) - [i7]Qi Wu, Damien Teney, Peng Wang, Chunhua Shen, Anthony R. Dick, Anton van den Hengel:
Visual Question Answering: A Survey of Methods and Datasets. CoRR abs/1607.05910 (2016) - [i6]Junjie Zhang, Qi Wu, Chunhua Shen, Jian Zhang, Jianfeng Lu:
Multi-Label Image Classification with Regional Latent Semantic Dependencies. CoRR abs/1612.01082 (2016) - [i5]Peng Wang, Qi Wu, Chunhua Shen, Anton van den Hengel:
The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions. CoRR abs/1612.05386 (2016) - 2015
- [b1]Qi Wu:
Modelling visual objects regardless of depictive style. University of Bath, UK, 2015 - [j1]Peter Hall
, Hongping Cai, Qi Wu, Tadeo Corradi:
Cross-depiction problem: Recognition and synthesis of photographs and artwork. Comput. Vis. Media 1(2): 91-103 (2015) - [c4]Hongping Cai, Qi Wu, Peter Hall
:
Beyond Photo-Domain Object Recognition: Benchmarks for the Cross-Depiction Problem. ICCV Workshops 2015: 74-79 - [i4]Hongping Cai, Qi Wu, Tadeo Corradi, Peter Hall:
The Cross-Depiction Problem: Computer Vision Algorithms for Recognising Objects in Artwork and in Photographs. CoRR abs/1505.00110 (2015) - [i3]Qi Wu, Chunhua Shen, Anton van den Hengel, Lingqiao Liu, Anthony R. Dick:
Image Captioning with an Intermediate Attributes Layer. CoRR abs/1506.01144 (2015) - [i2]Peng Wang, Qi Wu, Chunhua Shen, Anton van den Hengel, Anthony R. Dick:
Explicit Knowledge-based Reasoning for Visual Question Answering. CoRR abs/1511.02570 (2015) - [i1]Qi Wu, Peng Wang, Chunhua Shen, Anton van den Hengel, Anthony R. Dick:
Ask Me Anything: Free-form Visual Question Answering Based on Knowledge from External Sources. CoRR abs/1511.06973 (2015) - 2014
- [c3]Qi Wu, Hongping Cai, Peter Hall
:
Learning Graphs to Model Visual Objects across Different Depictive Styles. ECCV (7) 2014: 313-328 - 2013
- [c2]Qi Wu, Peter Hall
:
Modelling Visual Objects Invariant to Depictive Style. BMVC 2013 - 2012
- [c1]Qi Wu, Peter Hall
:
Prime Shapes in Natural Images. BMVC 2012: 1-12
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-04-29 22:21 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint