"XGPT: Cross-modal Generative Pre-Training for Image Captioning."

Qiaolin Xia et al. (2020)
a service of  Schloss Dagstuhl - Leibniz Center for Informatics