default search action
IEEE Transactions on Multimedia, Volume 25
Volume 25, 2023
- Zan-Xia Jin, Heran Wu, Chun Yang, Fang Zhou, Jingyan Qin, Lei Xiao, Xu-Cheng Yin:
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering. 1-12 - Yu Wang, Shiwei Chen:
Multi-Agent Trajectory Prediction With Spatio-Temporal Sequence Fusion. 13-23 - Jiayi Xie, Yaochen Zhu, Zhenzhong Chen:
Micro-Video Popularity Prediction Via Multimodal Variational Information Bottleneck. 24-37 - Zhicheng Guo, Jiaxuan Zhao, Licheng Jiao, Xu Liu, Fang Liu:
A Universal Quaternion Hypergraph Network for Multimodal Video Question Answering. 38-49 - Xiao Lin, Shuzhou Sun, Wei Huang, Bin Sheng, Ping Li, David Dagan Feng:
EAPT: Efficient Attention Pyramid Transformer for Image Processing. 50-61 - Zhi Li, Haoliang Li, Xin Luo, Yongjian Hu, Kwok-Yan Lam, Alex C. Kot:
Asymmetric Modality Translation for Face Presentation Attack Detection. 62-76 - Wei Lu, Desheng Li, Liqiang Nie, Peiguang Jing, Yuting Su:
Learning Dual Low-Rank Representation for Multi-Label Micro-Video Classification. 77-89 - Yun Wang, Tong Zhang, Chuanwei Zhou, Zhen Cui, Jian Yang:
Instance-Aware Deep Graph Learning for Multi-Label Classification. 90-99 - Jae Young Choi, Bumshik Lee:
Combining Deep Convolutional Neural Networks With Stochastic Ensemble Weight Optimization for Facial Expression Recognition in the Wild. 100-111 - Zerui Shao, Yifei Pu, Jiliu Zhou, Bihan Wen, Yi Zhang:
Hyper RPCA: Joint Maximum Correntropy Criterion and Laplacian Scale Mixture Modeling on-the-Fly for Moving Object Detection. 112-125 - Yajing Liu, Zhiwei Xiong, Ya Li, Xinmei Tian, Zheng-Jun Zha:
Domain Generalization Via Encoding and Resampling in a Unified Latent Space. 126-139 - Hangwei Chen, Xiongli Chai, Feng Shao, Xuejin Wang, Qiuping Jiang, Xiangchao Meng, Yo-Sung Ho:
Perceptual Quality Assessment of Cartoon Images. 140-153 - Yang Li, Shengbin Meng, Xinfeng Zhang, Meng Wang, Shiqi Wang, Yue Wang, Siwei Ma:
User-Generated Video Quality Assessment: A Subjective and Objective Study. 154-166 - Yan Yang, Jun Yu, Jian Zhang, Weidong Han, Hanliang Jiang, Qingming Huang:
Joint Embedding of Deep Visual and Semantic Features for Medical Image Report Generation. 167-178 - Hancheng Zhu, Yong Zhou, Leida Li, Yaqian Li, Yandong Guo:
Learning Personalized Image Aesthetics From Subjective and Objective Attributes. 179-190 - Jun Cheng, Fusheng Hao, Fengxiang He, Liu Liu, Qieshi Zhang:
Mixer-Based Semantic Spread for Few-Shot Learning. 191-202 - Haojie Yuan, Qi Chu, Feng Zhu, Rui Zhao, Bin Liu, Nenghai Yu:
AutoMA: Towards Automatic Model Augmentation for Transferable Adversarial Attacks. 203-213 - Zefan Li, Bingbing Ni, Xiaokang Yang, Wenjun Zhang, Wen Gao:
Residual Quantization for Low Bit-Width Neural Networks. 214-227 - Zhaoliang Chen, Jie Yao, Guobao Xiao, Shiping Wang:
Efficient and Differentiable Low-Rank Matrix Completion With Back Propagation. 228-242 - Tong Xue, Abdallah El Ali, Tianyi Zhang, Gangyi Ding, Pablo César:
CEAP-360VR: A Continuous Physiological and Behavioral Emotion Annotation Dataset for 360$^\circ$ VR Videos. 243-255 - Gaosheng Liu, Huanjing Yue, Jiamin Wu, Jing-Yu Yang:
Intra-Inter View Interaction Network for Light Field Image Super-Resolution. 256-266 - Zhihao Wu, Jie Wen, Yong Xu, Jian Yang, David Zhang:
Multiple Instance Detection Networks With Adaptive Instance Refinement. 267-279 - Yanhua Yang, Xiaozhe Zhang, Muli Yang, Cheng Deng:
Adaptive Bias-Aware Feature Generation for Generalized Zero-Shot Learning. 280-290 - Tung-I Chen, Yueh-Cheng Liu, Hung-Ting Su, Yu-Cheng Chang, Yu-Hsiang Lin, Jia-Fong Yeh, Wen-Chin Chen, Winston H. Hsu:
Dual-Awareness Attention for Few-Shot Object Detection. 291-301 - Laizhong Cui, Erchao Ni, Yipeng Zhou, Zhi Wang, Lei Zhang, Jiangchuan Liu, Yuedong Xu:
Towards Real-Time Video Caching at Edge Servers: A Cost-Aware Deep Q-Learning Solution. 302-314 - Sutong Wang, Jiacheng Zhu, Yunqiang Yin, Dujuan Wang, T. C. Edwin Cheng, Yanzhang Wang:
Interpretable Multi-Modal Stacking-Based Ensemble Learning Method for Real Estate Appraisal. 315-328 - Zhihao Zhang, Xianqiang Yang, Chao Xu:
Natural Image Stitching With Layered Warping Constraint. 329-338 - Hao Tang, Guoshuai Zhao, Yuxia Wu, Xueming Qian:
Multisample-Based Contrastive Loss for Top-K Recommendation. 339-351 - Ke Zhang, Chun Yuan, Yiming Zhu, Yong Jiang, Lishu Luo:
Weakly Supervised Instance Segmentation by Exploring Entire Object Regions. 352-363 - Astha Verma, A. Venkata Subramanyam, Zheng Wang, Shin'ichi Satoh, Rajiv Ratn Shah:
Unsupervised Domain Adaptation for Person Re-Identification Via Individual-Preserving and Environmental-Switching Cyclic Generation. 364-377 - Carlos M. Lentisco, Luis Bellido, Andrés Cárdenas, Ricardo Flores Moyano, David Fernández:
Design of a 5G Multimedia Broadcast Application Function Supporting Adaptive Error Recovery. 378-388 - Huicong Wu, Liang Xiao, Le Sun, Byeungwoo Jeon:
A Novel Video Stabilization Model With Motion Morphological Component Priors. 389-404 - Xuehao Gao, Yang Yang, Yimeng Zhang, Maosen Li, Jin-Gang Yu, Shaoyi Du:
Efficient Spatio-Temporal Contrastive Learning for Skeleton-Based 3-D Action Recognition. 405-417 - Cheng Xue, Xionghu Zhong, Minjie Cai, Hao Chen, Wenwu Wang:
Audio-Visual Event Localization by Learning Spatial and Semantic Co-Attention. 418-429 - Guang Han, Jinpeng Su, Yaoming Liu, Yuqiu Zhao, Sam Kwong:
Multi-Stage Visual Tracking With Siamese Anchor-Free Proposal Network. 430-442 - Lei Yu, Bishan Wang, Jingwei He, Gui-Song Xia, Wen Yang:
Single Image Deraining With Continuous Rain Density Estimation. 443-456 - Jianjun Xiang, Gangyi Jiang, Mei Yu, Zhidi Jiang, Yo-Sung Ho:
No-Reference Light Field Image Quality Assessment Using Four-Dimensional Sparse Transform. 457-472 - Mehdi Rahmati, Zhuoran Qi, Dario Pompili:
Underwater Adaptive Video Transmissions Using MIMO-Based Software-Defined Acoustic Modems. 473-485 - Nan Jiang, Kuiran Wang, Xiaoke Peng, Xuehui Yu, Qiang Wang, Junliang Xing, Guorong Li, Guodong Guo, Qixiang Ye, Jianbin Jiao, Jian Zhao, Zhenjun Han:
Anti-UAV: A Large-Scale Benchmark for Vision-Based UAV Tracking. 486-500 - Yujie Huang, Ming-e Jing, Jinjia Zhou, Yuhao Liu, Yibo Fan:
LCCStyle: Arbitrary Style Transfer With Low Computational Complexity. 501-514 - Jing Yi, Yaochen Zhu, Jiayi Xie, Zhenzhong Chen:
Cross-Modal Variational Auto-Encoder for Content-Based Micro-Video Background Music Recommendation. 515-528 - Luntian Mou, Chao Zhou, Pengtao Xie, Pengfei Zhao, Ramesh C. Jain, Wen Gao, Baocai Yin:
Isotropic Self-Supervised Learning for Driver Drowsiness Detection With Attention-Based Multimodal Fusion. 529-542 - Wenhui Li, Yan Wang, Yuting Su, Xuanya Li, An-An Liu, Yongdong Zhang:
Multi-Scale Fine-Grained Alignments for Image and Sentence Matching. 543-556 - Yongqiang Kong, Yunhong Wang, Annan Li, Qiuyu Huang:
Self-Sufficient Feature Enhancing Networks for Video Salient Object Detection. 557-571 - Qinchuan Zhang, Yi Jiang, Qin Zhou, Yiru Zhao, Yao Liu, Hongtao Lu, Xian-Sheng Hua:
Single Person Dense Pose Estimation via Geometric Equivariance Consistency. 572-583 - Kailun Zhou, Liping Zhao, Zigao Ye, Huihui Wang, Tao Lin, Sheng Feng, Yufen Yang:
Equal Value String and Copy Above String Based String Prediction for SCC in AVS3. 584-592 - Maja Krivokuca, Ehsan Miandji, Christine Guillemot, Philip A. Chou:
Compression of Plenoptic Point Cloud Attributes Using 6-D Point Clouds and 6-D Transforms. 593-607 - Xiaoqing Luo, Yuanhao Gao, Anqi Wang, Zhancheng Zhang, Xiaojun Wu:
IFSepR: A General Framework for Image Fusion Based on Separate Representation Learning. 608-623 - Shihao Xu, Haocong Rao, Xiping Hu, Jun Cheng, Bin Hu:
Prototypical Contrast and Reverse Prediction: Unsupervised Skeleton Based Action Recognition. 624-634 - Huabing Zhou, Wei Wu, Yanduo Zhang, Jiayi Ma, Haibin Ling:
Semantic-Supervised Infrared and Visible Image Fusion Via a Dual-Discriminator Generative Adversarial Network. 635-648 - Ming Li, Bin Fu, Zhengfu Zhang, Yu Qiao:
Character-Aware Sampling and Rectification for Scene Text Recognition. 649-661 - Mingyue Su, Guanghua Gu, Xianlong Ren, Hao Fu, Yao Zhao:
Semi-Supervised Knowledge Distillation for Cross-Modal Hashing. 662-675 - Lei Zhu, Xiaoqiang Wang, Ping Li, Xin Yang, Qing Zhang, Weiming Wang, Carola-Bibiane Schönlieb, C. L. Philip Chen:
S $^3$ Net: Self-Supervised Self-Ensembling Network for Semi-Supervised RGB-D Salient Object Detection. 676-689 - Xinjue Hu, Yuxuan Pan, Yumei Wang, Lin Zhang, Shervin Shirmohammadi:
Multiple Description Coding for Best-Effort Delivery of Light Field Video Using GNN-Based Compression. 690-705 - Le Wang, Qing Li, Sanping Zhou, Nanning Zheng:
Multi-Panda Tracking. 706-720 - Changsheng Gao, Dong Liu, Li Li, Feng Wu:
Towards Task-Generic Image Compression: A Study of Semantics-Oriented Metrics. 721-735 - Pei Lv, Jianqi Fan, Xixi Nie, Weiming Dong, Xiaoheng Jiang, Bing Zhou, Mingliang Xu, Changsheng Xu:
User-Guided Personalized Image Aesthetic Assessment Based on Deep Reinforcement Learning. 736-749 - Xiao Tan, Huaian Chen, Kai Xu, Yi Jin, Changan Zhu:
Deep SR-HDR: Joint Learning of Super-Resolution and High Dynamic Range Imaging for Dynamic Scenes. 750-763 - Zhen Bai, Zhi Liu, Gongyang Li, Yang Wang:
Adaptive Group-Wise Consistency Network for Co-Saliency Detection. 764-776 - Chenghu Du, Feng Yu, Minghua Jiang, Ailing Hua, Xiong Wei, Tao Peng, Xinrong Hu:
VTON-SCFA: A Virtual Try-On Network Based on the Semantic Constraints and Flow Alignment. 777-791 - Shiji Zhou, Zhi Wang, Chenghao Hu, Yinan Mao, Haopeng Yan, Shanghang Zhang, Chuan Wu, Wenwu Zhu:
Caching in Dynamic Environments: A Near-Optimal Online Learning Approach. 792-804 - Shuyi Li, Bob Zhang, Lunke Fei, Shuping Zhao, Yicong Zhou:
Learning Sparse and Discriminative Multimodal Feature Codes for Finger Recognition. 805-815 - Wenxue Cui, Shaohui Liu, Feng Jiang, Debin Zhao:
Image Compressed Sensing Using Non-Local Neural Network. 816-830 - Nastaran Nourbakhsh Kaashki, Pengpeng Hu, Adrian Munteanu:
Anet: A Deep Neural Network for Automatic 3D Anthropometric Measurement Extraction. 831-844 - Xiaoyan Cai, Sen Liu, Junwei Han, Libin Yang, Zhenguo Liu, Tianming Liu:
ChestXRayBERT: A Pretrained Language Model for Chest Radiology Report Summarization. 845-855 - Xuemeng Song, Shi-Ting Fang, Xiaolin Chen, Yinwei Wei, Zhongzhou Zhao, Liqiang Nie:
Modality-Oriented Graph Learning Toward Outfit Compatibility Modeling. 856-867 - Jie Nie, Zian Zhao, Lei Huang, Weizhi Nie, Zhiqiang Wei:
Cross-Domain Recommendation Via User-Clustering and Multidimensional Information Fusion. 868-880 - Haimin Zhang, Min Xu:
Recognition of Emotions in User-Generated Videos through Frame-Level Adaptation and Emotion Intensity Learning. 881-891 - Fei Peng, Bo Long, Min Long:
A Semi-Fragile Reversible Watermarking for Authenticating 3D Models Based on Virtual Polygon Projection and Double Modulation Strategy. 892-906 - Karam Park, Jae Woong Soh, Nam Ik Cho:
A Dynamic Residual Self-Attention Network for Lightweight Single Image Super-Resolution. 907-918 - Ming Li, Jun Liu, Ce Zheng, Xinming Huang, Ziming Zhang:
Exploiting Multi-View Part-Wise Correlation via an Efficient Transformer for Vehicle Re-Identification. 919-929 - Liyuan Ma, Kejie Huang, Dongxu Wei, Zhaoyan Ming, Haibin Shen:
FDA-GAN: Flow-Based Dual Attention GAN for Human Pose Transfer. 930-941 - Chongyang Bai, Haipeng Chen, Srijan Kumar, Jure Leskovec, V. S. Subrahmanian:
M2P2: Multimodal Persuasion Prediction Using Adaptive Fusion. 942-952 - Prasen Kumar Sharma, Arun Abraham, Vikram Nelvoy Rajendiran:
A Generalized Zero-Shot Quantization of Deep Convolutional Neural Networks Via Learned Weights Statistics. 953-965 - Fan Zhao, Wenda Zhao, Huimin Lu, Yong Liu, Libo Yao, Yu Liu:
Depth-Distilled Multi-Focus Image Fusion. 966-978 - Xuanhan Wang, Yuyu Guo, Jingkuan Song, Lianli Gao, Heng Tao Shen:
AMANet: Adaptive Multi-Path Aggregation for Learning Human 2D-3D Correspondences. 979-992 - Tiejian Zhang, Xinwang Liu, Lei Gong, Siwei Wang, Xin Niu, Li Shen:
Late Fusion Multiple Kernel Clustering With Local Kernel Alignment Maximization. 993-1007 - Yiming Wang, Dongxia Chang, Zhiqiang Fu, Yao Zhao:
Consistent Multiple Graph Embedding for Multi-View Clustering. 1008-1018 - Jingjing Xiong, Lai-Man Po, Wing Yin Yu, Yuzhi Zhao, Kwok-Wai Cheung:
Distortion Map-Guided Feature Rectification for Efficient Video Semantic Segmentation. 1019-1032 - Wei Qin, Hanwang Zhang, Richang Hong, Ee-Peng Lim, Qianru Sun:
Causal Interventional Training for Image Recognition. 1033-1044 - Shikun Li, Tongliang Liu, Jiyong Tan, Dan Zeng, Shiming Ge:
Trustable Co-Label Learning From Multiple Noisy Annotators. 1045-1057 - Jiebo Luo:
Editorial. 1058-1059 - Yonggang Wen:
Editorial. 1060 - Wenqian Wang, Faliang Chang, Chunsheng Liu, Guangxin Li, Bin Wang:
GA-Net: A Guidance Aware Network for Skeleton-Based Early Activity Recognition. 1061-1073 - Qifan Wang, Yinwei Wei, Jianhua Yin, Jianlong Wu, Xuemeng Song, Liqiang Nie:
DualGNN: Dual Graph Neural Network for Multimedia Recommendation. 1074-1084 - Xiaoping Liang, Zhenjun Tang, Jingli Wu, Zhixin Li, Xinpeng Zhang:
Robust Image Hashing With Isomap and Saliency Map for Copy Detection. 1085-1097 - Shuping Zhao, Lunke Fei, Jie Wen, Jigang Wu, Bob Zhang:
Intrinsic and Complete Structure Learning Based Incomplete Multiview Clustering. 1098-1110 - Shixiang Wu, Chao Dong, Yu Qiao:
Blind Image Restoration Based on Cycle-Consistent Network. 1111-1124 - Jose Jaena Mari Ople, Tai-Ming Huang, Ming-Chih Chiu, Yi-Ling Chen, Kai-Lung Hua:
Adjustable Model Compression Using Multiple Genetic Algorithm. 1125-1132 - Le Wang, Mo Zhou, Zhenxing Niu, Qilin Zhang, Nanning Zheng:
Adaptive Ladder Loss for Learning Coherent Visual-Semantic Embedding. 1133-1147 - Weide Liu, Xiangfei Kong, Tzu-Yi Hung, Guosheng Lin:
Cross-Image Region Mining With Region Prototypical Network for Weakly Supervised Segmentation. 1148-1160 - Ziqiang Wang, Zhi Liu, Gongyang Li, Yang Wang, Tianhong Zhang, Lihua Xu, Jijun Wang:
Spatio-Temporal Self-Attention Network for Video Saliency Prediction. 1161-1174 - Rui Wang, Jun Liu, Qiuhong Ke, Duo Peng, Yinjie Lei:
Dear-Net: Learning Diversities for Skeleton-Based Early Action Recognition. 1175-1189 - Cheng Wang, Bingpeng Ma, Hong Chang, Shiguang Shan, Xilin Chen:
Person Search by a Bi-Directional Task-Consistent Learning Model. 1190-1203 - Jipeng Wu, Rongrong Ji, Qiang Wang, Shengchuan Zhang, Xiaoshuai Sun, Yan Wang, Mingliang Xu, Feiyue Huang:
Fast Monocular Depth Estimation via Side Prediction Aggregation with Continuous Spatial Refinement. 1204-1216 - Di Wang, Caiping Zhang, Quan Wang, Yumin Tian, Lihuo He, Lin Zhao:
Hierarchical Semantic Structure Preserving Hashing for Cross-Modal Retrieval. 1217-1229 - Min Cao, Cong Ding, Chen Chen, Hao Dou, Xiyuan Hu, Junchi Yan:
Progressive Context-Aware Graph Feature Learning for Target Re-Identification. 1230-1242 - Yuting Su, Wei Zhao, Peiguang Jing, Liqiang Nie:
Exploiting Low-Rank Latent Gaussian Graphical Model Estimation for Visual Sentiment Distributions. 1243-1255 - Gaoang Wang, Yizhou Wang, Renshu Gu, Weijie Hu, Jenq-Neng Hwang:
Split and Connect: A Universal Tracklet Booster for Multi-Object Tracking. 1256-1268 - Qiao Liu, Di Yuan, Nana Fan, Peng Gao, Xin Li, Zhenyu He:
Learning Dual-Level Deep Representation for Thermal Infrared Tracking. 1269-1281 - Wenhao Li, Hong Liu, Runwei Ding, Mengyuan Liu, Pichao Wang, Wenming Yang:
Exploiting Temporal Contexts With Strided Transformer for 3D Human Pose Estimation. 1282-1293 - Mengxi Jia, Xinhua Cheng, Shijian Lu, Jian Zhang:
Learning Disentangled Representation Implicitly Via Transformer for Occluded Person Re-Identification. 1294-1305 - Zhe Tang, Yi Yang, Wen Li, Defu Lian, Lixin Duan:
Deep Cross-Attention Network for Crowdfunding Success Prediction. 1306-1319 - Kun Zhang, Zhendong Mao, An-An Liu, Yongdong Zhang:
Unified Adaptive Relevance Distinguishable Attention Network for Image-Text Matching. 1320-1332 - Dongnan Liu, Chaoyi Zhang, Yang Song, Heng Huang, Chenyu Wang, Michael Barnett, Tom Weidong Cai:
Decompose to Adapt: Cross-Domain Object Detection Via Feature Disentanglement. 1333-1344 - Bin Chen, Kunhong Liu, Yong Xu, Qingqiang Wu, Junfeng Yao:
Block Division Convolutional Network With Implicit Deep Features Augmentation for Micro-Expression Recognition. 1345-1358 - Yingjian Li, Zheng Zhang, Bingzhi Chen, Guangming Lu, David Zhang:
Deep Margin-Sensitive Representation Learning for Cross-Domain Facial Expression Recognition. 1359-1373 - Jianjun Sun, Yan Zhao, Shigang Wang, Jian Wei:
3D Holoscopic Image Compression Based on Gaussian Mixture Model. 1374-1389 - Huan Liu, Wentao Liu, Zhixiang Chi, Yang Wang, Yuanhao Yu, Jun Chen, Jin Tang:
Fast Human Pose Estimation in Compressed Videos. 1390-1400 - Yujian Feng, Yimu Ji, Fei Wu, Guangwei Gao, Yang Gao, Tianliang Liu, Shangdong Liu, Xiao-Yuan Jing, Jiebo Luo:
Occluded Visible-Infrared Person Re-Identification. 1401-1413 - Haoyu Zhao, Qi Wang, Guowei Zhan, Weidong Min, Yi Zou, Shimiao Cui:
Need Only One More Point (NOOMP): Perspective Adaptation Crowd Counting in Complex Scenes. 1414-1426 - Jianjun Qian, Shumin Zhu, Chaoyu Zhao, Jian Yang, Wai Keung Wong:
OTFace: Hard Samples Guided Optimal Transport Loss for Deep Face Representation. 1427-1438 - Tianyu Shen, Deqi Li, Fei-Yue Wang, Hua Huang:
Depth-Aware Multi-Person 3D Pose Estimation With Multi-Scale Waterfall Representations. 1439-1451 - Qianqian Yu, Keqi Fan, Yuhui Zheng:
Domain Adaptive Transformer Tracking Under Occlusions. 1452-1461