default search action
IEEE Transactions on Multimedia, Volume 24
Volume 24, 2022
- Pan Gao, Pengwei Zhang, Aljosa Smolic:
Quality Assessment for Omnidirectional Video: A Spatio-Temporal Distortion Modeling Approach. 1-16 - Haonan Su, Long Yu, Cheolkon Jung:
Joint Contrast Enhancement and Noise Reduction of Low Light Images Via JND Transform. 17-32 - Yongqiang Gui, Hancheng Lu, Feng Wu, Chang Wen Chen:
LensCast: Robust Wireless Video Transmission Over MmWave MIMO With Lens Antenna Array. 33-48 - Haonan Fan, Hai-Miao Hu, Shuailing Liu, Weiqing Lu, Shiliang Pu:
Correlation Graph Convolutional Network for Pedestrian Attribute Recognition. 49-60 - Chih-Hung Liang, Yu-An Chen, Yueh-Cheng Liu, Winston H. Hsu:
Raw Image Deblurring. 61-72 - Zhenyu Wu, Shuai Li, Chenglizhao Chen, Aimin Hao, Hong Qin:
Deeper Look at Image Salient Object Detection: Bi-Stream Network With a Small Training Dataset. 73-86 - Lei Cao, Huijun Zhang, Ling Feng:
Building and Using Personal Knowledge Graph to Improve Suicidal Ideation Detection on Social Media. 87-102 - Chunxiao Liu, Zhendong Mao, Tianzhu Zhang, An-An Liu, Bin Wang, Yongdong Zhang:
Focus Your Attention: A Focal Attention for Multimodal Learning. 103-115 - Fei Ye, Chaoqin Huang, Jinkun Cao, Maosen Li, Ya Zhang, Cewu Lu:
Attribute Restoration Framework for Anomaly Detection. 116-127 - Shaoyue Song, Zhenjiang Miao, Hongkai Yu, Jianwu Fang, Kang Zheng, Cong Ma, Song Wang:
Deep Domain Adaptation Based Multi-Spectral Salient Object Detection. 128-140 - Haitao Zeng, Xinhang Song, Gongwei Chen, Shuqiang Jiang:
Amorphous Region Context Modeling for Scene Recognition. 141-151 - Xinpeng Huang, Ping An, Yilei Chen, Deyang Liu, Liquan Shen:
Low Bitrate Light Field Compression With Geometry and Content Consistency. 152-165 - Xingyuan Zhang, Fuhai Zhang:
Differentiable Spatial Regression: A Novel Method for 3D Hand Pose Estimation. 166-176 - Xingyu Chen, Jin Li, Xuguang Lan, Nanning Zheng:
Generalized Zero-Shot Learning Via Multi-Modal Aggregated Posterior Aligning Neural Network. 177-187 - Jingjia Huang, Wei Yan, Thomas H. Li, Shan Liu, Ge Li:
Learning the Global Descriptor for 3-D Object Recognition Based on Multiple Views Decomposition. 188-201 - Canqiang Chen, Chunmei Qing, Xiangmin Xu, Patrick Dickinson:
Cross Parallax Attention Network for Stereo Image Super-Resolution. 202-216 - Xun Gong, Zu Yao, Xin Li, Yueqiao Fan, Bin Luo, Jianfeng Fan, Boji Lao:
LAG-Net: Multi-Granularity Network for Person Re-Identification via Local Attention System. 217-229 - Jinwei Wang, Junjie Zhao, Qilin Yin, Xiangyang Luo, Yuhui Zheng, Yun-Qing Shi, Sunil Kr. Jha:
SmsNet: A New Deep Convolutional Neural Network Model for Adversarial Example Detection. 230-244 - Joongchol Shin, Hasil Park, Joonki Paik:
Region-Based Dehazing via Dual-Supervised Triple-Convolutional Network. 245-260 - Yu-Jen Ma, Hong-Han Shuai, Wen-Huang Cheng:
Spatiotemporal Dilated Convolution With Uncertain Matching for Video-Based Crowd Estimation. 261-273 - Che Sun, Hao Song, Xinxiao Wu, Yunde Jia, Jiebo Luo:
Exploiting Informative Video Segments for Temporal Action Localization. 274-287 - Xu Chen, Chenqiang Gao, Chaoyu Li, Yi Yang, Deyu Meng:
Infrared Action Detection in the Dark via Cross-Stream Attention Mechanism. 288-300 - Xuefeng Zhu, Xiaojun Wu, Tianyang Xu, Zhen-Hua Feng, Josef Kittler:
Robust Visual Object Tracking Via Adaptive Attribute-Aware Discriminative Correlation Filters. 301-312 - Xing Zhang, Zuxuan Wu, Yu-Gang Jiang:
SAM: Modeling Scene, Object and Action With Semantics Attention Modules for Video Recognition. 313-322 - Bo Wang, Mingwei Xu, Fengyuan Ren, Jianping Wu:
Improving Robustness of DASH Against Unpredictable Network Variations. 323-337 - Aihua Zheng, Menglan Hu, Bo Jiang, Yan Huang, Yan Yan, Bin Luo:
Adversarial-Metric Learning for Audio-Visual Cross-Modal Matching. 338-351 - Mengyang Zhang, Guohui Tian, Ying Zhang, Peng Duan:
Reinforcement Learning for Logic Recipe Generation: Bridging Gaps From Images to Plans. 352-365 - Mauricio Perez, Jun Liu, Alex C. Kot:
Interaction Relational Network for Mutual Action Recognition. 366-376 - Lê Minh Ngô, Sezer Karaoglu, Theo Gevers:
Self-Supervised Face Image Manipulation by Conditioning GAN on Face Decomposition. 377-385 - Pantelis Maniotis, Nikolaos Thomos:
Viewport-Aware Deep Reinforcement Learning Approach for 360$^\circ$ Video Caching. 386-399 - Xinchao Dong, Liquan Shen, Mei Yu, Hao Yang:
Fast Intra Mode Decision Algorithm for Versatile Video Coding. 400-414 - Yaoyu Li, Hantao Yao, Changsheng Xu:
Intra-Domain Consistency Enhancement for Unsupervised Person Re-Identification. 415-425 - Zhihao Shi, Xiaohong Liu, Kangdi Shi, Linhui Dai, Jun Chen:
Video Frame Interpolation via Generalized Deformable Convolution. 426-439 - Yang Zhang, Moyun Liu, Jingwu He, Fei Pan, Yanwen Guo:
Affinity Fusion Graph-Based Framework for Natural Image Segmentation. 440-450 - Zhuoman Liu, Wei Jia, Ming Yang, Peiyao Luo, Yong Guo, Mingkui Tan:
Deep View Synthesis via Self-Consistent Generative Network. 451-465 - Peng-Fei Zhang, Yang Li, Zi Huang, Xin-Shun Xu:
Aggregation-Based Graph Convolutional Hashing for Unsupervised Cross-Modal Retrieval. 466-479 - Ziqiang Zheng, Zhibin Yu, Haiyong Zheng, Yang Yang, Heng Tao Shen:
One-Shot Image-to-Image Translation via Part-Global Learning With a Multi-Adversarial Framework. 480-491 - Tengpeng Li, Kaihua Zhang, Shiwen Shen, Bo Liu, Qingshan Liu, Zhu Li:
Image Co-Saliency Detection and Instance Co-Segmentation Using Attention Graph Clustering Based Graph Convolutional Network. 492-505 - Xusong Chen, Chenyi Lei, Dong Liu, Guoxin Wang, Haihong Tang, Zheng-Jun Zha, Houqiang Li:
E-Commerce Storytelling Recommendation Using Attentional Domain-Transfer Network and Adversarial Pre-Training. 506-518 - Zhaoqing Pan, Feng Yuan, Jianjun Lei, Wanqing Li, Nam Ling, Sam Kwong:
MIEGAN: Mobile Image Enhancement via a Multi-Module Cascade Neural Network. 519-533 - Liming Xu, Xianhua Zeng, Weisheng Li, Ling Bai:
IDHashGAN: Deep Hashing With Generative Adversarial Nets for Incomplete Data Retrieval. 534-545 - Huafeng Liu, Chuanyi Zhang, Yazhou Yao, Xiu-Shen Wei, Fumin Shen, Zhenmin Tang, Jian Zhang:
Exploiting Web Images for Fine-Grained Visual Recognition by Eliminating Open-Set Noise and Utilizing Hard Examples. 546-557 - Jianjie Lu, Weidong Zhang, Haibing Yin:
Generate and Purify: Efficient Person Data Generation for Re-Identification. 558-566 - Qin Xu, Yiming Mei, Jinpei Liu, Chenglong Li:
Multimodal Cross-Layer Bilinear Pooling for RGBT Tracking. 567-580 - Lijuan Sun, Songhe Feng, Jun Liu, Gengyu Lyu, Congyan Lang:
Global-Local Label Correlation for Partial Multi-Label Learning. 581-593 - Huan Li, Ping Wei, Ping Hu:
AVN: An Adversarial Variation Network Model for Handwritten Signature Verification. 594-608 - Zhongze Chen, Jing Li, Jia Wu, Jun Chang, Yafu Xiao, Xiaoting Wang:
Drift-Proof Tracking With Deep Reinforcement Learning. 609-624 - Rizard Renanda Adhi Pramono, Yie-Tarng Chen, Wen-Hsien Fang:
Spatial-Temporal Action Localization With Hierarchical Self-Attention. 625-639 - Heng Yao, Mian Zou, Chuan Qin, Xinpeng Zhang:
Signal-Dependent Noise Estimation for a Real-Camera Model via Weight and Shape Constraints. 640-654 - Jun Chen, Xuejiao Li, Linbo Luo, Jiayi Ma:
Multi-Focus Image Fusion Based on Multi-Scale Gradients and Image Matting. 655-667 - Linchao Zhu, Hehe Fan, Yawei Luo, Mingliang Xu, Yi Yang:
Temporal Cross-Layer Correlation Mining for Action Recognition. 668-676 - Zhaoyu Zhang, Mengyan Li, Haonian Xie, Jun Yu, Tongliang Liu, Chang Wen Chen:
TWGAN: Twin Discriminator Generative Adversarial Networks. 677-688 - Md. Moniruzzaman, Zhaozheng Yin, Zhihai He, Ruwen Qin, Ming C. Leu:
Human Action Recognition by Discriminative Feature Pooling and Video Segment Attention Model. 689-701 - Meng Chang, Huajun Feng, Zhihai Xu, Qi Li:
Low-Light Image Restoration With Short- and Long-Exposure Raw Pairs. 702-714 - Hanli Wang, Pengjie Tang, Qinyu Li, Meng Cheng:
Emotion Expression With Fact Transfer for Video Description. 715-727 - Sihao Lin, Wenhao Wu, Si Wu, Yong Xu, Hau-San Wong:
Unreliable-to-Reliable Instance Translation for Semi-Supervised Pedestrian Detection. 728-739 - Zhi Zeng, Ting Wang, Fulei Ma, Liang Zhang, Peiyi Shen, Syed Afaq Ali Shah, Mohammed Bennamoun:
Probability-Based Framework to Fuse Temporal Consistency and Semantic Information for Background Segmentation. 740-754 - Yuan-fang Zhang, Jiangbin Zheng, Wenjing Jia, Wenfeng Huang, Long Li, Nian Liu, Fei Li, Xiangjian He:
Deep RGB-D Saliency Detection Without Depth. 755-767 - Hao Zhou, Wengang Zhou, Yun Zhou, Houqiang Li:
Spatial-Temporal Multi-Cue Network for Sign Language Recognition and Translation. 768-779 - Amir Shirian, Subarna Tripathi, Tanaya Guha:
Dynamic Emotion Modeling With Learnable Graphs and Graph Inception Network. 780-790 - Jingkuan Song, Jingqiu Zhang, Lianli Gao, Zhou Zhao, Heng Tao Shen:
AgeGAN++: Face Aging and Rejuvenation With Dual Conditional GANs. 791-804 - Desheng Cai, Shengsheng Qian, Quan Fang, Changsheng Xu:
Heterogeneous Hierarchical Feature Aggregation Network for Personalized Micro-Video Recommendation. 805-818 - Huijing Zhan, Jie Lin, Kenan Emir Ak, Boxin Shi, Ling-Yu Duan, Alex C. Kot:
$A^3$-FKG: Attentive Attribute-Aware Fashion Knowledge Graph for Outfit Preference Prediction. 819-831 - Hongchen Tan, Xiuping Liu, Baocai Yin, Xin Li:
Cross-Modal Semantic Matching Generative Adversarial Networks for Text-to-Image Synthesis. 832-845 - Aite Zhao, Junyu Dong, Jianbo Li, Lin Qi, Huiyu Zhou:
Associated Spatio-Temporal Capsule Network for Gait Recognition. 846-860 - Jiaxu Leng, Ying Liu, Zhihui Wang, Haibo Hu, Xinbo Gao:
CrossNet: Detecting Objects as Crosses. 861-875 - Yangyang Shu, Qian Li, Chang Xu, Shaowu Liu, Guandong Xu:
V-SVR+: Support Vector Regression With Variational Privileged Information. 876-889 - Yifang Yin, Ying Zhang, Zhenguang Liu, Sheng Wang, Rajiv Ratn Shah, Roger Zimmermann:
GPS2Vec: Pre-Trained Semantic Embeddings for Worldwide GPS Coordinates. 890-903 - Huixia Ben, Yingwei Pan, Yehao Li, Ting Yao, Richang Hong, Meng Wang, Tao Mei:
Unpaired Image Captioning With semantic-Constrained Self-Learning. 904-916 - Yanhao Tan, Mohammad Muntasir Rahman, Yanfu Yan, Jian Xue, Ling Shao, Ke Lu:
Fine-Grained Categorization From RGB-D Images. 917-928 - Xiao Luan, Yuanyuan Zhao, Weihua Ou, Linghui Liu, Weisheng Li, Yucheng Shu, Hongmin Geng:
Collaborative Learning With a Multi-Branch Framework for Feature Enhancement. 929-941 - Xinyuan Qian, Alessio Brutti, Oswald Lanz, Maurizio Omologo, Andrea Cavallaro:
Audio-Visual Tracking of Concurrent Speakers. 942-954 - Han Fang, Dongdong Chen, Feng Wang, Zehua Ma, Honggu Liu, Wenbo Zhou, Weiming Zhang, Nenghai Yu:
TERA: Screen-to-Camera Image Code With Transparency, Efficiency, Robustness and Adaptability. 955-967 - Tao Chen, Guo-Sen Xie, Yazhou Yao, Qiong Wang, Fumin Shen, Zhenmin Tang, Jian Zhang:
Semantically Meaningful Class Prototype Learning for One-Shot Image Segmentation. 968-980 - Pandeng Li, Hongtao Xie, Shaobo Min, Zheng-Jun Zha, Yongdong Zhang:
Online Residual Quantization Via Streaming Data Correlation Preserving. 981-994 - Tianze Gao, Huihui Pan, Zidong Wang, Huijun Gao:
A CRF-Based Framework for Tracklet Inactivation in Online Multi-Object Tracking. 995-1007 - Mahesh Kumar Krishna Reddy, Mrigank Rochan, Yiwei Lu, Yang Wang:
AdaCrowd: Unlabeled Scene Adaptation for Crowd Counting. 1008-1019 - Yukun Zuo, Hantao Yao, Liansheng Zhuang, Changsheng Xu:
Seek Common Ground While Reserving Differences: A Model-Agnostic Module for Noisy Domain Adaptation. 1020-1030 - Qi Wang, Weidong Min, Qing Han, Qian Liu, Cheng Zha, Haoyu Zhao, Zitai Wei:
Inter-Domain Adaptation Label for Data Augmentation in Vehicle Re-Identification. 1031-1041 - Tao Chen, Shui-Hua Wang, Qiong Wang, Zheng Zhang, Guo-Sen Xie, Zhenmin Tang:
Enhanced Feature Alignment for Unsupervised Domain Adaptation of Semantic Segmentation. 1042-1054 - Xiuwen Gong, Jiahui Yang, Dong Yuan, Wei Bao:
Generalized Large Margin $k$NN for Partial Label Learning. 1055-1066 - Jing Yi, Zhenzhong Chen:
Multi-Modal Variational Graph Auto-Encoder for Recommendation Systems. 1067-1079 - Zhengning Wu, Xiaobo Xia, Ruxin Wang, Jiatong Li, Jun Yu, Yinian Mao, Tongliang Liu:
LR-SVM+: Learning Using Privileged Information with Noisy Labels. 1080-1092 - Zeren Sun, Huafeng Liu, Qiong Wang, Tianfei Zhou, Qi Wu, Zhenmin Tang:
Co-LDL: A Co-Training-Based Label Distribution Learning Method for Tackling Label Noise. 1093-1104 - Huafeng Liu, Haofeng Zhang, Jianfeng Lu, Zhenmin Tang:
Exploiting Web Images for Fine-Grained Visual Recognition via Dynamic Loss Correction and Global Sample Selection. 1105-1115 - Xiaobo Shen, Guohua Dong, Yuhui Zheng, Long Lan, Ivor W. Tsang, Quan-Sen Sun:
Deep Co-Image-Label Hashing for Multi-Label Image Retrieval. 1116-1126 - Hao-Chiang Shao, Hsin-Chieh Wang, Weng-Tai Su, Chia-Wen Lin:
Ensemble Learning With Manifold-Based Data Splitting for Noisy Label Correction. 1127-1140 - Junya Teng, Xiankai Lu, Yongshun Gong, Xinfang Liu, Xiushan Nie, Yilong Yin:
Regularized Two Granularity Loss Function for Weakly Supervised Video Moment Retrieval. 1141-1151 - Sijie Song, Jiaying Liu, Lilang Lin, Zongming Guo:
Learning to Recognize Human Actions From Noisy Skeleton Data Via Noise Adaptation. 1152-1163 - Jingyu Hao, Chengjia Wang, Guang Yang, Zhifan Gao, Jinglin Zhang, Heye Zhang:
Annealing Genetic GAN for Imbalanced Web Data Learning. 1164-1174 - Bin Zhu, Chong-Wah Ngo, Wing Kwong Chan:
Learning From Web Recipe-Image Pairs for Food Recognition: Problem, Baselines and Performance. 1175-1185 - Yaoyao Zhong, Weihong Deng, Han Fang, Jiani Hu, Dongyue Zhao, Xian Li, Dongchao Wen:
Dynamic Training Data Dropout for Robust Deep Face Recognition. 1186-1197 - Chuanyi Zhang, Qiong Wang, Guo-Sen Xie, Qi Wu, Fumin Shen, Zhenmin Tang:
Robust Learning From Noisy Web Images Via Data Purification for Fine-Grained Recognition. 1198-1209 - Shiji Zhou, Lianzhe Wang, Shanghang Zhang, Zhi Wang, Wenwu Zhu:
Active Gradual Domain Adaptation: Dataset and Approach. 1210-1220 - Gongmian Wang, Xing Xu, Fumin Shen, Huimin Lu, Yanli Ji, Heng Tao Shen:
Cross-Modal Dynamic Networks for Video Moment Retrieval With Text Query. 1221-1232 - Bingwen Hu, Ping Liu, Zhedong Zheng, Mingwu Ren:
SPG-VTON: Semantic Prediction Guidance for Multi-Pose Virtual Try-on. 1233-1246 - Zhenfeng Xue, Weijie Mao, Liang Zheng:
Learning to Simulate Complex Scenes for Street Scene Segmentation. 1253-1265 - Mitra Tajrobehkar, Kaihua Tang, Hanwang Zhang, Joo-Hwee Lim:
Align R-CNN: A Pairwise Head Network for Visual Relationship Detection. 1266-1276 - Peiguang Jing, Jing Zhang, Liqiang Nie, Shu Ye, Jing Liu, Yuting Su:
Tripartite Graph Regularized Latent Low-Rank Representation for Fashion Compatibility Prediction. 1277-1287 - Yaomin Wang, Wenguang He:
High Capacity Reversible Data Hiding in Encrypted Image Based on Adaptive MSB Prediction. 1288-1298 - Shiguang Liu, Ting Zhu:
Structure-Guided Arbitrary Style Transfer for Artistic Image and Video. 1299-1312 - Dung Nguyen, Duc Thanh Nguyen, Rui Zeng, Thanh Thi Nguyen, Son N. Tran, Thin Nguyen, Sridha Sridharan, Clinton Fookes:
Deep Auto-Encoders With Sequential Learning for Multimodal Dimensional Emotion Recognition. 1313-1324 - Yalan Ye, Yukun He, Tongjie Pan, Jingjing Li, Heng Tao Shen:
Alleviating Domain Shift via Discriminative Learning for Generalized Zero-Shot Learning. 1325-1337 - Haoyu Tang, Jihua Zhu, Meng Liu, Zan Gao, Zhiyong Cheng:
Frame-Wise Cross-Modal Matching for Video Moment Retrieval. 1338-1349 - Tianchi Huang, Rui-Xiao Zhang, Lifeng Sun:
Zwei: A Self-Play Reinforcement Learning Framework for Video Transmission Services. 1350-1365 - Chong Mou, Jian Zhang, Xiaopeng Fan, Hangfan Liu, Ronggang Wang:
COLA-Net: Collaborative Attention Network for Image Restoration. 1366-1377 - Kaihao Zhang, Wenhan Luo, Lin Ma, Wenqi Ren, Hongdong Li:
Disentangled Feature Networks for Facial Portrait and Caricature Generation. 1378-1388 - Kyohoon Sim, Jiachen Yang, Wen Lu, Xinbo Gao:
Blind Stereoscopic Image Quality Evaluator Based on Binocular Semantic and Quality Channels. 1389-1398 - Giulia Slavic, Mohamad Baydoun, Damian Campo, Lucio Marcenaro, Carlo S. Regazzoni:
Multilevel Anomaly Detection Through Variational Autoencoders and Bayesian Models for Self-Aware Embodied Agents. 1399-1414 - Lu Zhang, Jingsong Xu, Yongshun Gong, Litao Yu, Jian Zhang, Jialie Shen:
Unsupervised Image and Text Fusion for Travel Information Enhancement. 1415-1425 - Amin Parvaneh, Ehsan Abbasnejad, Qi Wu, Qinfeng (Javen) Shi, Anton van den Hengel:
Show, Price and Negotiate: A Negotiator With Online Value Look-Ahead. 1426-1434 - Jialu Huang, Jing Liao, Sam Kwong:
Unsupervised Image-to-Image Translation via Pre-Trained StyleGAN2 Network. 1435-1448 - Huaiwen Zhang, Shengsheng Qian, Quan Fang, Changsheng Xu:
Multi-Modal Meta Multi-Task Learning for Social Media Rumor Detection. 1449-1459 - Rencan Nie, Chaozhen Ma, Jinde Cao, Hongwei Ding, Dongming Zhou:
A Total Variation With Joint Norms For Infrared and Visible Image Fusion. 1460-1472 - Yitong Yan, Chuangchuang Liu, Changyou Chen, Xianfang Sun, Longcun Jin, Xinyi Peng, Xiang Zhou:
Fine-Grained Attention and Feature-Sharing Generative Adversarial Networks for Single Image Super-Resolution. 1473-1487