


default search action
IEEE Transactions on Multimedia, Volume 26
Volume 26, 2024
- Qinwei Xu
, Ruipeng Zhang
, Ya Zhang
, Yiyan Wu
, Yanfeng Wang
:
Federated Adversarial Domain Hallucination for Privacy-Preserving Domain Generalization. 1-14 - Xingxing Zhang
, Shupeng Gui, Jian Jin
, Zhenfeng Zhu
, Yao Zhao
:
ATZSL: Defensive Zero-Shot Recognition in the Presence of Adversaries. 15-27 - Weiqing Lu
, Hai-Miao Hu
, Jinzuo Yu, Yibo Zhou, Hanzi Wang
, Bo Li:
Orientation-Aware Pedestrian Attribute Recognition Based on Graph Convolution Network. 28-40 - Zhuang Li
, Leilei Cao
, Hongbin Wang, Lihong Xu
:
End-to-End Instance-Level Human Parsing by Segmenting Persons. 41-50 - Xun Cai, Qingjie Shi, Yanbo Gao
, Shuai Li
, Wei Hua
, Tian Xie:
A Structure-Preserving and Illumination-Consistent Cycle Framework for Image Harmonization. 51-64 - Saizhe Ding, Jinze Chen, Yang Wang
, Yu Kang
, Weiguo Song, Jie Cheng, Yang Cao
:
E-MLB: Multilevel Benchmark for Event-Based Camera Denoising. 65-76 - Jiang Li
, Xiaoping Wang
, Guoqing Lv
, Zhigang Zeng
:
GraphCFC: A Directed Graph Based Cross-Modal Feature Complementation Approach for Multimodal Conversational Emotion Recognition. 77-89 - Zhe Li
, Xinyu Wang
, Yuliang Liu
, Lianwen Jin
, Yichao Huang, Kai Ding:
Improving Handwritten Mathematical Expression Recognition via Similar Symbol Distinguishing. 90-102 - Zheng Li
, Caili Guo
, Zerun Feng
, Jenq-Neng Hwang
, Zhongtian Du:
Integrating Language Guidance Into Image-Text Matching for Correcting False Negatives. 103-116 - Muqing Deng
, Zhuyao Fan
, Peng Lin, Xiaoreng Feng:
Human Gait Recognition Based on Frontal-View Sequences Using Gait Dynamics and Deep Learning. 117-126 - Huimin Zeng
, Jie Huang
, Jiacheng Li
, Zhiwei Xiong
:
Region-Aware Portrait Retouching With Sparse Interactive Guidance. 127-140 - Jiawei Liu
, Weining Wang
, Sihan Chen, Xinxin Zhu, Jing Liu
:
Sounding Video Generator: A Unified Framework for Text-Guided Sounding Video Generation. 141-153 - Yan-Bo Liu
, Guo Cao
, Boshan Shi, Yingxiang Hu
:
CCANet: A Collaborative Cross-Modal Attention Network for RGB-D Crowd Counting. 154-165 - Wenhan Wu
, Wenfeng Yi, Jinghai Li, Maoyin Chen
, Xiaoping Zheng
:
Automatic Identification of Human Subgroups in Time-Dependent Pedestrian Flow Networks. 166-177 - Ali Köksal
, Kenan E. Ak, Ying Sun
, Deepu Rajan
, Joo Hwee Lim
:
Controllable Video Generation With Text-Based Instructions. 190-201 - Bosheng Ding, Ruiheng Zhang
, Lixin Xu
, Guanyu Liu
, Shuo Yang
, Yumeng Liu
, Qi Zhang:
U2D2Net: Unsupervised Unified Image Dehazing and Denoising Network for Single Hazy Image Enhancement. 202-217 - Zhiwu Qing
, Shiwei Zhang
, Ziyuan Huang
, Xiang Wang
, Yuehuan Wang
, Yiliang Lv, Changxin Gao
, Nong Sang
:
MAR: Masked Autoencoders for Efficient Action Recognition. 218-233 - Jinguang Wang
, Shengsheng Qian
, Jun Hu
, Richang Hong
:
Positive Unlabeled Fake News Detection via Multi-Modal Masked Transformer Network. 234-244 - Jianxun Lou
, Hanhe Lin
, Philippa Young, Richard White
, Zelei Yang, Susan Cheng Shelmerdine, David Marshall, Emiliano Spezi
, Marco Palombo
, Hantao Liu
:
Predicting Radiologists' Gaze With Computational Saliency Models in Mammogram Reading. 256-269 - Md. Moniruzzaman
, Zhaozheng Yin
:
Feature Weakening, Contextualization, and Discrimination for Weakly Supervised Temporal Action Localization. 270-283 - Zehui Chen
, Chenhongyi Yang, Jiahao Chang
, Feng Zhao
, Zheng-Jun Zha
, Feng Wu:
DDOD: Dive Deeper into the Disentanglement of Object Detector. 284-298 - Kai Zeng
, Kejiang Chen
, Weiming Zhang
, Yaofei Wang:
Upward Robust Steganography Based on Overflow Alleviation. 299-312 - Yukun Su
, Jingliang Deng, Ruizhou Sun, Guosheng Lin
, Hanjing Su, Qingyao Wu
:
A Unified Transformer Framework for Group-Based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection. 313-325 - Jun Wang
, Peng Yin, Yuanyun Wang
, Wenhui Yang:
CMAT: Integrating Convolution Mixer and Self-Attention for Visual Tracking. 326-338 - Lorenzo Agnolucci
, Leonardo Galteri
, Marco Bertini
, Alberto Del Bimbo
:
Perceptual Quality Improvement in Videoconferencing Using Keyframes-Based GAN. 339-352 - Wanting Zhou
, Longteng Kong
, Yushan Han, Jie Qin
, Zhenan Sun
:
Contextualized Relation Predictive Model for Self-Supervised Group Activity Representation Learning. 353-366 - Hua Li
, Junyan Liang, Ruiqi Wu, Runmin Cong
, Wenhui Wu
, Sam Tak-Wu Kwong
:
Stereo Superpixel Segmentation via Decoupled Dynamic Spatial-Embedding Fusion Network. 367-378 - Peipei Zhu
, Xiao Wang
, Lin Zhu
, Zhenglong Sun
, Wei-Shi Zheng
, Yaowei Wang
, Changwen Chen
:
Prompt-Based Learning for Unpaired Image Captioning. 379-393 - Zhiyu Wang
, Chao Yang
, Bin Jiang
, Junsong Yuan
:
A Dual Reinforcement Learning Framework for Weakly Supervised Phrase Grounding. 394-405 - Andong Lu
, Zhang Zhang
, Yan Huang
, Yifan Zhang
, Chenglong Li
, Jin Tang
, Liang Wang
:
Illumination Distillation Framework for Nighttime Person Re-Identification and a New Benchmark. 406-419 - Parham Hadikhani
, Daphne Teck Ching Lai
, Wee-Hong Ong
:
Human Activity Discovery With Automatic Multi-Objective Particle Swarm Optimization Clustering With Gaussian Mutation and Game Theory. 420-435 - Quanpeng Song
, Jiaxin Li
, Si Wu
, Hau-San Wong
:
A Graph-Based Discriminator Architecture for Multi-Attribute Facial Image Editing. 436-446 - Jianping Gou
, Xin He
, Lan Du
, Baosheng Yu, Wenbai Chen
, Zhang Yi
:
Hierarchical Locality-Aware Deep Dictionary Learning for Classification. 447-461 - Senmao Ye
, Huan Wang, Mingkui Tan
, Fei Liu
:
Recurrent Affine Transformation for Text-to-Image Synthesis. 462-473 - Weitao You
, Juntao Ji
, Lingyun Sun
, Changyuan Yang, Mi Yu, Shi Chen
, Jiayi Yao
:
Automatic Generation of Interactive Nonlinear Video for Online Apparel Shopping Navigation. 474-486 - Aijia Yang
, Sihao Lin
, Chung-Hsing Yeh
, Minglei Shu
, Yi Yang, Xiaojun Chang
:
Context Matters: Distilling Knowledge Graph for Enhanced Object Detection. 487-500 - Qinghua Ren
, Qirong Mao
, Shijian Lu
:
Prototypical Bidirectional Adaptation and Learning for Cross-Domain Semantic Segmentation. 501-513 - Weizhi Nie
, Yuru Bao, Yue Zhao
, Anan Liu
:
Long Dialogue Emotion Detection Based on Commonsense Knowledge Graph Guidance. 514-528 - Ziqi Yuan
, Yihe Liu, Hua Xu
, Kai Gao:
Noise Imitation Based Adversarial Training for Robust Multimodal Sentiment Analysis. 529-539 - Arbind Agrahari Baniya
, Tsz-Kwan Lee
, Peter W. Eklund
, Sunil Aryal
:
Omnidirectional Video Super-Resolution Using Deep Learning. 540-554 - Yufan Hu
, Junyu Gao
, Changsheng Xu
:
Learning Multi-Expert Distribution Calibration for Long-Tailed Video Classification. 555-567 - Yanshan Li
, Huajie Liang
, Rui Yu
:
BI-CAM: Generating Explanations for Deep Neural Networks Using Bipolar Information. 568-580 - Rongtao Xu
, Changwei Wang
, Shibiao Xu
, Weiliang Meng
, Xiaopeng Zhang
:
Wave-Like Class Activation Map With Representation Fusion for Weakly-Supervised Semantic Segmentation. 581-592 - Hezhen Hu, Junfu Pu
, Wengang Zhou
, Hang Fang, Houqiang Li
:
Prior-Aware Cross Modality Augmentation Learning for Continuous Sign Language Recognition. 593-606 - Xiongli Chai
, Feng Shao
, Qiuping Jiang
, Xuejin Wang
, Long Xu
, Yo-Sung Ho
:
Blind Quality Evaluator of Light Field Images by Group-Based Representations and Multiple Plane-Oriented Perceptual Characteristics. 607-622 - Yuxuan Liu
, Hongwei Ge
, Zhen Wang
, Yaqing Hou
, Mingde Zhao
:
Discriminative Identity-Feature Exploring and Differential Aware Learning for Unsupervised Person Re-Identification. 623-636 - Chunyang Xie
, Dongheng Zhang
, Zhi Wu
, Cong Yu
, Yang Hu
, Yan Chen
:
RPM: RF-Based Pose Machines. 637-649 - Mingliang Zhou
, Xingtai Wu, Xuekai Wei
, Tao Xiang
, Bin Fang
, Sam Kwong
:
Low-Light Enhancement Method Based on a Retinex Model for Structure Preservation. 650-662 - Zilong Yu, Yunyun Yang
, Yongbin Zhu
, Bixue Guo, Chun Li
:
CS-IntroVAE: Cauchy-Schwarz Divergence-Based Introspective Variational Autoencoder. 663-672 - Shentong Mo
, Miao Xin
:
BSTG-Trans: A Bayesian Spatial-Temporal Graph Transformer for Long-Term Pose Forecasting. 673-686 - Songhan He
, Dawen Xu
, Lin Yang
, Weipeng Liang
:
Adaptive HEVC Video Steganography With High Performance Based on Attention-Net and PU Partition Modes. 687-700 - Xueping Wang
, Min Liu
, Fei Wang, Jianhua Dai
, An-An Liu
, Yaonan Wang
:
Relation-Preserving Feature Embedding for Unsupervised Person Re-Identification. 714-723 - Shiqi Lin, Tao Yu
, Ruoyu Feng, Xin Li
, Xiaoyuan Yu, Lei Xiao, Zhibo Chen
:
Local Patch AutoAugment With Multi-Agent Collaboration. 724-736 - Kaipeng Zhang
, Yoichi Sato
:
Semantic Image Segmentation by Dynamic Discriminative Prototypes. 737-749 - Shaowei Weng
, Tangguo Zhu, Tiancong Zhang
, Chunyu Zhang:
UCM-Net: A U-Net-Like Tampered-Region-Related Framework for Copy-Move Forgery Detection. 750-763 - Dandan Zhu
, Kaiwei Zhang
, Nana Zhang, Qiangqiang Zhou, Xiongkuo Min
, Guangtao Zhai, Xiaokang Yang:
Unified Audio-Visual Saliency Model for Omnidirectional Videos With Spatial Audio. 764-775 - Vladimir Frants
, Sos S. Agaian
, Karen Panetta
:
QSAM-Net: Rain Streak Removal by Quaternion Neural Network With Self-Attention Module. 789-798 - Renshuai Liu
, Yao Cheng, Sifei Huang, Chengyang Li, Xuan Cheng
:
Transformer-Based High-Fidelity Facial Displacement Completion for Detailed 3D Face Reconstruction. 799-810 - Jinfu Liu
, Xinshun Wang
, Can Wang, Yuan Gao
, Mengyuan Liu
:
Temporal Decoupling Graph Convolutional Network for Skeleton-Based Gesture Recognition. 811-823 - Yuan Sun
, Zhenwen Ren
, Peng Hu
, Dezhong Peng, Xu Wang
:
Hierarchical Consensus Hashing for Cross-Modal Retrieval. 824-836 - Yaguang Song
, Xiaoshan Yang, Yaowei Wang
, Changsheng Xu
:
Recovering Generalization via Pre-Training-Like Knowledge Distillation for Out-of-Distribution Visual Question Answering. 837-851 - Ruimin Li
, Jiajun Xiang, Feixiang Sun, Ye Yuan, Longwu Yuan, Shuiping Gou
:
Multiscale Cross-Modal Homogeneity Enhancement and Confidence-Aware Fusion for Multispectral Pedestrian Detection. 852-863 - Wenjie Li
, Juncheng Li
, Guangwei Gao
, Weihong Deng
, Jiantao Zhou
, Jian Yang
, Guo-Jun Qi
:
Cross-Receptive Focused Inference Network for Lightweight Image Super-Resolution. 864-877 - Kedeng Tong
, Xin Jin
, Yuqing Yang, Chen Wang, Jinshi Kang
, Fan Jiang
:
Learned Focused Plenoptic Image Compression With Microimage Preprocessing and Global Attention. 890-903 - Ke Zhang
, Hanliang Jiang, Jian Zhang
, Qingming Huang
, Jianping Fan
, Jun Yu
, Weidong Han
:
Semi-Supervised Medical Report Generation via Graph-Guided Hybrid Feature Consistency. 904-915 - Gangjian Zhang
, Shikui Wei
, Huaxin Pang
, Shuang Qiu
, Yao Zhao
:
Enhance Composed Image Retrieval via Multi-Level Collaborative Localization and Semantic Activeness Perception. 916-928 - Jianjun Xiang, Peng Chen
, Yuanjie Dang
, Ronghua Liang
, Gangyi Jiang
:
Pseudo Light Field Image and 4D Wavelet-Transform-Based Reduced-Reference Light Field Image Quality Assessment. 929-943 - Jinyu Wen
, Feiwei Qin
, Jiao Du
, Meie Fang
, Xinhua Wei, C. L. Philip Chen
, Ping Li
:
MsgFusion: Medical Semantic Guided Two-Branch Network for Multimodal Brain Image Fusion. 944-957 - Yuxin Xiang
, Dongjie Tang
, Rui Huang, Yong Yao
, Chao Xie, Qiming Shi, Randy Xu, Mohammad Reza Haghighat
, Cathy Bao, Yicheng Gu
, Zhengwei Qi
, Haibing Guan
:
CARE: Cloudified Android With Optimized Rendering Platform. 958-971 - Tuan T. Nguyen
, Hoang H. Nguyen
, Mina Sartipi, Marco Fisichella
:
Multi-Vehicle Multi-Camera Tracking With Graph-Based Tracklet Features. 972-983 - Geng Chen
, Huazhu Fu
, Tao Zhou
, Guobao Xiao
, Keren Fu
, Yong Xia
, Yanning Zhang
:
Fusion-Embedding Siamese Network for Light Field Salient Object Detection. 984-994 - Bing Cao
, Haifang Cao, Jiaxu Liu, Pengfei Zhu
, Changqing Zhang
, Qinghua Hu
:
Autoencoder-Based Collaborative Attention GAN for Multi-Modal Image Synthesis. 995-1010 - Jiesheng Wu
, Fangwei Hao, Weiyun Liang
, Jing Xu
:
Transformer Fusion and Pixel-Level Contrastive Learning for RGB-D Salient Object Detection. 1011-1026 - Tao Xie
, Li Wang, Ke Wang
, Ruifeng Li
, Xinyu Zhang
, Haoming Zhang
, Linqi Yang, Huaping Liu
, Jun Li
:
FARP-Net: Local-Global Feature Aggregation and Relation-Aware Proposals for 3D Object Detection. 1027-1040 - Shuyue Lan
, Zhilu Wang
, Ermin Wei
, Amit K. Roy-Chowdhury
, Qi Zhu
:
Collaborative Multi-Agent Video Fast-Forwarding. 1041-1054 - Shulei Ji
, Xinyu Yang
:
EmoMusicTV: Emotion-Conditioned Symbolic Music Generation With Hierarchical Transformer VAE. 1076-1088 - Yuqi Zhang, Qi Qian, Hongsong Wang
, Chong Liu
, Weihua Chen
, Fan Wang
:
Graph Convolution Based Efficient Re-Ranking for Visual Retrieval. 1089-1101 - Zhenguo Yang
, Zhuopan Yang, Zhiwei Guo, Zehang Lin, Haizhong Zhu, Qing Li
, Wenyin Liu
:
Towards Temporal Event Detection: A Dataset, Benchmarks and Challenges. 1102-1113 - Chengrui Zhang
, Junxin Chen
, Dongming Chen
, Wei Wang
, Yushu Zhang
, Yicong Zhou
:
Exploiting Substitution Box for Cryptanalyzing Image Encryption Schemes With DNA Coding and Nonlinear Dynamics. 1114-1128 - Weizhi Nie
, Xin Wen
, Jing Liu, Jiawei Chen
, Jiancan Wu, Guoqing Jin
, Jing Lu, An-An Liu:
Knowledge-Enhanced Causal Reinforcement Learning Model for Interactive Recommendation. 1129-1142 - Wei Zhou
, Weitao Jiang
, Dihu Chen
, Haifeng Hu
, Tao Su
:
Mining Semantic Information With Dual Relation Graph Network for Multi-Label Image Classification. 1143-1157 - Lin Zhao
, Hui Zhou
, Xinge Zhu
, Xiao Song, Hongsheng Li
, Wenbing Tao
:
LIF-Seg: LiDAR and Camera Image Fusion for 3D LiDAR Semantic Segmentation. 1158-1168 - Zhaoyi Li
, Ping Zhong
, Jiawei Huang
, Feng Gao, Jian-Xin Wang
:
Achieving QoE Fairness in Bitrate Allocation of 360° Video Streaming. 1169-1178 - Feifei Ding
, Jianjun Li
, Wanyong Tian, Shanqing Zhang, Wenqiang Yuan:
Unsupervised Domain Adaptation via Risk-Consistent Estimators. 1179-1187 - Jian Xiao
, Xiaojun Bi
:
Model-Guided Generative Adversarial Networks for Unsupervised Fine-Grained Image Generation. 1188-1199 - Jiayuan Sun
, Luping Ji
, Jiewen Zhu
:
Shared Coupling-Bridge Scheme for Weakly Supervised Local Feature Learning. 1200-1212 - Kangle Wu
, Jun Huang
, Yong Ma
, Fan Fan
, Jiayi Ma
:
Cycle-Retinex: Unpaired Low-Light Image Enhancement via Retinex-Inline CycleGAN. 1213-1228 - Yuanyuan Shi, Xiaolong Fu, Yunan Li
, Kaibin Miao, Xiangzeng Liu
, Bocheng Zhao, Qiguang Miao
:
A Semi-Supervised Underexposed Image Enhancement Network With Supervised Context Attention and Multi-Exposure Fusion. 1229-1243 - Theyab A. Alotaibi
, Ishtiaq Rasool Khan
, Farid Bourennani:
Quality Assessment of Tone-Mapped Images Using Fundamental Color and Structural Features. 1244-1254 - Bowen Yuan
, Yefei Sheng
, Bing-Kun Bao
, Yi-Ping Phoebe Chen
, Changsheng Xu
:
Semantic Distance Adversarial Learning for Text-to-Image Synthesis. 1255-1266 - Weitao Feng
, Lei Bai
, Yongqiang Yao
, Weihao Gan, Wei Wu
, Wanli Ouyang
:
Similarity- and Quality-Guided Relation Learning for Joint Detection and Tracking. 1267-1280 - Inske Groenen
, Stevan Rudinac
, Marcel Worring
:
PanorAMS: Automatic Annotation for Detecting Objects in Urban Context. 1281-1294 - Jian Zhu
, Hanli Wang
, Bin He
:
Multi-Modal Structure-Embedding Graph Transformer for Visual Commonsense Reasoning. 1295-1305 - Lei Ma
, Hanyu Hong
, Fanman Meng
, Qingbo Wu
, Jinmeng Wu
:
Deep Progressive Asymmetric Quantization Based on Causal Intervention for Fine-Grained Image Retrieval. 1306-1318 - Jianping Gou
, Nannan Xie, Yunhao Yuan
, Lan Du
, Weihua Ou
, Zhang Yi
:
Reconstructed Graph Constrained Auto-Encoders for Multi-View Representation Learning. 1319-1332 - Shuai Xiao
, Guipeng Lan
, Jiachen Yang
, Wen Lu, Qinggang Meng
, Xinbo Gao
:
MCS-GAN: A Different Understanding for Generalization of Deep Forgery Detection. 1333-1345 - Yanxiong Li
, Wenchang Cao, Wei Xie, Jialong Li, Emmanouil Benetos
:
Few-Shot Class-Incremental Audio Classification Using Dynamically Expanded Classifier With Self-Attention Modified Prototypes. 1346-1360 - Jiamin Zhuang
, Jing Yu
, Yang Ding, Xiangyan Qu
, Yue Hu
:
Towards Fast and Accurate Image-Text Retrieval With Self-Supervised Fine-Grained Alignment. 1361-1372 - Quan Wang, Sheng Li
, Zichi Wang
, Xinpeng Zhang
, Guorui Feng
:
Multi-Source Style Transfer via Style Disentanglement Network. 1373-1383 - Yan Dai
, Xiaojia Chen
, Xuanhan Wang
, Minghui Pang
, Lianli Gao
, Heng Tao Shen
:
ReSParser: Fully Convolutional Multiple Human Parsing With Representative Sets. 1384-1394 - Tianli Sun
, Haonan Chen, Guosheng Hu
, Lianghua He
, Cairong Zhao
:
Explainability of Speech Recognition Transformers via Gradient-Based Attention Visualization. 1395-1406 - Zhongyu Bai
, Hongli Xu
, Xiangyue Zhang
, Qichuan Ding
:
GCSANet: Arbitrary Style Transfer With Global Context Self-Attentional Network. 1407-1420 - Ruixuan Cong
, Hao Sheng
, Da Yang
, Zhenglong Cui
, Rongshan Chen
:
Exploiting Spatial and Angular Correlations With Deep Efficient Transformers for Light Field Image Super-Resolution. 1421-1435 - Lei Jin
, Xiaojuan Wang
, Xuecheng Nie
, Wendong Wang
, Yandong Guo, Shuicheng Yan
, Jian Zhao
:
Rethinking the Person Localization for Single-Stage Multi-Person Pose Estimation. 1436-1447 - Lvlong Lai
, Jian Chen
, Qingyao Wu
:
Zero-Shot Single-View Point Cloud Reconstruction via Cross-Category Knowledge Transferring. 1448-1459 - Liyun Zuo
, Baoyan Wang, Lei Zhang
, Jun Xu
, Xiantong Zhen
:
Variational Neuron Shifting for Few-Shot Image Classification Across Domains. 1460-1473 - Qing Yu
, Go Irie
, Kiyoharu Aizawa
:
Self-Labeling Framework for Open-Set Domain Adaptation With Few Labeled Samples. 1474-1487 - Xiaotian Wu
, Xinjie Feng:
Size Invariant Visual Cryptography Schemes With Evolving Threshold Access Structures. 1488-1503 - Bo Jiang
, Shuxian Luo, Xiao Wang
, Chuanfu Li, Jin Tang
:
AMatFormer: Efficient Feature Matching via Anchor Matching Transformer. 1504-1515 - Shicai Wei
, Chunbo Luo
, Yang Luo
, Jialang Xu
:
Privileged Modality Learning via Multimodal Hallucination. 1516-1527 - Yuanjiang Cao
, Lina Yao
, Le Pan, Quan Z. Sheng
, Xiaojun Chang
:
Guided Image-to-Image Translation by Discriminator-Generator Communication. 1528-1538 - Yongle Zhang
, Yimin Liu
, Ruotong Hu
, Qiang Wu
, Jian Zhang
:
Mutual Dual-Task Generator With Adaptive Attention Fusion for Image Inpainting. 1539-1550 - Lei Zhang
, Leiting Chen, Chuan Zhou
, Xin Li
, Fan Yang
, Zhang Yi
:
Weighted Graph-Structured Semantics Constraint Network for Cross-Modal Retrieval. 1551-1564 - Hongchao Li
, Aihua Zheng
, Liping Sun
, Yonglong Luo
:
Camera Topology Graph Guided Vehicle Re-Identification. 1565-1577 - Xixi Wang
, Bo Jiang
, Xiao Wang
, Jinhui Tang
, Bin Luo
:
Rethinking Batch Sample Relationships for Data Representation: A Batch-Graph Transformer Based Approach. 1578-1588 - Haoran Qi
, Yuwei Qiu
, Xing Luo, Zhi Jin
:
An Efficient Latent Style Guided Transformer-CNN Framework for Face Super-Resolution. 1589-1599 - Wentao Tan, Changxing Ding
, Pengfei Wang
, Mingming Gong
, Kui Jia
:
Style Interleaved Learning for Generalizable Person Re-Identification. 1600-1612 - Yong Zhang
, Yingwei Pan
, Ting Yao
, Rui Huang
, Tao Mei
, Chang Wen Chen
:
End-to-End Video Scene Graph Generation With Temporal Propagation Transformer. 1613-1625 - Yue Wu
, Jiaming Liu
, Maoguo Gong
, Peiran Gong
, Xiaolong Fan
, A. Kai Qin
, Qiguang Miao
, Wenping Ma
:
Self-Supervised Intra-Modal and Cross-Modal Contrastive Learning for Point Cloud Understanding. 1626-1638 - An-An Liu
, Chenxi Huang, Ning Xu
, Hongshuo Tian
, Jing Liu
, Yongdong Zhang
:
Counterfactual Visual Dialog: Robust Commonsense Knowledge Learning From Unbiased Training. 1639-1651 - Shumin Zhu
, Xingxing Zou
, Jianjun Qian
, Wai Keung Wong
:
Learning Structured Relation Embeddings for Fine-Grained Fashion Attribute Recognition. 1652-1664 - Sheng Yu
, Di-Hua Zhai
, Yuanqing Xia
, Dong Li
, Shiqi Zhao
:
CatTrack: Single-Stage Category-Level 6D Object Pose Tracking via Convolution and Vision Transformer. 1665-1680 - Jiangli Shi, Feng Shao
, Chongzhen Tian, Hangwei Chen
, Long Xu
, Yo-Sung Ho
:
Progressive Bidirectional Feature Extraction and Enhancement Network for Quality Evaluation of Night-Time Images. 1690-1705 - Yaosi Hu
, Chong Luo
, Zhenzhong Chen
:
A Benchmark for Controllable Text -Image-to-Video Generation. 1706-1719 - Huanlong Zhang
, Jingchao Wang
, Jianwei Zhang
, Tianzhu Zhang
, Bineng Zhong
:
One-Stream Vision-Language Memory Network for Object Tracking. 1720-1730 - Shengping Zhang
, Xiaoyu Han
, Weigang Zhang
, Xiangyuan Lan, Hongxun Yao
, Qingming Huang
:
Limb-Aware Virtual Try-On Network With Progressive Clothing Warping. 1731-1746 - Shi-Xue Zhang
, Chun Yang, Xiaobin Zhu
, Xu-Cheng Yin
:
Arbitrary Shape Text Detection via Boundary Transformer. 1747-1760 - Xudong Tan
, Menghan Hu
, Guangtao Zhai
, Yan Zhu, Wenfang Li, Xiao-Ping Zhang
:
Lightweight Video-Based Respiration Rate Detection Algorithm: An Application Case on Intensive Care. 1761-1775 - Zhenxi Zhao
, Xinting Yang
, Jintao Liu
, Chao Zhou
, Chunjiang Zhao
:
GCVC: Graph Convolution Vector Distribution Calibration for Fish Group Activity Recognition. 1776-1789 - Sen Wu, Guoshuai Zhao
, Xueming Qian
:
Resolving Zero-Shot and Fact-Based Visual Question Answering via Enhanced Fact Retrieval. 1790-1800 - Pengcheng Lei
, Faming Fang
, Tieyong Zeng
, Guixu Zhang
:
Flow Guidance Deformable Compensation Network for Video Frame Interpolation. 1801-1812 - Haoyang Zhang
, Guixi Liu
, Yi Zhang
, Zhaohui Hao:
Robust Multi-Model Visual Tracking With Distractor-Aware Template-Coupled Correlation Filters Joint Learning. 1813-1828 - Bo Li
, Xiao Lin
, Bin Liu
, Zhi-Fen He
, Yu-Kun Lai
:
Lightweight Text-Driven Image Editing With Disentangled Content and Attributes. 1829-1841 - Chi Chen
, Ang Jin, Zhiye Wang, Yongwei Zheng, Bisheng Yang
, Jian Zhou
, Yuhang Xu, Zhigang Tu
:
SGSR-Net: Structure Semantics Guided LiDAR Super-Resolution Network for Indoor LiDAR SLAM. 1842-1854 - Huairui Wang
, Zhenzhong Chen
, Chang Wen Chen
:
Learned Video Compression via Heterogeneous Deformable Compensation Network. 1855-1866 - Wenhui Li
, Song Yang
, Qiang Li
, Xuanya Li
, An-An Liu
:
Commonsense-Guided Semantic and Relational Consistencies for Image-Text Retrieval. 1867-1880 - Qibing Qin
, Kezhen Xie, Wenfeng Zhang
, Chengduan Wang
, Lei Huang
:
Deep Neighborhood Structure-Preserving Hashing for Large-Scale Image Retrieval. 1881-1893 - Yutong Luo
, Xinyue Zhong, Minchen Zeng, Jialan Xie
, Shiyuan Wang, Guangyuan Liu
:
CGLF-Net: Image Emotion Recognition Network by Combining Global Self-Attention Features and Local Multiscale Features. 1894-1908 - Yanhua Yang, Rui Pan, Xiangyu Li, Xu Yang
, Cheng Deng
:
Dual-Stream Contrastive Learning for Compositional Zero-Shot Recognition. 1909-1919 - Yu Jiang
, Yuehang Wang
, Siqi Li
, Yongji Zhang
, Minghao Zhao
, Yue Gao
:
Event-Based Low-Illumination Image Enhancement. 1920-1931 - Zhuoran Du
, Shikui Wei
, Ting Liu
, Shunli Zhang, Xiaotong Chen
, Shiyin Zhang, Yao Zhao
:
Exploring the Applicability of Spectral Recovery in Semantic Segmentation of RGB Images. 1932-1943 - Zhichao Yang
, Leida Li
, Yuzhe Yang
, Yaqian Li
, Weisi Lin
:
Multi-Level Transitional Contrast Learning for Personalized Image Aesthetics Assessment. 1944-1956 - Tingting Wu
, Xiao Ding
, Hao Zhang
, Jinglong Gao
, Minji Tang, Li Du, Bing Qin
, Ting Liu:
DiscrimLoss: A Universal Loss for Hard Samples and Incorrect Samples Discrimination. 1957-1968 - Yueming Lyu
, Peibin Chen, Jingna Sun, Bo Peng
, Xu Wang
, Jing Dong
:
DRAN: Detailed Region-Adaptive Normalization for Conditional Image Synthesis. 1969-1982 - Jin Huang, Yongshun Gong
, Lu Zhang
, Jian Zhang
, Liqiang Nie
, Yilong Yin
:
Modeling Multiple Aesthetic Views for Series Photo Selection. 1983-1995 - Jiangfeng Du
, Silin Zhou, Jie Yu, Peng Han
, Shuo Shang
:
Cross-Task Multimodal Reinforcement for Long Tail Next POI Recommendation. 1996-2005 - Yuanhui Wang
, Ben Ye
, Zhanchuan Cai
:
Dynamic Template Updating Using Spatial-Temporal Information in Siamese Trackers. 2006-2015 - Jianhong Pan
, Siyuan Yang
, Lin Geng Foo
, Qiuhong Ke
, Hossein Rahmani
, Zhipeng Fan, Jun Liu
:
Progressive Channel-Shrinking Network. 2016-2026 - Donghua Chen
, Runtong Zhang
:
Building Multimodal Knowledge Bases With Multimodal Computational Sequences and Generative Adversarial Networks. 2027-2040 - Xingzheng Wang
, Kaiqiang Chen
, Zixuan Wang
, Wenhao Huang
:
PMSNet: Parallel Multi-Scale Network for Accurate Low-Light Light-Field Image Enhancement. 2041-2055 - Yinghui Xing
, Qirui Wu
, De Cheng
, Shizhou Zhang
, Guoqiang Liang
, Peng Wang
, Yanning Zhang
:
Dual Modality Prompt Tuning for Vision-Language Pre-Trained Model. 2056-2068 - Bin Wan
, Xiaofei Zhou
, Yaoqi Sun
, Tingyu Wang, Chengtao Lv
, Shuai Wang
, Haibing Yin
, Chenggang Yan
:
MFFNet: Multi-Modal Feature Fusion Network for V-D-T Salient Object Detection. 2069-2081 - Jie Xu
, Xiaoqian Zhang
, Changming Zhao
, Zili Geng
, Yuren Feng
, Ke Miao
, Yunji Li
:
Improving Fine-Grained Image Classification With Multimodal Information. 2082-2095 - Yanliang Jin
, Ze-Yu Ji
, Dan Zeng
, Xiao-Ping (Steven) Zhang
:
VWP:An Efficient DRL-Based Autonomous Driving Model. 2096-2108 - Qiang Qi
, Yan Yan
, Hanzi Wang
:
Class-Aware Dual-Supervised Aggregation Network for Video Object Detection. 2109-2123 - Peihao Wu, Wenqian Wang
, Faliang Chang
, Chunsheng Liu
, Bin Wang
:
DSS-Net: Dynamic Self-Supervised Network for Video Anomaly Detection. 2124-2136 - Zhiquan Wen
, Shuaicheng Niu, Ge Li
, Qingyao Wu
, Mingkui Tan
, Qi Wu
:
Test-Time Model Adaptation for Visual Question Answering With Debiased Self-Supervisions. 2137-2147 - Dongjie Ye
, Zhangkai Ni
, Wenhan Yang, Hanli Wang
, Shiqi Wang
, Sam Kwong
:
Glow in the Dark: Low-Light Image Enhancement With External Memory. 2148-2163 - Chuan Qin
, Xiaomeng Li, Zhenyi Zhang, Fengyong Li
, Xinpeng Zhang
, Guorui Feng
:
Print-Camera Resistant Image Watermarking With Deep Noise Simulation and Constrained Learning. 2164-2177 - Ying Zeng, Sijie Mai
, Wenjun Yan
, Haifeng Hu
:
Multimodal Reaction: Information Modulation for Cross-Modal Representation Learning. 2178-2191 - Dongdong Ni
, Zhenhong Jia
, Jie Yang
, Nikola K. Kasabov
:
Online Low-Light Sand-Dust Video Enhancement Using Adaptive Dynamic Brightness Correction and a Rolling Guidance Filter. 2192-2206 - Hangzhi Jiang
, Xin Zhang
, Shiming Xiang
:
Non-Maximum Suppression Guided Label Assignment for Object Detection in Crowd Scenes. 2207-2218 - Weizhi Xian
, Mingliang Zhou
, Bin Fang
, Tao Xiang
, Weijia Jia
, Bin Chen
:
Perceptual Quality Analysis in Deep Domains Using Structure Separation and High-Order Moments. 2219-2234 - Ziyu Chen
, Hanli Wang
, Chang Wen Chen
:
Self-Supervised Video Representation Learning by Serial Restoration With Elastic Complexity. 2235-2248 - Fuming Sun
, Peng Ren
, Bowen Yin
, Fasheng Wang
, Haojie Li
:
CATNet: A Cascaded and Aggregated Transformer Network for RGB-D Salient Object Detection. 2249-2262 - Jianbing Wu
, Hong Liu
, Wei Shi, Mengyuan Liu
, Wenhao Li
:
Style-Agnostic Representation Learning for Visible-Infrared Person Re-Identification. 2263-2275 - Zhentao He, Feng Shao
, Gang Chen, Xiongli Chai
, Yo-Sung Ho
:
SCFANet: Semantics and Context Feature Aggregation Network for 360° Salient Object Detection. 2276-2288 - Yu Sun
, Lubing Xu, Qian Bao
, Wu Liu
, Wenpeng Gao
, Yili Fu
:
Learning Monocular Regression of 3D People in Crowds via Scene-Aware Blending and De-Occlusion. 2289-2302 - Zhong Zhang
, Di He
, Shuang Liu
, Baihua Xiao
, Tariq S. Durrani
:
Completed Part Transformer for Person Re-Identification. 2303-2313 - Yuanzhi Wang
, Tao Lu
, Yuan Yao, Yanduo Zhang
, Zixiang Xiong
:
Learning to Hallucinate Face in the Dark. 2314-2326 - Rui Shi
, Tianxing Li
, Liguo Zhang
, Yasushi Yamaguchi
:
Visualization Comparison of Vision Transformers and Convolutional Neural Networks. 2327-2339 - Yepeng Tang
, Weining Wang
, Chunjie Zhang
, Jing Liu
, Yao Zhao
:
Temporal Action Proposal Generation With Action Frequency Adaptive Network. 2340-2353 - Sitong Su
, Junchen Zhu
, Lianli Gao
, Jingkuan Song
:
Utilizing Greedy Nature for Multimodal Conditional Image Synthesis in Transformers. 2354-2366 - Shuaiqi Jing
, Haonan Zhang
, Pengpeng Zeng
, Lianli Gao
, Jingkuan Song
, Heng Tao Shen
:
Memory-Based Augmentation Network for Video Captioning. 2367-2379 - Xiaobin Tan
, Simin Li, Shunyi Wang
, Yangyang Liu, Quan Zheng
, Jian Yang
:
Cooperative Bargaining Game Based Adaptive Video Multicast Over Mobile Edge Networks. 2380-2394 - Gangyang Hou
, Bo Ou
, Min Long
, Fei Peng
:
Separable Reversible Data Hiding for Encrypted 3D Mesh Models Based on Octree Subdivision and Multi-MSB Prediction. 2395-2407 - Mengya Han
, Yibing Zhan
, Yong Luo
, Han Hu
, Kehua Su
, Bo Du
:
Textual Enhanced Adaptive Meta-Fusion for Few-Shot Visual Recognition. 2408-2418 - Hao Tang
, Guoshuai Zhao
, Jing Gao
, Xueming Qian
:
Personalized Representation With Contrastive Loss for Recommendation Systems. 2419-2429 - Linfeng Xu
, Qingbo Wu
, Lili Pan
, Fanman Meng
, Hongliang Li
, Chiyuan He
, Hanxin Wang
, Shaoxu Cheng
, Yu Dai
:
Towards Continual Egocentric Activity Recognition: A Multi-Modal Egocentric Activity Dataset for Continual Learning. 2430-2443 - Jing Liu
, Zhiwei Fan
, Ziwen Yang
, Yuting Su, Xiaokang Yang
:
Multi-Stage Spatio-Temporal Fusion Network for Fast and Accurate Video Bit-Depth Enhancement. 2444-2455 - Yu Lu
, Feiyue Ni
, Haofan Wang
, Xiaofeng Guo
, Linchao Zhu
, Zongxin Yang
, Ruihua Song
, Lele Cheng
, Yi Yang:
Show Me a Video: A Large-Scale Narrated Video Dataset for Coherent Story Illustration. 2456-2466 - Wei Zhang, KangBin Zhou, Luyao Teng
, Feiyi Tang, NaiQi Wu
, Shaohua Teng
, Jian Li
:
Dynamic Confidence Sampling and Label Semantic Guidance Learning for Domain Adaptive Retrieval. 2467-2479 - Jingcheng Ke
, Jia Wang, Jun-Cheng Chen
, I-Hong Jhuo, Chia-Wen Lin
, Yen-Yu Lin
:
CLIPREC: Graph-Based Domain Adaptive Network for Zero-Shot Referring Expression Comprehension. 2480-2492 - Xi Yang
, Xiaoqi Wang
, Dong Yang
:
Improving Cross-Modal Constraints: Text Attribute Person Search With Graph Attention Networks. 2493-2503 - Jinghan Ru
, Jun Tian
, Chengwei Xiao
, Jingjing Li
, Heng Tao Shen
:
Imbalanced Open Set Domain Adaptation via Moving-Threshold Estimation and Gradual Alignment. 2504-2514 - Chen Hui
, Shengping Zhang
, Wenxue Cui
, Shaohui Liu
, Feng Jiang
, Debin Zhao
:
Rate-Adaptive Neural Network for Image Compressive Sensing. 2515-2530 - Yang Liu
, Xingming Zhang
, Janne Kauttonen
, Guoying Zhao
:
Uncertain Facial Expression Recognition via Multi-Task Assisted Correction. 2531-2543 - Yang Liu
, Yong Xu
, Peipei Wu
, Wenwu Wang
:
Labelled Non-Zero Diffusion Particle Flow SMC-PHD Filtering for Multi-Speaker Tracking. 2544-2559 - Yutao Liu
, Ke Gu
, Jingchao Cao
, Shiqi Wang
, Guangtao Zhai
, Junyu Dong
, Sam Kwong
:
UIQI: A Comprehensive Quality Evaluation Index for Underwater Images. 2560-2573 - Xinchen Ye
, Yanjun Guo
, Baoli Sun
, Rui Xu
, Zhihui Wang
, Haojie Li
:
C2ANet: Cross-Scale and Cross-Modality Aggregation Network for Scene Depth Super-Resolution. 2574-2584 - Fugui Fan
, Yuting Su, Liqiang Nie
, Peiguang Jing
, Daozheng Hong
, Yu Liu
:
Dual-Domain Aligned Deep Hierarchical Matrix Factorization Method for Micro-Video Multi-Label Classification. 2598-2607 - Jingang Shi
, Yusi Wang
, Zitong Yu
, Guanxin Li
, Xiaopeng Hong
, Fei Wang
, Yihong Gong
:
Exploiting Multi-Scale Parallel Self-Attention and Local Variation via Dual-Branch Transformer-CNN Structure for Face Super-Resolution. 2608-2620 - Fan Zhang
, Na Liu
, Fuqing Duan
:
Coarse-to-Fine Depth Super-Resolution With Adaptive RGB-D Feature Attention. 2621-2633 - Tongyao Jia
, Jiafeng Li
, Li Zhuo
, Tianjian Yu
:
Semi-Supervised Single-Image Dehazing Network via Disentangled Meta-Knowledge. 2634-2647 - Fen Xiao
, Zhengdong Pu
, Jiaqi Chen
, Xieping Gao
:
DGFNet: Depth-Guided Cross-Modality Fusion Network for RGB-D Salient Object Detection. 2648-2658 - Wentian Zhao
, Xinxiao Wu
:
Boosting Entity-Aware Image Captioning With Multi-Modal Knowledge Graph. 2659-2670 - Shaolin Su
, Hanhe Lin
, Vlad Hosu
, Oliver Wiedemann, Jinqiu Sun, Yu Zhu
, Hantao Liu
, Yanning Zhang
, Dietmar Saupe
:
Going the Extra Mile in Face Image Quality Assessment: A Novel Database and Model. 2671-2685 - Jian Wang
, Fan Li
, Xuchong Zhang
, Hongbin Sun
:
Adversarial Obstacle Generation Against LiDAR-Based 3D Object Detection. 2686-2699 - Zefeng Lu
, Ronghao Lin
, Haifeng Hu
:
Tri-Level Modality-Information Disentanglement for Visible-Infrared Person Re-Identification. 2700-2714 - Bo Hu
, Guang Zhu
, Leida Li
, Ji Gan
, Weisheng Li
, Xinbo Gao
:
Blind Image Quality Index With Cross-Domain Interaction and Cross-Scale Integration. 2729-2739 - Ronghao Lin
, Haifeng Hu
:
Dynamically Shifting Multimodal Representations via Hybrid-Modal Attention for Multimodal Sentiment Analysis. 2740-2755 - Zixi Wang, Fan Li, Yunfei Zhang, Yuan Zhang:
Low-Rate Feature Compression for Collaborative Intelligence: Reducing Redundancy in Spatial and Statistical Levels. 2756-2771 - Yanbiao Ma
, Licheng Jiao
, Fang Liu
, Shuyuan Yang
, Xu Liu
, Puhua Chen
:
Feature Distribution Representation Learning Based on Knowledge Transfer for Long-Tailed Classification. 2772-2784 - Mijanur Rahaman Palash, Bharat K. Bhargava
:
EMERSK -Explainable Multimodal Emotion Recognition With Situational Knowledge. 2785-2794 - Pei An
, Yucong Duan, Yuliang Huang, Jie Ma
, Yanfei Chen, Liheng Wang, You Yang
, Qiong Liu
:
SP-Det: Leveraging Saliency Prediction for Voxel-Based 3D Object Detection in Sparse Point Cloud. 2795-2808 - Xue Li
, Jiong Yu
, Shaochen Jiang
, Hongchun Lu
, Ziyang Li
:
MSViT: Training Multiscale Vision Transformers for Image Retrieval. 2809-2823 - Mengkun Liu
, Licheng Jiao
, Xu Liu
, Lingling Li
, Fang Liu
, Shuyuan Yang
, Xiangrong Zhang
:
Bio-Inspired Multi-Scale Contourlet Attention Networks. 2824-2837 - Ping Xu
, Lei Liu
, Haifeng Zheng
, Xin Yuan
, Chen Xu
, Lingyun Xue
:
Degradation-Aware Dynamic Fourier-Based Network for Spectral Compressive Imaging. 2838-2850 - Mingyi Yang
, Junyan Huo
, Xile Zhou
, Wenhan Qiao
, Shuai Wan
, Hao Wang
, Fuzheng Yang
:
Joint Rate-Distortion Optimization for Video Coding and Learning-Based In-Loop Filtering. 2851-2865 - Daizong Liu
, Wei Hu
, Xin Li
:
Robust Geometry-Dependent Attack for 3D Point Clouds. 2866-2877 - Li Wang, Tao Xie
, Xinyu Zhang
, Zhiqiang Jiang
, Linqi Yang
, Haoming Zhang
, Xiaoyu Li
, Yilong Ren
, Haiyang Yu
, Jun Li
, Huaping Liu
:
Auto-Points: Automatic Learning for Point Cloud Analysis With Neural Architecture Search. 2878-2893 - Decheng Liu
, Zeyang Zheng
, Chunlei Peng
, Yukai Wang
, Nannan Wang
, Xinbo Gao
:
Hierarchical Forgery Classifier on Multi-Modality Face Forgery Clues. 2894-2905 - Yunpeng Xiao
, Xuehong Li
, Qunqing Zhang
, Rui Lv
, Qian Li
, Rong Wang
:
Spreading Mosaic: An Image Restoration-Inspired Social Rumor Propagation Model. 2906-2917 - Honghao Dai
, Shanshan Gao
, Hong Huang
, Deqian Mao
, Chenhao Zhang
, Yuanfeng Zhou
:
An Adaptive Sample Assignment Network for Tiny Object Detection. 2918-2931 - Yan Zhang
, Yuning Su
, Xiaoying Sun
:
A QoE Physiological Measure of VR With Vibrotactile Feedback Based on Frontal Lobe Power Asymmetry. 2932-2942 - Shenjian Gong
, Jian Yang
, Shanshan Zhang
:
Adaptive Teaching for Cross-Domain Crowd Counting. 2943-2952 - Xiaoyu Kong
, Yongyong Chen
, Zhenyu He
:
When Channel Correlation Meets Sparse Prior: Keeping Interpretability in Image Compressive Sensing. 2953-2965 - Haixin Ding
, Shengchuan Zhang, Qiong Wu, Songlin Yu, Jie Hu, Liujuan Cao
, Rongrong Ji
:
Bilateral Knowledge Interaction Network for Referring Image Segmentation. 2966-2977 - Kaile Du
, Fan Lyu
, Linyan Li, Fuyuan Hu
, Wei Feng
, Fenglei Xu
, Xuefeng Xi
, Hanjing Cheng
:
Multi-Label Continual Learning Using Augmented Graph Convolutional Network. 2978-2992 - Kejun Wu
, You Yang
, Qiong Liu
, Gangyi Jiang
, Xiao-Ping Zhang
:
Hierarchical Independent Coding Scheme for Varifocal Multiview Images Based on Angular-Focal Joint Prediction. 2993-3006 - Kaijie Zhao
, Haitao Zhao
, Zhongze Wang
, Jingchao Peng
, Zhengwei Hu
:
Object-Preserving Siamese Network for Single-Object Tracking on Point Clouds. 3007-3017 - Sijie Mai
, Ya Sun
, Aolin Xiong
, Ying Zeng
, Haifeng Hu
:
Multimodal Boosting: Addressing Noisy Modalities and Identifying Modality Contribution. 3018-3033 - Yongchao Du
, Min Wang
, Wengang Zhou
, Houqiang Li
:
Progressive Similarity Preservation Learning for Deep Scalable Product Quantization. 3034-3045 - Zhiqiang Bao
, Zihao Chen
, Chang-Dong Wang
, Wei-Shi Zheng
, Zhenhua Huang
, Yunwen Chen
:
Post-Distillation via Neural Resuscitation. 3046-3060 - Qin Yang
, Yuqi Li
, Chenglin Li
, Hao Wang
, Sa Yan
, Li Wei
, Wenrui Dai
, Junni Zou
, Hongkai Xiong
, Pascal Frossard
:
SVGC-AVA: 360-Degree Video Saliency Prediction With Spherical Vector-Based Graph Convolution and Audio-Visual Attention. 3061-3076 - Xin Ma
, Chang Liu
, Chunyu Xie
, Long Ye
, Yafeng Deng
, Xiangyang Ji
:
Disjoint Masking With Joint Distillation for Efficient Masked Image Modeling. 3077-3087 - Tianhao Qi
, Hongtao Xie
, Pandeng Li
, Jiannan Ge
, Yongdong Zhang
:
Balanced Classification: A Unified Framework for Long-Tailed Object Detection. 3088-3101 - Zhen Long
, Ce Zhu
, Jie Chen
, Zihan Li
, Yazhou Ren
, Yipeng Liu
:
Multi-View MERA Subspace Clustering. 3102-3112 - Chuanming Wang
, Huiyuan Fu
, Huadong Ma
:
Learning Mutually Exclusive Part Representations for Fine-Grained Image Classification. 3113-3124 - Shuman Fang
, Zhiwen Lin
, Ke Yan
, Jie Li
, Xianming Lin
, Rongrong Ji
:
HODN: Disentangling Human-Object Feature for HOI Detection. 3125-3136 - Fengyong Li
, Yang Sheng
, Xinpeng Zhang
, Chuan Qin
:
iSCMIS:Spatial-Channel Attention Based Deep Invertible Network for Multi-Image Steganography. 3137-3152 - Di Li
, Susanto Rahardja
:
Learning Deep Representations for Photo Retouching. 3153-3163 - Yulai Xie
, Jingjing Niu
, Yang Zhang
, Fang Ren
:
Global-Shared Text Representation Based Multi-Stage Fusion Transformer Network for Multi-Modal Dense Video Captioning. 3164-3179 - Yan Dai
, Beitao Chen
, Lianli Gao
, Jingkuan Song
, Heng Tao Shen
:
DMH-CL: Dynamic Model Hardness Based Curriculum Learning for Complex Pose Estimation. 3180-3193 - Ke Nai
, Shaomiao Chen
:
Learning a Novel Ensemble Tracker for Robust Visual Tracking. 3194-3206 - Mingdao Wang
, Xueming Li, Siqi Chen
, Xianlin Zhang
, Lei Ma, Yue Zhang
:
Learning Representations by Contrastive Spatio-Temporal Clustering for Skeleton-Based Action Recognition. 3207-3220 - Shuai Shen
, Wanhua Li
, Xiaoke Huang
, Zheng Zhu
, Jie Zhou
, Jiwen Lu
:
SD-NeRF: Towards Lifelike Talking Head Animation via Spatially-Adaptive Dual-Driven NeRFs. 3221-3234 - Shiwei Wang
, Liquan Shen
, Jingyue Liu
:
Spatial-Temporal Inter-Layer Reference Frame Generation Network for Spatial SHVC. 3235-3250 - Guosong Zhu
, Zhen Qin
, Yi Ding
, Yao Liu
, Zhiguang Qin
:
MFNet:Real-Time Motion Focus Network for Video Frame Interpolation. 3251-3262 - Xiang Fang
, Daizong Liu
, Pan Zhou
, Zichuan Xu
, Ruixuan Li
:
Hierarchical Local-Global Transformer for Temporal Sentence Grounding. 3263-3277 - Huiwen Ren
, Shanshe Wang
, Siwei Ma
, Wen Gao:
SVT-AVS3: An Open-Source High-Performance AVS3 Encoder With Scalable Video Technology. 3291-3301 - Ke Xian
, Juewen Peng
, Zhiguo Cao
, Jianming Zhang
, Guosheng Lin
:
ViTA: Video Transformer Adaptor for Robust Video Depth Estimation. 3302-3316 - Lei Wei
, Shuai Wan
, Zhecheng Wang
, Fuzheng Yang
:
Near-Lossless Compression of Point Cloud Attribute Using Quantization Parameter Cascading and Rate-Distortion Optimization. 3317-3330 - Xiaofeng Yang
, Fayao Liu
, Guosheng Lin
:
Neural Logic Vision Language Explainer. 3331-3340 - Shuwei Shao
, Zhongcai Pei
, Weihai Chen
, Ran Li
, Zhong Liu
, Zhengguo Li
:
URCDC-Depth: Uncertainty Rectified Cross-Distillation With CutFlip for Monocular Depth Estimation. 3341-3353 - Yu Zhou
, Weikang Gong
, Yanjing Sun
, Leida Li
, Ke Gu
, Jinjian Wu
:
Quality Assessment for Stitched Panoramic Images via Patch Registration and Bidimensional Feature Aggregation. 3354-3365 - Yawen Zeng
, Ning Han
, Keyu Pan
, Qin Jin
:
Temporally Language Grounding With Multi-Modal Multi-Prompt Tuning. 3366-3377 - Mingzheng Feng
, Jianbo Su
:
Learning Multi-Layer Attention Aggregation Siamese Network for Robust RGBT Tracking. 3378-3391 - Zhengyun Lu
, Lu Jin
, Zechao Li
, Jinhui Tang
:
Self-Paced Relational Contrastive Hashing for Large-Scale Image Retrieval. 3392-3404 - Yang Yu
, Rongrong Ni
, Siyuan Yang
, Yao Zhao
, Alex C. Kot
:
Narrowing Domain Gaps With Bridging Samples for Generalized Face Forgery Detection. 3405-3417 - Ping Li
, Chenhan Zhang, Xianghua Xu
:
Fast Fourier Inception Networks for Occluded Video Prediction. 3418-3429 - Baoliang Chen
, Lingyu Zhu
, Hanwei Zhu
, Wenhan Yang
, Linqi Song
, Shiqi Wang
:
Gap-Closing Matters: Perceptual Quality Evaluation and Optimization of Low-Light Image Enhancement. 3430-3443 - Junlong Gao
, Jiguo Li
, Chuanmin Jia
, Shanshe Wang
, Siwei Ma
, Wen Gao:
Cross Modal Compression With Variable Rate Prompt. 3444-3456 - Zhiwei Zhao
, Bin Liu
, Yan Lu
, Qi Chu
, Nenghai Yu
, Chang Wen Chen
:
Joint Identity-Aware Mixstyle and Graph-Enhanced Prototype for Clothes-Changing Person Re-Identification. 3457-3468 - Fang Peng
, Xiaoshan Yang
, Linhui Xiao
, Yaowei Wang
, Changsheng Xu
:
SgVA-CLIP: Semantic-Guided Visual Adapting of Vision-Language Models for Few-Shot Image Classification. 3469-3480 - Haojie Ding
, Bin Wang
, Guoliang Kang
, Weijia Li
, Conghui He
, Yao Zhao
, Yunchao Wei
:
DropQueries: A Simple Way to Discover Comprehensive Segment Representations. 3481-3490 - Maregu Assefa
, Wei Jiang
, Jinyu Zhan
, Kumie Gedamu
, Getinet Yilma
, Melese Ayalew
, Deepak Adhikari
:
Audio-Visual Contrastive and Consistency Learning for Semi-Supervised Action Recognition. 3491-3504 - Yue Wu
, Jiaming Liu
, Maoguo Gong
, Zhixiao Liu
, Qiguang Miao
, Wenping Ma
:
MPCT: Multiscale Point Cloud Transformer With a Residual Network. 3505-3516 - Xiaogang Song
, Haoyue Hu, Li Liang
, Weiwei Shi, Guo Xie
, Xiaofeng Lu, Xinhong Hei
:
Unsupervised Monocular Estimation of Depth and Visual Odometry Using Attention and Depth-Pose Consistency Loss. 3517-3529 - Aoqi Li
, Saihui Hou
, Qingyuan Cai
, Yang Fu
, Yongzhen Huang
:
Gait Recognition With Drones: A Benchmark. 3530-3540 - Yanan Chen
, Ang Li
, Dan Wu
, Liang Zhou
:
Toward General Cross-Modal Signal Reconstruction for Robotic Teleoperation. 3541-3553 - Zining Chen
, Weiqiu Wang
, Zhicheng Zhao
, Fei Su
, Aidong Men, Yuan Dong
:
Cluster-Instance Normalization: A Statistical Relation-Aware Normalization for Generalizable Person Re-Identification. 3554-3566 - Tao Chen
, Yanrong Guo
, Shijie Hao
, Richang Hong
:
Semi-Supervised Domain Adaptation for Major Depressive Disorder Detection. 3567-3579 - Xiaoying Ding
, Zhao Chen
, Weisi Lin
, Zhenzhong Chen
:
Towards 3D Colored Mesh Saliency: Database and Benchmarks. 3580-3591 - Jiachen Yang
, Chen Cheng
, Shuai Xiao
, Guipeng Lan
, Jiabao Wen
:
High Fidelity Face-Swapping With Style ConvTransformer and Latent Space Selection. 3604-3615 - Yuan Gao
, Xin Li
, Hui Yan
:
Rethinking Graph Contrastive Learning: An Efficient Single-View Approach via Instance Discrimination. 3616-3625 - Siyu Liu
, Jian Cheng
, Ziying Xia
, Zhilong Xi
, Qin Hou
, Zhicheng Dong
:
HCM: Online Action Detection With Hard Video Clip Mining. 3626-3639 - Guipeng Lan
, Shuai Xiao
, Jiachen Yang
, Yanshuang Zhou
, Jiabao Wen
, Wen Lu
, Xinbo Gao
:
Image Aesthetics Assessment Based on Hypernetwork of Emotion Fusion. 3640-3650 - Binglu Wang
, Tianci Bu, Zaiyi Hu
, Le Yang
, Yongqiang Zhao
, Xuelong Li
:
Coarse-to-Fine Nutrition Prediction. 3651-3662 - Wenwu Yang
, Yeqing Zhao, Bailin Yang
, Jianbing Shen
:
Learning 3D Face Reconstruction From the Cycle-Consistency of Dynamic Faces. 3663-3675 - Kai Zhuang
, Qiang Li
, Yuan Yuan, Qi Wang
:
Multi-Domain Adaptation for Motion Deblurring. 3676-3688 - Gen Luo
, Yiyi Zhou
, Jiamu Sun
, Xiaoshuai Sun
, Rongrong Ji
:
A Survivor in the Era of Large-Scale Pretraining: An Empirical Study of One-Stage Referring Expression Comprehension. 3689-3700 - Ye Yao
, Ke Wang
, Qi Chang
, Shaowei Weng
:
Reversible Data Hiding in Encrypted Images Using Global Compression of Zero-Valued High Bit-Planes and Block Rearrangement. 3701-3714 - Cong Yu, Dongheng Zhang, Zhi Wu, Chunyang Xie, Zhi Lu, Yang Hu, Yan Chen:
MobiRFPose: Portable RF-Based 3D Human Pose Camera. 3715-3727 - Shili Zhou
, Weimin Tan
, Bo Yan
:
A Motion Distillation Framework for Video Frame Interpolation. 3728-3740 - Chunlei Peng
, Zimo Kong
, Decheng Liu
, Nannan Wang
, Xinbo Gao
:
Disguised Heterogeneous Face Generation With Iterative-Adversarial Style Unification. 3741-3753 - Zhengzhuo Xu
, Zenghao Chai
, Chengyin Xu
, Chun Yuan
, Haiqin Yang
:
Towards Effective Collaborative Learning in Long-Tailed Recognition. 3754-3764 - Jiaxu Leng
, Yiran Liu
, Xinbo Gao
, Zhihui Wang
:
CRNet: Context-guided Reasoning Network for Detecting Hard Objects. 3765-3777 - Xingxing Wei
, Shiji Zhao
:
Boosting Adversarial Transferability With Learnable Patch-Wise Masks. 3778-3787 - Dongliang Chen
, Guihua Wen
, Pengcheng Wen
, Pei Yang
, Rui Chen
, Cheng Li
:
Cross-Domain Sample Relationship Learning for Facial Expression Recognition. 3788-3798 - Tongbao Chen
, Wenmin Wang
, Zhe Jiang
, Ruochen Li
, Bingshu Wang
:
Cross-Modality Knowledge Calibration Network for Video Corpus Moment Retrieval. 3799-3813 - Haifeng Guo
, Sam Kwong
, Dongjie Ye
, Shiqi Wang
:
Enhanced Context Mining and Filtering for Learned Video Compression. 3814-3826 - Dongqing Wu
, Huihui Li
, Cang Gu
, Hang Liu, Cuili Xu
, Yinxuan Hou
, Lei Guo:
Feature First: Advancing Image-Text Retrieval Through Improved Visual Features. 3827-3841 - Huilin Zhu
, Jingling Yuan
, Xian Zhong
, Liang Liao
, Zheng Wang
:
Find Gold in Sand: Fine-Grained Similarity Mining for Domain-Adaptive Crowd Counting. 3842-3855 - Qiangqiang Shen
, Tingting Xu
, Yongsheng Liang
, Yongyong Chen
, Zhenyu He
:
Robust Tensor Recovery for Incomplete Multi-View Clustering. 3856-3870 - Zhenyu Weng
, Huiping Zhuang
, Fulin Luo
, Haizhou Li
, Zhiping Lin
:
Few-Shot Contrastive Transfer Learning With Pretrained Model for Masked Face Verification. 3871-3883 - Zerun Feng
, Zhimin Zeng
, Caili Guo
, Zheng Li
, Lin Hu
:
Learning From Noisy Correspondence With Tri-Partition for Cross-Modal Matching. 3884-3896 - Jiaming Liu
, Yue Wu
, Maoguo Gong
, Zhixiao Liu
, Qiguang Miao
, Wenping Ma
:
Inter-Modal Masked Autoencoder for Self-Supervised Learning on Point Clouds. 3897-3908 - Guangyong Gao
, Hui Zhang
, Zhihua Xia
, Xiangyang Luo
, Yun-Qing Shi:
Reversible Data Hiding-Based Contrast Enhancement With Multi-Group Stretching for ROI of Medical Image. 3909-3923 - Zhixuan Li
, Weining Ye
, Tingting Jiang
, Tie-Jun Huang
:
GIN: Generative INvariant Shape Prior for Amodal Instance Segmentation. 3924-3936 - Xiaochuan Li
, Baoyu Fan
, Runze Zhang
, Kun Zhao
, Zhenhua Guo
, Yaqian Zhao
, Rengang Li
:
Inexactly Matched Referring Expression Comprehension With Rationale. 3937-3950 - Jiale Cheng
, Dongzi Shi
, Chenyang Li
, Yu Li
, Hao Ni
, Lianwen Jin
, Xin Zhang
:
Skeleton-Based Gesture Recognition With Learnable Paths and Signature Features. 3951-3961 - Shidong Cao
, Wenhao Chai
, Shengyu Hao
, Yanting Zhang
, Hangyue Chen
, Gaoang Wang
:
DiffFashion: Reference-Based Fashion Design With Structure-Aware Transfer by Diffusion Models. 3962-3975 - Junyan Wang
, Yiqi Jiang
, Yang Long
, Xiuyu Sun
, Maurice Pagnucco
, Yang Song
:
Deconfounding Causal Inference for Zero-Shot Action Recognition. 3976-3986 - Duzhen Zhang
, Feilong Chen
, Jianlong Chang
, Xiuyi Chen
, Qi Tian
:
Structure Aware Multi-Graph Network for Multi-Modal Emotion Recognition in Conversations. 3987-3997 - Xu Wang
, Weifeng Kong, Qiudan Zhang
, You Yang
, Tiesong Zhao
, Jianmin Jiang
:
Distortion-Aware Self-Supervised Indoor 360$^{\circ }$ Depth Estimation via Hybrid Projection Fusion and Structural Regularities. 3998-4011 - Wenxue Cui
, Xiaopeng Fan
, Jian Zhang
, Debin Zhao
:
Deep Unfolding Network for Image Compressed Sensing by Content-Adaptive Gradient Updating and Deformation-Invariant Non-Local Modeling. 4012-4027 - Di Wang
, Changning Tian
, Xiao Liang
, Lin Zhao
, Lihuo He
, Quan Wang
:
Dual-Perspective Fusion Network for Aspect-Based Multimodal Sentiment Analysis. 4028-4038 - Yong Wang
, Hongbo Kang
, Doudou Wu
, Wenming Yang
, Longbin Zhang
:
Global and Local Spatio-Temporal Encoder for 3D Human Pose Estimation. 4039-4049 - Yixuan Lyu
, Hong Zhang
, Yan Li
, Hanyang Liu
, Yifan Yang
, Ding Yuan
:
UEDG:Uncertainty-Edge Dual Guided Camouflage Object Detection. 4050-4060 - Shao-Jie Zhang
, Jia-Hui Pan, Jibin Gao
, Wei-Shi Zheng
:
Adaptive Stage-Aware Assessment Skill Transfer for Skill Determination. 4061-4072 - Yan Ju
, Shan Jia
, Jialing Cai
, Haiying Guan
, Siwei Lyu
:
GLFF: Global and Local Feature Fusion for AI-Synthesized Image Detection. 4073-4085 - Yangyang Shu
, Qian Li
, Lingqiao Liu
, Guandong Xu
:
Semi-Supervised Adversarial Learning for Attribute-Aware Photo Aesthetic Assessment. 4086-4096 - Shulan Ruan
, Kun Zhang
, Le Wu
, Tong Xu
, Qi Liu
, Enhong Chen
:
Color Enhanced Cross Correlation Net for Image Sentiment Analysis. 4097-4109 - Harry Cheng
, Yangyang Guo
, Jianhua Yin
, Haonan Chen, Jiafang Wang
, Liqiang Nie
:
Audio-Driven Talking Video Frame Restoration. 4110-4122 - Song Tang
, Yuji Shi
, Zihao Song
, Mao Ye
, Changshui Zhang
, Jianwei Zhang
:
Progressive Source-Aware Transformer for Generalized Source-Free Domain Adaptation. 4138-4152 - Yuwu Lu
, Wai Keung Wong
, Chun Yuan
, Zhihui Lai
, Xuelong Li
:
Low-Rank Correlation Learning for Unsupervised Domain Adaptation. 4153-4167 - Xiaobin Tan
, Shunyi Wang
, Xiang Xu
, Quan Zheng
, Jian Yang
, Shuangwu Chen
:
DACOD360: Deadline-Aware Content Delivery for 360-Degree Video Streaming Over MEC Networks. 4168-4182 - Yunzuo Zhang
, Tian Zhang
, Cunyu Wu
, Ran Tao
:
Multi-Scale Spatiotemporal Feature Fusion Network for Video Saliency Prediction. 4183-4193 - Binwei Xu
, Haoran Liang
, Ronghua Liang
, Peng Chen
:
Synthesize Boundaries: A Boundary-Aware Self-Consistent Framework for Weakly Supervised Salient Object Detection. 4194-4205 - Zhengning Wu
, Tianyu He
, Xiaobo Xia
, Jun Yu
, Xu Shen, Tongliang Liu
:
Conditional Consistency Regularization for Semi-Supervised Multi-Label Image Classification. 4206-4216 - Zhiwei Ding
, Guilin Lan, Yanzhi Song
, Zhouwang Yang
:
SGIR: Star Graph-Based Interaction for Efficient and Robust Multimodal Representation. 4217-4229 - Jun Rao
, Xv Meng
, Liang Ding
, Shuhan Qi
, Xuebo Liu
, Min Zhang
, Dacheng Tao
:
Parameter-Efficient and Student-Friendly Knowledge Distillation. 4230-4241 - Ze Zhou
, Yinghui Sun
, Quansen Sun
, Chaobo Li
, Zhenwen Ren
:
Unit Correlation With Interactive Feature for Robust and Effective Tracking. 4242-4254 - Lin Yang
, Rangding Wang
, Dawen Xu
, Li Dong
, Songhan He
:
Centralized Error Distribution-Preserving Adaptive Steganography for HEVC. 4255-4270 - Liping Bao
, Longhui Wei
, Wengang Zhou
, Lin Liu
, Lingxi Xie
, Houqiang Li
, Qi Tian
:
Multi-Granularity Matching Transformer for Text-Based Person Search. 4281-4293 - Yuxuan Liu
, Hongwei Ge
, Zhen Wang
, Yaqing Hou
, Mingde Zhao
:
Clothes-Changing Person Re-Identification via Universal Framework With Association and Forgetting Learning. 4294-4307 - TianYu Ning
, Bineng Zhong
, Qihua Liang
, Zhenjun Tang
, Xianxian Li
:
Robust Tracking via Bidirectional Transduction With Mask Information. 4308-4319 - Zhejing Hu
, Xiao Ma
, Yan Liu, Gong Chen
, Yongxu Liu
, Roger B. Dannenberg
:
The Beauty of Repetition: An Algorithmic Composition Model With Motif-Level Repetition Generator and Outline-to-Music Generator in Symbolic Music Generation. 4320-4333 - Linhui Xiao
, Xiaoshan Yang
, Fang Peng
, Ming Yan
, Yaowei Wang
, Changsheng Xu
:
CLIP-VG: Self-Paced Curriculum Adapting of CLIP for Visual Grounding. 4334-4347 - Siran Chen
, Qinglin Xu
, Yue Ma
, Yu Qiao
, Yali Wang
:
Attentive Snippet Prompting for Video Retrieval. 4348-4359 - Yeqing Ren
, Haipeng Peng
, Lixiang Li
, Yixian Yang
:
Lightweight Voice Spoofing Detection Using Improved One-Class Learning and Knowledge Distillation. 4360-4374 - Jiacheng Wang
, Ping Liu
, Jingen Liu
, Wei Xu
:
Text-Guided Eyeglasses Manipulation With Spatial Constraints. 4375-4388 - Yuanzhi Liang
, Linchao Zhu
, Xiaohan Wang
, Yi Yang:
IcoCap: Improving Video Captioning by Compounding Images. 4389-4400 - Ping Ping
, Bobiao Guo
, Olano Teah Bloh
, Yingchi Mao
, Feng Xu
:
Hiding Multiple Images into a Single Image Using Up-Sampling. 4401-4415 - Xin Yang
, Chenyang Zhao
, Jinqi Yang
, Yong Song
, Yufei Zhao
:
Negative-Driven Training Pipeline for Siamese Visual Tracking. 4416-4429 - Hao Wu
, Lincong Fang
, Qian Yu
, Chengzhuan Yang
:
Learning Robust Point Representation for 3D Non-Rigid Shape Retrieval. 4430-4444 - Junjie Zhang
, Mingyan Wang
, Haoran Jiang
, Xinyu Zhang
, Chenggang Yan
, Dan Zeng
:
STAT: Multi-Object Tracking Based on Spatio-Temporal Topological Constraints. 4445-4457 - Tong Tang
, Zhiyang Yin
, Jie Li
, Honggang Wang, Dapeng Wu
, Ruyan Wang
:
End-to-End Distortion Modeling for Error-Resilient Screen Content Video Coding. 4458-4468 - Aoran Zhang
, Zhigang Ling
, Yaonan Wang
:
Multi-Layer Decoupling Attention Network for Weakly Supervised Object Localization. 4469-4479 - Xinyuan Qian
, Wei Xue
, Qiquan Zhang
, Ruijie Tao
, Haizhou Li
:
Deep Cross-Modal Retrieval Between Spatial Image and Acoustic Speech. 4480-4489 - Chaoyang Zhou, Zengmao Wang
, Xiaoping Zhang, Bo Du
:
Domain Complementary Adaptation by Leveraging Diversity and Discriminability From Multiple Sources. 4490-4501 - Han Zhang
, Yiding Li
, Xuelong Li
:
Constrained Bipartite Graph Learning for Imbalanced Multi-Modal Retrieval. 4502-4514 - Runsheng Wang
, Yuxuan Shi
, Hefei Ling
, Zongyi Li
, Chengxin Zhao, Bohao Wei
, He Li
, Ping Li
:
Gait Recognition With Multi-Level Skeleton-Guided Refinement. 4515-4526 - Ruibin Wang
, Xianghua Ying
, Bowei Xing
:
Exploiting Temporal Correlations for 3D Human Pose Estimation. 4527-4539 - Tian-Bao Li
, Yuting Su, Dan Song
, Wenhui Li
, Zhiqiang Wei
, An-An Liu
:
Progressive Fourier Adversarial Domain Adaptation for Object Classification and Retrieval. 4540-4553 - Tianwen Qian
, Ran Cui
, Jingjing Chen
, Pai Peng
, Xiaowei Guo
, Yu-Gang Jiang
:
Locate Before Answering: Answer Guided Question Localization for Video Question Answering. 4554-4563 - Wujie Zhou
, Yuqi Cai, Liting Zhang, Weiqing Yan
, Lu Yu:
UTLNet: Uncertainty-Aware Transformer Localization Network for RGB-Depth Mirror Segmentation. 4564-4574 - Guanhua Zheng
, Jitao Sang
, Changsheng Xu
:
TIF: Threshold Interception and Fusion for Compact and Fine-Grained Visual Attribution. 4575-4589 - Yuanhong Zhong
, Chenxu Zhang
, Xun Yang
, Shanshan Wang
:
Video Compressed Sensing Reconstruction via an Untrained Network with Low-Rank Regularization. 4590-4601 - Wenfeng Song
, Tangli Chu
, Shuai Li
, Nannan Li
, Aimin Hao
, Hong Qin
:
Joints-Centered Spatial-Temporal Features Fused Skeleton Convolution Network for Action Recognition. 4602-4616 - Yuanyuan Jiang
, Jianqin Yin
, Yonghao Dang
:
Leveraging the Video-Level Semantic Consistency of Event for Audio-Visual Event Localization. 4617-4627 - Yifei Zhang
, Chang Liu
, Yu Zhou
, Weiping Wang
, Qixiang Ye
, Xiangyang Ji
:
Beyond Instance Discrimination: Relation-Aware Contrastive Self-Supervised Learning. 4628-4640 - Jinsong Shi, Pan Gao
, Aljosa Smolic
:
Blind Image Quality Assessment via Transformer Predicted Error Map and Perceptual Quality Token. 4641-4651 - Xin Li
, Yiting Lu
, Zhibo Chen
:
FreqAlign: Excavating Perception-Oriented Transferability for Blind Image Quality Assessment From a Frequency Perspective. 4652-4666 - Yi Ke Yun
, Weisi Lin
:
Towards a Complete and Detail-Preserved Salient Object Detection. 4667-4680 - Fanzhao Lin
, Shiming Ge
, Kexin Bao
, Chenggang Yan
, Dan Zeng
:
Learning Shape-Biased Representations for Infrared Small Target Detection. 4681-4692 - Hui Wu
, Min Wang
, Wengang Zhou
, Houqiang Li
:
Structure Similarity Preservation Learning for Asymmetric Image Retrieval. 4693-4705 - Ke Zhang
, Yan Yang
, Jun Yu
, Hanliang Jiang
, Jianping Fan
, Qingming Huang
, Weidong Han
:
Multi-Task Paired Masking With Alignment Modeling for Medical Vision-Language Pre-Training. 4706-4721 - Ankur
, Rajeev Kumar
, Ajay K. Sharma
:
Bit-Plane Based Reversible Data Hiding in Encrypted Images Using Multi-Level Blocking With Quad-Tree. 4722-4735 - Zengbin Wang
, Saihui Hou
, Man Zhang
, Xu Liu
, Chunshui Cao
, Yongzhen Huang
:
GaitParsing: Human Semantic Parsing for Gait Recognition. 4736-4748 - Bo Qin
, Fanqing Meng
, Shijin Yuan
, Bin Mu
:
CAU: A Causality Attention Unit for Spatial-Temporal Sequence Forecast. 4749-4763 - Linfeng Tang
, Ziang Chen
, Jun Huang
, Jiayi Ma
:
CAMF: An Interpretable Infrared and Visible Image Fusion Network Based on Class Activation Mapping. 4776-4791 - Xuan Han
, Mingyu You
, Ping Lu
:
Improving the Conditional Fine-Grained Image Generation With Part Perception. 4792-4804 - Yue Lu
, Xingyu Chen
, Zhengxing Wu
, Min Tan
, Junzhi Yu
:
Binary Similarity Few-Shot Object Detection With Modeling of Hard Negative Samples. 4805-4818 - Jie Gui
, Xiaofeng Cong
, Lei He
, Yuan Yan Tang, James Tin-Yau Kwok
:
Illumination Controllable Dehazing Network based on Unsupervised Retinex Embedding. 4819-4830 - Mengkun Liu
, Licheng Jiao
, Xu Liu
, Lingling Li
, Fang Liu
, Shuyuan Yang
, Shuang Wang
, Biao Hou
:
Multi-Scale Contourlet Knowledge Guide Learning Segmentation. 4831-4845 - Xiaomeng Wang
, Honglong Chen
, Peng Sun
, Junjian Li
, Anqing Zhang
, Weifeng Liu
, Nan Jiang
:
AdvST: Generating Unrestricted Adversarial Images via Style Transfer. 4846-4858 - Tianrun Chen
, Chaotao Ding
, Lanyun Zhu
, Ying Zang
, Yiyi Liao
, Zejian Li
, Lingyun Sun
:
Reality3DSketch: Rapid 3D Modeling of Objects From Single Freehand Sketches. 4859-4870 - Kezhou Lin
, Xiaohan Wang
, Linchao Zhu
, Bang Zhang, Yi Yang:
SKIM: Skeleton-Based Isolated Sign Language Recognition With Part Mixing. 4271-4280 - Qiuping Jiang
, Yaozu Kang, Zhihua Wang
, Wenqi Ren
, Chongyi Li
:
Perception-Driven Deep Underwater Image Enhancement Without Paired Supervision. 4884-4897 - Renjie Pan
, Hua Yang
, Cunyan Li
, Jinhai Yang
:
Joint Intra & Inter-Grained Reasoning: A New Look Into Semantic Consistency of Image-Text Retrieval. 4912-4925 - Wei Lu
, Yujia Zhai
, Jiaze Han, Peiguang Jing
, Yu Liu
, Yuting Su:
VMemNet: A Deep Collaborative Spatial-Temporal Network With Attention Representation for Video Memorability Prediction. 4926-4937 - Huasheng Wang
, Jianxun Lou
, Xiaochang Liu
, Hongchen Tan
, Roger M. Whitaker
, Hantao Liu
:
SSPNet: Predicting Visual Saliency Shifts. 4938-4949 - Yingjiao Pei
, Zhongyuan Wang
, Na Li
, Heling Chen
, Baojin Huang
, Weiping Tu
:
Deep Hashing Network With Hybrid Attention and Adaptive Weighting for Image Retrieval. 4961-4973 - Lanxiao Wang
, Hongliang Li
, Minjian Zhang
, Heqian Qiu
, Fanman Meng
, Qingbo Wu
, Linfeng Xu
:
CrowdCaption++: Collective-Guided Crowd Scenes Captioning. 4974-4986 - Huihui Gong
, Minjing Dong
, Siqi Ma
, Seyit Camtepe
, Surya Nepal
, Chang Xu
:
Stealthy Physical Masked Face Recognition Attack via Adversarial Style Optimization. 5014-5025 - Lingzhi He
, Feng Li
, Runmin Cong
, Yao Zhao
:
Reflection Intensity Guided Single Image Reflection Removal and Transmission Recovery. 5026-5039 - Zijin Yang
, Kejiang Chen
, Kai Zeng
, Weiming Zhang
, Nenghai Yu
:
Provably Secure Robust Image Steganography. 5040-5053 - Xian Zhao
, Lei Huang
, Jie Nie
, Zhiqiang Wei
:
Towards Adaptive Multi-Scale Intermediate Domain via Progressive Training for Unsupervised Domain Adaptation. 5054-5064 - Wentao Ma
, Xinyi Wu
, Shan Zhao
, Tongqing Zhou
, Dan Guo
, Lichuan Gu
, Zhiping Cai
, Meng Wang
:
FedSH: Towards Privacy-Preserving Text-Based Person Re-Identification. 5065-5077 - Mu Wang
, Xingyan Chen
, Xu Yang, Shuai Peng, Yu Zhao
, Mingwei Xu
, Changqiao Xu
:
CoLive: Edge-Assisted Clustered Learning Framework for Viewport Prediction in 360$^{\circ }$ Live Streaming. 5078-5091 - Yueli Cui
, Gangyi Jiang
, Mei Yu
, Yeyao Chen
, Yo-Sung Ho
:
Stitched Wide Field of View Light Field Image Quality Assessment: Benchmark Database and Objective Metric. 5092-5107 - Hongbo Sun
, Xiangteng He
, Yuxin Peng
:
HCL: Hierarchical Consistency Learning for Webly Supervised Fine-Grained Recognition. 5108-5119 - Mengqi Yuan
, Gengyun Jia
, Bing-Kun Bao
:
GPT-Based Knowledge Guiding Network for Commonsense Video Captioning. 5147-5158 - Yiming Liu
, Mengxi Zhang
, Bo Jiang
, Bo Hou
, Dan Liu
, Jie Chen
, Heqing Lian
:
Flexible Alignment Super-Resolution Network for Multi-Contrast Magnetic Resonance Imaging. 5159-5169 - Linfei Wang
, Yibing Zhan
, Wei Liu
, Baosheng Yu
, Dapeng Tao
:
Bounding Box Vectorization for Oriented Object Detection With Tanimoto Coefficient Regression. 5181-5193 - Junyu Shi
, Jianqi Zhong
, Wenming Cao
:
Multi-Semantics Aggregation Network Based on the Dynamic-Attention Mechanism for 3D Human Motion Prediction. 5194-5206 - Zhenyu Wang
, Yunzhou Zhang
, Yan Liu
, Cao Qin
, Sonya A. Coleman
, Dermot Kerr
:
LARNet: Towards Lightweight, Accurate and Real-Time Salient Object Detection. 5207-5222 - Zehua Fu
, Wenhang Zuo
, Zhenghui Hu
, Qingjie Liu
, Yunhong Wang
:
Improving Multi-Person Pose Tracking With a Confidence Network. 5223-5233 - Jinhong Deng
, Xiaoyue Zhang
, Wen Li
, Lixin Duan
, Dong Xu
:
Cross-Domain Detection Transformer Based on Spatial-Aware and Semantic-Aware Token Alignment. 5234-5245 - Hong Liu
, Yongqing Sun
, Yukihiro Bandoh
, Masaki Kitahara, Shin'ichi Satoh
:
Deep Counterfactual Representation Learning for Visual Recognition Against Weather Corruptions. 5257-5272 - Yalan Qin
, Nan Pu
, Hanzhou Wu
:
EDMC: Efficient Multi-View Clustering via Cluster and Instance Space Learning. 5273-5283 - Yuan Bian
, Min Liu
, Xueping Wang
, Yi Tang
, Yaonan Wang
:
Occlusion-Aware Feature Recover Model for Occluded Person Re-Identification. 5284-5295 - Peifu Liu
, Tingfa Xu
, Huan Chen
, Shiyun Zhou
, Haolin Qin
, Jianan Li
:
Spectrum-Driven Mixed-Frequency Network for Hyperspectral Salient Object Detection. 5296-5310 - Yitao Peng
, Lianghua He
, Die Hu
, Yihang Liu
, Longzhen Yang
, Shaohua Shang
:
Hierarchical Dynamic Masks for Visual Explanation of Neural Networks. 5311-5325 - Junyi Wu
, Yan Huang
, Min Gao
, Zhipeng Gao
, Jianqiang Zhao
, Huiji Zhang
, Anguo Zhang
:
A Two-Stream Hybrid Convolution-Transformer Network Architecture for Clothing-Change Person Re-Identification. 5326-5339 - Kexin Tang
, Nuowen Kan
, Yuankun Jiang
, Chenglin Li
, Wenrui Dai
, Junni Zou
, Hongkai Xiong
:
Successor Feature-Based Transfer Reinforcement Learning for Video Rate Adaptation With Heterogeneous QoE Preferences. 5340-5357 - Wenwen Wei
, Ping Wei
, Jialu Qin
, Zhimin Liao
, Shuaijie Wang
, Xiang Cheng
, Meiqin Liu
, Nanning Zheng
:
3D Scene Graph Generation From Point Clouds. 5358-5368 - Yueheng Li
, Hao Chen
, Bowei Xu
, Zicheng Zhang
, Zhan Ma
:
Improving Adaptive Real-Time Video Communication via Cross-Layer Optimization. 5369-5382 - Keyan Ding, Rijin Zhong, Zhihua Wang
, Yang Yu
, Yuming Fang
:
Adaptive Structure and Texture Similarity Metric for Image Quality Assessment and Optimization. 5398-5409 - Yufan Hu
, Junyu Gao
, Jianfeng Dong
, Bin Fan
, Hongmin Liu
:
Exploring Rich Semantics for Open-Set Action Recognition. 5410-5421 - Bingyu Hu
, Jiawei Liu
, Kecheng Zheng
, Zheng-Jun Zha
:
Unleashing Knowledge Potential of Source Hypothesis for Source-Free Domain Adaptation. 5422-5434 - Mingze He
, Hongxia Wang, Fei Zhang
, Yuyuan Xiang
:
Exploring Accurate Invariants on Polar Harmonic Fourier Moments in Polar Coordinates for Robust Image Watermarking. 5435-5449 - Wenda Zhao
, Guang Hu, Fei Wei, Haipeng Wang
, You He
, Huchuan Lu
:
Attacking Defocus Detection With Blur-Aware Transformation for Defocus Deblurring. 5450-5460 - Ye Yao
, Linchao Huang
, Hui Wang
, Qi Chang
, Yizhi Ren
, Fengjun Xiao
:
Robust Adaptive Steganography Based on Adaptive STC-ECC. 5477-5489 - Xiangzeng Liu
, Kunpeng Liu
, Jianfeng Guo
, Peipei Zhao
, Yi-Ning Quan
, Qiguang Miao
:
Pose-Guided Attention Learning for Cloth-Changing Person Re-Identification. 5490-5498 - Kai Gao
, Ji-Hwei Horng
, Chin-Chen Chang
:
Reversible Data Hiding for Encrypted 3D Mesh Models With Secret Sharing Over Galois Field. 5499-5510 - Yuxia Wu
, Guoshuai Zhao
, Mingdi Li
, Zhuocheng Zhang
, Xueming Qian
:
Reason Generation for Point of Interest Recommendation Via a Hierarchical Attention-Based Transformer Model. 5511-5522 - Ge Zhu
, Jinbao Li
, Yahong Guo:
PriorNet: Two Deep Prior Cues for Salient Object Detection. 5523-5535 - Jingyang Lin
, Hang Hua
, Ming Chen
, Yikang Li
, Jenhao Hsiao
, Chiuman Ho
, Jiebo Luo
:
VideoXum: Cross-Modal Visual and Textural Summarization of Videos. 5548-5560 - Haoyue Shi
, Le Wang
, Sanping Zhou
, Gang Hua
, Wei Tang
:
Abnormal Ratios Guided Multi-Phase Self-Training for Weakly-Supervised Video Anomaly Detection. 5575-5587 - Hao Liu
, Jingjing Wu
, Feng Li
, Jianguo Jiang
, Richang Hong
:
SYRER: Synergistic Relational Reasoning for RGB-D Cross-Modal Re-Identification. 5600-5614 - Han Yan
, Haijun Zhang
, Zhao Zhang
:
Learning to Disentangle the Colors, Textures, and Shapes of Fashion Items: A Unified Framework. 5615-5629 - Yun Wang
, Lu Zhu
, Yuanyuan Liu
:
CFENet: Boosting Few-Shot Semantic Segmentation With Complementary Feature-Enhanced Network. 5630-5640 - Yan Hu, Xiaozhao Fang
, Peipei Kang
, Yonghao Chen
, Yuting Fang, Shengli Xie
:
Dual Noise Elimination and Dynamic Label Correlation Guided Partial Multi-Label Learning. 5641-5656 - Nanfeng Jiang
, Weiling Chen
, Jielian Lin
, Tiesong Zhao
, Chia-Wen Lin
:
Video Compression Artifacts Removal With Spatial-Temporal Attention-Guided Enhancement. 5657-5669 - An-An Liu
, Yingchen Zhai
, Ning Xu
, Hongshuo Tian
, Weizhi Nie
, Yongdong Zhang
:
Event-Aware Retrospective Learning for Knowledge-Based Image Captioning. 4898-4911 - Xi Yang
, Zihan Wang, Ziyu Wei, Dong Yang:
SCSP: An Unsupervised Image-to-Image Translation Network Based on Semantic Cooperative Shape Perception. 4950-4960 - Zexing Du
, Di He
, Xue Wang
, Qing Wang
:
Learning Semantics-Guided Representations for Scoring Figure Skating. 4987-4997 - Zhuo Zhang
, Hongfei Wang
, Jie Geng
, Xinyang Deng
, Wen Jiang
:
A New Data Augmentation Method Based on Mixup and Dempster-Shafer Theory. 4998-5013 - Jiahui Zhang
, Jinlong Shi
, Danping Zou
, Xin Shu
, Suqin Bai
, Jiawen Lu
, Haowei Zhu
, Jun Ni
, Yunhan Sun
:
EPM-Net: Efficient Feature Extraction, Point-Pair Feature Matching for Robust 6-D Pose Estimation. 5120-5130 - Yiqiao Mao
, Xiaoqiang Yan
, Jiaming Liu
, Yangdong Ye
:
ConGMC: Consistency-Guided Multimodal Clustering via Mutual Information Maximin. 5131-5146 - Jinguang Wang
, Shengsheng Qian
, Jun Hu
, Richang Hong
:
Comment-Context Dual Collaborative Masked Transformer Network for Fake News Detection. 5170-5180 - Jun Zhou
, Chi Xu
, Yuting Ge
, Li Cheng
:
Realistic Depth Image Synthesis for 3D Hand Pose Estimation. 5246-5256 - Yujie Fu
, Pengju Zhang
, Fulin Tang
, Yihong Wu
:
Covariant Peak Constraint for Accurate Keypoint Detection and Keypoint-Specific Descriptor Learning. 5383-5397 - Daizong Liu
, Jiahao Zhu
, Xiang Fang
, Zeyu Xiong
, Huan Wang
, Renfu Li
, Pan Zhou
:
Conditional Video Diffusion Network for Fine-Grained Temporal Sentence Grounding. 5461-5476 - Aoran Xiao
, Dayan Guan
, Xiaoqin Zhang
, Shijian Lu
:
Domain Adaptive LiDAR Point Cloud Segmentation With 3D Spatial Consistency. 5536-5547 - Wei Cong
, Yang Cong
, Jiahua Dong
, Gan Sun
, Henghui Ding
:
Gradient-Semantic Compensation for Incremental Semantic Segmentation. 5561-5574 - Nam Joon Kim
, Hyun Kim
:
Trunk Pruning: Highly Compatible Channel Pruning for Convolutional Neural Networks Without Fine-Tuning. 5588-5599 - Yajie Wang
, Mulin Chen
, Xuelong Li
:
Continuous Emotion-Based Image-to-Music Generation. 5670-5679 - Zhenghong Lin
, Qishan Yan
, Weiming Liu
, Shiping Wang
, Menghan Wang
, Yanchao Tan
, Carl Yang
:
Automatic Hypergraph Generation for Enhancing Recommendation With Sparse Optimization. 5680-5693 - Yuer Ma
, Yi Liu
, Limin Wang
, Wenxiong Kang
, Yu Qiao
, Yali Wang
:
Dual Masked Modeling for Weakly-Supervised Temporal Boundary Discovery. 5694-5704 - Jin Yang
, Ping Wei
, Ziyang Ren
, Nanning Zheng
:
Gated Multi-Scale Transformer for Temporal Action Localization. 5705-5717 - Wenqing Wang
, Yawei Luo
, Zhiqing Chen
, Tao Jiang
, Yi Yang, Jun Xiao
:
Taking a Closer Look At Visual Relation: Unbiased Video Scene Graph Generation With Decoupled Label Learning. 5718-5728 - Yihao Huang
, Felix Juefei-Xu
, Qing Guo
, Geguang Pu
, Yang Liu
:
Natural & Adversarial Bokeh Rendering via Circle-of-Confusion Predictive Network. 5729-5740 - Guanghui Yue
, Honglv Wu
, Qiuping Jiang
, Tianwei Zhou
, Weiqing Yan
, Tianfu Wang
:
Perceptual Quality Assessment of Retouched Face Images. 5741-5752 - Ruohong Huan
, Guowei Zhong
, Peng Chen
, Ronghua Liang
:
UniMF: A Unified Multimodal Framework for Multimodal Sentiment Analysis in Missing Modalities and Unaligned Multimodal Sequences. 5753-5768 - Yihong Chen
, Hao Zheng
, Yanchun Li
, Wanli Ouyang
, Jiang Zhu
:
Online Handwritten Chinese Character Recognition Based on 1-D Convolution and Two-Streams Transformers. 5769-5781 - Qing Ding
, Liquan Shen
, Liangwei Yu
, Hao Yang
, Mai Xu
:
Blind Quality Enhancement for Compressed Video. 5782-5794 - Wenying Wen
, Ziye Yuan
, Shuren Qi
, Yushu Zhang
, Yuming Fang
:
PPM-SEM: A Privacy-Preserving Mechanism for Sharing Electronic Patient Records and Medical Images in Telemedicine. 5795-5806 - Yubin Cho
, Hyunwoo Yu
, Suk-Ju Kang
:
Cross-Aware Early Fusion With Stage-Divided Vision and Language Transformer Encoders for Referring Image Segmentation. 5823-5833 - Jing Li
, Qianqian Wang
, Ming Yang
, Quanxue Gao
, Xinbo Gao
:
Efficient Anchor Graph Factorization for Multi-View Clustering. 5834-5845 - Jia-Wei Ma
, Min Liang
, Lei Chen
, Shu Tian
, Song-Lu Chen
, Jingyan Qin
, Xu-Cheng Yin
:
Sample Weighting with Hierarchical Equalization Loss for Dense Object Detection. 5846-5859 - Shibo Li
, Shuyuan Zhu
, Yao Ge
, Bing Zeng
, Muhammad Ali Imran
, Qammer H. Abbasi
, Jonathan M. Cooper
:
Depth-Guided Deep Video Inpainting. 5860-5871 - Ying Luo
, Guoliang Kang
, Kexin Liu
, Fuzhen Zhuang
, Jinhu Lü
:
Taking a Closer Look at Factor Disentanglement: Dual-Path Variational Autoencoder Learning for Domain Generalization. 5872-5883 - XiuYu Zhang
, Minrui Xu
, Rui Tan
, Dusit Niyato
:
Learning-Based Auction for Matching Demand and Supply of Holographic Digital Twin Over Immersive Communications. 5884-5896 - Yawen Cui
, Zitong Yu
, Wei Peng
, Qi Tian
, Li Liu
:
Rethinking Few-Shot Class-Incremental Learning With Open-Set Hypothesis in Hyperbolic Geometry. 5897-5910 - Yun Zhang
, Haoqin Lin
, Jing Sun
, Linwei Zhu
, Sam Kwong
:
Learning to Predict Object-Wise Just Recognizable Distortion for Image and Video Compression. 5925-5938 - Minda Zhao
, Xingqun Qi
, Zhipeng Hu
, Lincheng Li
, Yongqiang Zhang
, Zi Huang
, Xin Yu
:
Calligraphy Font Generation via Explicitly Modeling Location-Aware Glyph Component Deformations. 5939-5950 - Qide Wang
, Daxin Liu
, Zhenyu Liu
, Jiatong Xu
, Jianrong Tan
:
3D Object Segmentation Using Cross-Window Point Transformer With Latent Semantic Boundary Guidance. 5951-5961 - Chi Ji
, Guangyong Gao
, Yunqing Shi:
Reversible Data Hiding in Encrypted Images With Adaptive Huffman Code Based on Dynamic Prediction Axes. 5962-5975 - Chunyi Zhou
, Dekang Liu
, Tianlei Wang
, Jiangmin Tian
, Jiuwen Cao
:
M$^{3}$ANet: Multi-Modal and Multi-Attention Fusion Network for Ship License Plate Recognition. 5976-5986 - Zhizhe Liu
, Zhenfeng Zhu
, Shuai Zheng
, Yawei Zhao
, Kunlun He
, Yao Zhao
:
From Observation to Concept: A Flexible Multi-View Paradigm for Medical Report Generation. 5987-5995 - Laijin Meng
, Xinghao Jiang
, Tanfeng Sun
, Zeyu Zhao
, Qiang Xu
:
A Robust Coverless Video Steganography Based on the Similarity of Inter-Frames. 5996-6011 - Wang Tang
, Linbo Qing
, Lindong Li
, Yuchen Wang
, Ce Zhu
:
Progressive Graph Reasoning-Based Social Relation Recognition. 6012-6024 - Guang Han
, Min Lin
, Ziyang Li
, Haitao Zhao
, Sam Kwong
:
Text-to-Image Person Re-Identification Based on Multimodal Graph Convolutional Network. 6025-6036 - Jiyou Chen
, Gaobo Yang
, Shengchun Wang
, Dewang Wang
, Xin Liao
:
Image Dehazing Assessment: A Real-World Dataset and a Haze Density-Aware Criteria. 6037-6049 - Fawei Ge
, Yunzhou Zhang
, Li Wang
, Sonya Coleman
, Dermot Kerr
:
Double-Domain Adaptation Semantics for Retrieval-Based Long-Term Visual Localization. 6050-6064 - Wenju Xu
, Chengjiang Long
, Yongwei Nie
, Guanghui Wang
:
Disentangled Representation Learning for Controllable Person Image Generation. 6065-6077 - Junxia Li
, Deshuo Shi
, Ying Cui
, Dongyan Guo
, Qingshan Liu
:
Adaptive Activation Network for Weakly Supervised Semantic Segmentation. 6078-6089 - Lizhi Xiong
, Jianhua Xu
, Ching-Nung Yang
, Xinpeng Zhang
:
CMCF-Net: An End-to-End Context Multiscale Cross-Fusion Network for Robust Copy-Move Forgery Detection. 6090-6101 - Xiuli Chai
, Yakun Ma
, Yinjing Wang
, Zhihua Gan
, Yushu Zhang
:
TPE-ADE: Thumbnail-Preserving Encryption Based on Adaptive Deviation Embedding for JPEG Images. 6102-6116 - Youze Wang
, Wenbo Hu
, Richang Hong
:
Iterative Adversarial Attack on Image-Guided Story Ending Generation. 6117-6130 - Yi Cheng
, Hehe Fan
, Dongyun Lin
, Ying Sun
, Mohan S. Kankanhalli
, Joo-Hwee Lim
:
Keyword-Aware Relative Spatio-Temporal Graph Networks for Video Question Answering. 6131-6141 - Hao Feng
, Shaokai Liu
, Jiajun Deng
, Wengang Zhou
, Houqiang Li
:
Deep Unrestricted Document Image Rectification. 6142-6154 - Jinyang Liu
, Shutao Li
, Renwei Dian
, Ze Song
:
Focus Relationship Perception for Unsupervised Multi-Focus Image Fusion. 6155-6165 - Jiajun Huang
, Chengbin Du
, Xinqi Zhu
, Siqi Ma
, Surya Nepal
, Chang Xu
:
Anti-Compression Contrastive Facial Forgery Detection. 6166-6177 - Yuanhong Zhong
, Guangxia Yang
, Daidi Zhong
, Xun Yang
, Shanshan Wang
:
Frame-Padded Multiscale Transformer for Monocular 3D Human Pose Estimation. 6191-6201 - Siyu Zhang
, Yeming Chen
, Yaoru Sun
, Fang Wang
, Haibo Shi
, Haoran Wang
:
LOIS: Looking Out of Instance Semantics for Visual Question Answering. 6202-6214 - Shankhanil Mitra
, Saiyam Jogani
, Rajiv Soundararajan
:
Semi-Supervised Learning of Perceptual Video Quality by Generating Consistent Pairwise Pseudo-Ranks. 6215-6227 - Zizheng Xun
, Shangzhe Di
, Yulu Gao, Zongheng Tang
, Gang Wang
, Si Liu
, Bo Li
:
Linker: Learning Long Short-term Associations for Robust Visual Tracking. 6228-6237 - Honglei Su
, Qi Liu
, Hui Yuan
, Qiang Cheng
, Raouf Hamzaoui
:
Support Vector Regression-Based Reduced- Reference Perceptual Quality Model for Compressed Point Clouds. 6238-6249 - Yabo Liu
, Jinghua Wang
, Weijia Wang
, Yu Hu
, Yaowei Wang
, Yong Xu
:
CRADA: Cross Domain Object Detection With Cyclic Reconstruction and Decoupling Adaptation. 6250-6261 - Bo Peng
, Guoting Lin
, Jianjun Lei
, Tianyi Qin
, Xiaochun Cao
, Nam Ling
:
Contrastive Multi-View Learning for 3D Shape Clustering. 6262-6272 - Xi Yang
, Menghui Tian, Meijie Li, Ziyu Wei
, Liu Yuan, Nannan Wang
, Xinbo Gao
:
SSRR: Structural Semantic Representation Reconstruction for Visible-Infrared Person Re-Identification. 6273-6284 - Yabo Liu
, Jinghua Wang
, Sheng-hua Zhong
, Lianyang Ma
, Yong Xu
:
Fine-Grained Representation Alignment for Zero-Shot Domain Adaptation. 6285-6296 - Ming Li
, Huazhu Fu
, Shengfeng He
, Hehe Fan
, Jun Liu
, Jussi Keppo
, Mike Zheng Shou
:
DR-FER: Discriminative and Robust Representation Learning for Facial Expression Recognition. 6297-6309 - Nana Yu
, Hong Shi
, Yahong Han
:
Joint Correcting and Refinement for Balanced Low-Light Image Enhancement. 6310-6324 - Chenghao Xu
, Jiexi Yan
, Yanhua Yang
, Cheng Deng
:
Implicit Compositional Generative Network for Length-Variable Co-Speech Gesture Synthesis. 6325-6335 - Dayoung Chun
, Seungil Lee, Hyun Kim
:
USD: Uncertainty-Based One-Phase Learning to Enhance Pseudo-Label Reliability for Semi-Supervised Object Detection. 6336-6347 - Ying Lv
, Zhi Liu
, Gongyang Li
:
Context-Aware Interaction Network for RGB-T Semantic Segmentation. 6348-6360 - Qibing Qin
, Yadong Huo
, Lei Huang
, Jiangyan Dai
, Huihui Zhang
, Wenfeng Zhang
:
Deep Neighborhood-Preserving Hashing With Quadratic Spherical Mutual Information for Cross-Modal Retrieval. 6361-6374 - Yuxiang Lu
, Shalayiding Sirejiding
, Yue Ding
, Chunlin Wang
, Hongtao Lu
:
Prompt Guided Transformer for Multi-Task Dense Prediction. 6375-6385 - Lifang Wu
, Meng Tian
, Ye Xiang
, Ke Gu
, Ge Shi
:
Learning Label Semantics for Weakly Supervised Group Activity Recognition. 6386-6397 - Weiling Chen
, Boqin Cai
, Sumei Zheng
, Tiesong Zhao
, Ke Gu
:
Perception-and-Cognition-Inspired Quality Assessment for Sonar Image Super-Resolution. 6398-6410 - Juncheng Zhang
, Qingmin Liao
, Haoyu Ma
, Jing-Hao Xue
, Wenming Yang
, Shaojun Liu
:
Exploit the Best of Both End-to-End and Map-Based Methods for Multi-Focus Image Fusion. 6411-6423 - Long Peng
, Yang Cao
, Yuejin Sun
, Yang Wang
:
Lightweight Adaptive Feature De-Drifting for Compressed Image Classification. 6424-6436 - Jia-Nan Li
, Xiao-Qian Liu
, Xin Luo
, Xin-Shun Xu
:
VOLTER: Visual Collaboration and Dual-Stream Fusion for Scene Text Recognition. 6437-6448 - Chao Tian
, Zikun Zhou
, Yuqing Huang
, Gaojun Li
, Zhenyu He
:
Cross-Modality Proposal-Guided Feature Mining for Unregistered RGB-Thermal Pedestrian Detection. 6449-6461 - Jeong Hun Yeo
, Minsu Kim
, Jeongsoo Choi
, Dae Hoe Kim
, Yong Man Ro
:
AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model. 6462-6474 - Xiaoqiang Zhou
, Huaibo Huang
, Zilei Wang
, Ran He
:
RISTRA: Recursive Image Super-Resolution Transformer With Relativistic Assessment. 6475-6487 - Xin Wen
, Weizhi Nie
, Jing Liu
, Yuting Su, Yongdong Zhang
, An-An Liu
:
CDCM: ChatGPT-Aided Diversity-Aware Causal Model for Interactive Recommendation. 6488-6500 - Runmin Cong
, Hang Xiong
, Jinpeng Chen
, Wei Zhang
, Qingming Huang
, Yao Zhao
:
Query-Guided Prototype Evolution Network for Few-Shot Segmentation. 6501-6512 - Runmin Wang
, Zhenlin Zhu
, Yanbin Zhu
, Hua Chen
, Yongzhong Liao
, Ziyu Zhu
, Yajun Ding
, Changxin Gao
, Nong Sang
:
DIMGNet: A Transformer-Based Network for Pedestrian Reidentification With Multi-Granularity Information Mutual Gain. 6513-6528 - Fei Hu
, Yibo Ma
, Wei Zhong
, Long Ye
, Xinyan Yang
, Li Fang
, Qin Zhang
:
A Dataset and Benchmark for 3D Scene Plausibility Assessment. 6529-6541 - Qi Cui
, Zhili Zhou
, Ruohan Meng
, Shaowei Wang
, Hongyang Yan
, Q. M. Jonathan Wu
:
ARES: On Adversarial Robustness Enhancement for Image Steganographic Cost Learning. 6542-6553 - Tianyi Wang
, Zian Li
, Ruixia Liu
, Yinglong Wang
, Liqiang Nie
:
An Efficient Attribute-Preserving Framework for Face Swapping. 6554-6565 - Yan Zhang
, Lu Zhang
, Xin Zhao
, Hongyong Fu
, Dequan Yu
:
Automatic Point Cloud Registration for 3D Virtual-to-Real Registration Using Macro and Micro Structures. 6566-6581 - Naiyu Fang
, Lemiao Qiu
, Shuyou Zhang
, Zili Wang
, Kerui Hu
:
PG-VTON: A Novel Image-Based Virtual Try-On Method via Progressive Inference Paradigm. 6595-6608 - Dixuan Lin
, Yi-Xing Peng
, Jingke Meng
, Wei-Shi Zheng
:
Cross-Modal Adaptive Dual Association for Text-to-Image Person Retrieval. 6609-6620 - Bing Cai
, Gui-Fu Lu
, Hua Li
, Weihong Song
:
Tensorized Scaled Simplex Representation for Multi-View Clustering. 6621-6631 - Kang Chen
, Lei Yu
:
Motion Deblur by Learning Residual From Events. 6632-6647 - Yonghua Pan
, Jing Liu
, Lu Jin
, Zechao Li
:
Unbiased Visual Question Answering by Leveraging Instrumental Variable. 6648-6662 - Yuezhou Li
, Rui Xu
, Yuzhen Niu
, Wenzhong Guo
, Tiesong Zhao
:
Perceptual Decoupling With Heterogeneous Auxiliary Tasks for Joint Low-Light Image Enhancement and Deblurring. 6663-6675 - Yuefang Gao
, Yuhao Xie
, Zeke Zexi Hu
, Tianshui Chen
, Liang Lin
:
Adaptive Global-Local Representation Learning and Selection for Cross-Domain Facial Expression Recognition. 6676-6688 - Ning Xu
, Zimu Lu
, Hongshuo Tian
, Rongbao Kang
, Jinbo Cao
, Yongdong Zhang
, An-An Liu
:
Learning to Supervise Knowledge Retrieval Over a Tree Structure for Visual Question Answering. 6689-6700 - Shuai Guo
, Jingchuan Hu
, Kai Zhou
, Jionghao Wang
, Li Song
, Rong Xie
, Wenjun Zhang
:
Real-Time Free Viewpoint Video Synthesis System Based on DIBR and a Depth Estimation Network. 6701-6716 - Yuxiang Shao
, Feifei Zhang
, Changsheng Xu
:
Snippet-to-Prototype Contrastive Consensus Network for Weakly Supervised Temporal Action Localization. 6717-6729 - Ali Ak
, Emin Zerman
, Maurice Quach
, Aladine Chetouani
, Aljosa Smolic
, Giuseppe Valenzise
, Patrick Le Callet
:
BASICS: Broad Quality Assessment of Static Point Clouds in a Compression Scenario. 6730-6742 - Junpeng Tan
, Xiaojun Yang
, Zhijing Yang
, Ruihan Chen
, Yongyi Lu
, Liang Lin
:
Extensible Max-Min Collaborative Retention for Online Mini-Batch Learning Hash Retrieval. 6743-6758 - Fanfan Ji
, Xiao-Tong Yuan
, Qingshan Liu
:
Soft Weight Pruning for Cross-Domain Few-Shot Learning With Unlabeled Target Data. 6759-6769 - Pei He
, Licheng Jiao
, Fang Liu
, Xu Liu
, Ronghua Shang
, Shuang Wang
:
Cross-Domain Scene Unsupervised Learning Segmentation With Dynamic Subdomains. 6770-6784 - Zixin Yin
, Jiakai Wang
, Yisong Xiao
, Hanqing Zhao
, Tianlin Li
, Wenbo Zhou
, Aishan Liu
, Xianglong Liu
:
Improving Deepfake Detection Generalization by Invariant Risk Minimization. 6785-6798 - Pei An
, Di Zhu, Siwen Quan
, Junfeng Ding
, Jie Ma
, You Yang
, Qiong Liu
:
ESC-Net: Alleviating Triple Sparsity on 3D LiDAR Point Clouds for Extreme Sparse Scene Completion. 6799-6810 - Myung Han Hyun
, Bumshik Lee
, Munchurl Kim
:
A VVC Intra Rate Control With Small Bit Fluctuations Using a Lagrange Multiplier Adjustment. 6811-6821 - Haofan Lu
, Shuiping Gou
, Ruimin Li
:
SPMHand: Segmentation-Guided Progressive Multi-Path 3D Hand Pose and Shape Estimation. 6822-6833 - Junqi Liao
, Li Li
, Dong Liu
, Houqiang Li
:
Content-Adaptive Rate-Distortion Modeling for Frame-Level Rate Control in Versatile Video Coding. 6864-6879 - Jianxin Lin
, Wei Zhao
, Yijun Wang
:
Visual Correspondence Learning and Spatially Attentive Synthesis via Transformer for Exemplar-Based Anime Line Art Colorization. 6880-6890 - Yuwu Lu
, Haoyu Huang
, Biqing Zeng
, Zhihui Lai
, Xuelong Li
:
Multi-Source and Multi-Target Domain Adaptation Based on Dynamic Generator with Attention. 6891-6905 - Wenxuan Wang
, Xingjian He
, Yisi Zhang
, Longteng Guo
, Jiachen Shen
, Jiangyun Li
, Jing Liu
:
CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation. 6906-6916 - Jui-Chiu Chiang
, Yu-Tze Wu
, Hsin-Yun Hsieh
, Yun-Chang Tsai:
Enhanced Temporal Consistency for Global Patch Allocation in Video-Based Point Cloud Compression. 6917-6930 - Jianan Li
, Jie Wang
, Tingfa Xu
:
PointGL: A Simple Global-Local Framework for Efficient Point Cloud Analysis. 6931-6942 - Yaochi Zhao
, Sen Chen
, Shiguang Liu
, Zhuhua Hu
, Jingwen Xia
:
Hierarchical Equalization Loss for Long-Tailed Instance Segmentation. 6943-6955 - Bing Yang
, Xueqin Xiang
, Wanzeng Kong
, Jianhai Zhang
, Yong Peng
:
DMF-GAN: Deep Multimodal Fusion Generative Adversarial Networks for Text-to-Image Synthesis. 6956-6967 - Jiankai Li
, Yunhong Wang
, Weixin Li
:
MHRN: A Multimodal Hierarchical Reasoning Network for Topic Detection. 6968-6980 - Fu-Zhao Ou
, Xingyu Chen
, Kai Zhao
, Shiqi Wang
, Yuan-Gen Wang
, Sam Kwong
:
Refining Uncertain Features With Self-Distillation for Face Recognition and Person Re-Identification. 6981-6995 - Sihui Zhang
, Yi Tian
, Yilei Zhang
, Mei Tian
, Yaping Huang
:
Domain-Consistent and Uncertainty-Aware Network for Generalizable Gaze Estimation. 6996-7011 - Zizheng Yang
, Jie Huang
, Man Zhou
, Naishan Zheng
, Feng Zhao
:
IRVR: A General Image Restoration Framework for Visual Recognition. 7012-7026 - Ronghui Zhang
, Jiongze Yu
, Junzhou Chen
, Guofa Li
, Liang Lin
, Danwei Wang
:
A Prior Guided Wavelet-Spatial Dual Attention Transformer Framework for Heavy Rain Image Restoration. 7043-7057 - Yi Huang
, Jiancheng Huang
, Jianzhuang Liu
, Mingfu Yan
, Yu Dong
, Jiaxi Lv
, Chaoqi Chen
, Shifeng Chen
:
WaveDM: Wavelet-Based Diffusion Models for Image Restoration. 7058-7073 - Limin Zheng
, Yu Luo
, Zihan Zhou
, Jie Ling
, Guanghui Yue
:
CDINet: Content Distortion Interaction Network for Blind Image Quality Assessment. 7089-7100 - Jianwei Lu
, Guohua Wang, Yi Cai
, Xin Wu
:
Towards Automated Infographic Authoring From Natural Language Statement With Multiple Proportional Facts. 7101-7113 - Xiaofei Zhou
, Zhicong Wu
, Runmin Cong
:
Decoupling and Integration Network for Camouflaged Object Detection. 7114-7129 - Zhi Han
, Yanmei Wang
, Shaojie Zhang, Huijie Fan
, Yandong Tang
, Yao Wang
:
Online Video Sparse Noise Removing via Nonlocal Robust PCA. 7130-7145 - Xiaoyu Guo
, Wei Xiang
, Shunli Zhang
, Wei Lu
, Weiwei Xing
:
DCRP: Class-Aware Feature Diffusion Constraint and Reliable Pseudo-Labeling for Imbalanced Semi-Supervised Learning. 7146-7159 - Xiaoqian Zhang
, Chao Luo
, Xiao Wang, Jinghao Li, Shuai Zhao
, Daojian Jiang:
Learnable Tensor Graph Fusion Framework for Natural Image Segmentation. 7160-7173 - Wenjun Hui
, Zhenfeng Zhu
, Guanghua Gu
, Meiqin Liu
, Yao Zhao
:
Implicit-Explicit Motion Learning for Video Camouflaged Object Detection. 7188-7196 - Shuai Chen
, Fanman Meng
, Runtong Zhang
, Heqian Qiu
, Hongliang Li
, Qingbo Wu
, Linfeng Xu
:
Visual and Textual Prior Guided Mask Assemble for Few-Shot Segmentation and Beyond. 7197-7209 - Jin Yuan
, Feng Hou
, Ying Yang
, Yang Zhang
, Zhongchao Shi
, Xin Geng
, Jianping Fan
, Zhiqiang He
, Yong Rui
:
Domain-Aware Graph Network for Bridging Multi-Source Domain Adaptation. 7210-7224 - Chengyang Li
, Baoping Cheng
, Yao Cheng
, Haocheng Zhang
, Renshuai Liu
, Yinglin Zheng
, Jing Liao
, Xuan Cheng
:
FaceRefiner: High-Fidelity Facial Texture Refinement With Differentiable Rendering-Based Style Transfer. 7225-7236 - Xi Yang
, Xian Wang
, Liangchen Liu
, Nannan Wang
, Xinbo Gao
:
STFE: A Comprehensive Video-Based Person Re-Identification Network Based on Spatio-Temporal Feature Enhancement. 7237-7249 - Minglu Zhao
, Wenmin Wang
, Tongbao Chen
, Rui Zhang
, Ruochen Li
:
TA2V: Text-Audio Guided Video Generation. 7250-7264 - Ziqi Yuan
, Baozheng Zhang
, Hua Xu
, Kai Gao
:
Meta Noise Adaption Framework for Multimodal Sentiment Analysis With Feature Noise. 7265-7277 - Xin Liu
, Yuting Zhang
, Zitong Yu
, Hao Lu
, Huanjing Yue
, Jingyu Yang
:
rPPG-MAE: Self-Supervised Pretraining With Masked Autoencoders for Remote Physiological Measurements. 7278-7293 - Lingyun Song
, Siyu Chen
, Ziyang Meng
, Mingxuan Sun
, Xuequn Shang
:
FMSA-SC: A Fine-Grained Multimodal Sentiment Analysis Dataset Based on Stock Comment Videos. 7294-7306 - Xu Lu, Li Liu, Lixin Ning, Liang Zhang, Shaomin Mu, Huaxiang Zhang:
Multi-Facet Weighted Asymmetric Multi-Modal Hashing Based on Latent Semantic Distribution. 7307-7320 - Anqi Liu
, Sumei Li
, Yongli Chang
, Wenlin Zhang
, Yonghong Hou
:
Coarse-to-Fine Cross-View Interaction Based Accurate Stereo Image Super-Resolution Network. 7321-7334 - Ludan Sun
, Kai Zhang
, Feng Zhang
, Wenbo Wan
, Jiande Sun
:
Deep Rank-N Decomposition Network for Image Fusion. 7335-7348 - Jiwei Wei
, Yang Yang
, Xiang Guan
, Xing Xu
, Guoqing Wang
, Heng Tao Shen
:
Runge-Kutta Guided Feature Augmentation for Few-Sample Learning. 7349-7358 - Wenjie Zhu
, Bo Peng
, Wei Qi Yan
:
Dual Knowledge Distillation on Multiview Pseudo Labels for Unsupervised Person Re-Identification. 7359-7371 - Zhe Zhang
, Marc St-Hilaire
, Xin Wei
, Haiwei Dong
, Abdulmotaleb El-Saddik
:
How to Cache Important Contents for Multi-Modal Service in Dynamic Networks: A DRL-Based Caching Scheme. 7372-7385 - Lin Liu
, Junfeng An
, Shanxin Yuan
, Wengang Zhou
, Houqiang Li
, Yanfeng Wang
, Qi Tian
:
Video Demoiréing With Deep Temporal Color Embedding and Video-Image Invertible Consistency. 7386-7397 - Qiang Li
, Guang Zu
, Hui Xu
, Jun Kong
, Yanni Zhang
, Jianzhong Wang
:
An Adaptive Dual Selective Transformer for Temporal Action Localization. 7398-7412 - Wenhui Zhao
, Qin Li
, Huafu Xu, Quanxue Gao
, Qianqian Wang
, Xinbo Gao
:
Anchor Graph-Based Feature Selection for One-Step Multi-View Clustering. 7413-7425 - Huafeng Liu
, Mengmeng Sheng
, Zeren Sun
, Yazhou Yao
, Xian-Sheng Hua
, Heng Tao Shen
:
Learning With Imbalanced Noisy Data by Preventing Bias in Sample Selection. 7426-7437 - Jingru Duan
, Yanbin Hao
, Bin Zhu
, Lechao Cheng
, Pengyuan Zhou
, Xiang Wang
:
Efficient Unsupervised Video Hashing With Contextual Modeling and Structural Controlling. 7438-7450 - Shuo Yang
, Xinxiao Wu
, Zirui Shang
, Jiebo Luo
:
Dynamic Pathway for Query-Aware Feature Learning in Language-Driven Action Localization. 7451-7461 - Nan Gao
, Renyuan Yao
, Ronghua Liang
, Peng Chen
, Tianshuang Liu
, Yuanjie Dang
:
Multi-Level Objective Alignment Transformer for Fine-Grained Oral Panoramic X-Ray Report Generation. 7462-7474 - Zongyi Xu
, Xinqi Jiang
, Xinyu Gao, Rui Gao, Changjun Gu, Qianni Zhang
, Weisheng Li
, Xinbo Gao
:
IGReg: Image-Geometry-Assisted Point Cloud Registration via Selective Correlation Fusion. 7475-7489 - Junhu Wang
, Yanyan Wei
, Zhao Zhang
, Jicong Fan
, Yang Zhao
, Yi Yang, Meng Wang
:
Progressive Stereo Image Dehazing Network via Cross-View Region Interaction. 7490-7502 - Jiajia Xie
, Sheng Zhang
, Beihao Xia
, Zhu Xiao
, Hongbo Jiang
, Siwang Zhou
, Zheng Qin
, Hongyang Chen
:
Pedestrian Trajectory Prediction Based on Social Interactions Learning With Random Weights. 7503-7515 - Qingguo Liu
, Pan Gao
, Kang Han
, Ningzhong Liu, Wei Xiang
:
Degradation-Aware Self-Attention Based Transformer for Blind Image Super-Resolution. 7516-7528 - Haiqi Liu
, C. L. Philip Chen
, Xinrong Gong
, Tong Zhang
:
Robust Saliency-Aware Distillation for Few-Shot Fine-Grained Visual Recognition. 7529-7542 - Xin Zhou
, Chunyan Miao
:
Disentangled Graph Variational Auto-Encoder for Multimodal Recommendation With Interpretability. 7543-7554 - Shanmin Pang
, Yueyang Zeng
, Jiawei Zhao
, Jianru Xue
:
A Mutually Textual and Visual Refinement Network for Image-Text Matching. 7555-7566 - Jinyu Cai
, Yunhe Zhang
, Shiping Wang
, Jicong Fan
, Wenzhong Guo
:
Wasserstein Embedding Learning for Deep Clustering: A Generative Approach. 7567-7580 - Zhuang Shao
, Jungong Han
, Kurt Debattista
, Yanwei Pang
:
DCMSTRD: End-to-end Dense Captioning via Multi-Scale Transformer Decoding. 7581-7593 - Yixuan Zhu
, Wenliang Zhao
, Yansong Tang
, Yongming Rao
, Jie Zhou
, Jiwen Lu
:
StableSwap: Stable Face Swapping in a Shared and Controllable Latent Space. 7594-7607 - Yifan Wang
, Liyuan Liu
, Chun Yuan
, Minbo Li
, Jing Liu
:
Negative-Sensitive Framework With Semantic Enhancement for Composed Image Retrieval. 7608-7621 - Ruohao Guo
, Xianghua Ying
, Yanyu Qi
, Liao Qu
:
UniTR: A Unified TRansformer-Based Framework for Co-Object and Multi-Modal Saliency Detection. 7622-7635 - Xi Luo
, Min Jiang
, Jun Kong
, Xuefeng Tao
:
Hierarchical Camera-Aware Contrast Extension for Unsupervised Person Re-Identification. 7636-7648 - Shuang Chen
, Amir Atapour-Abarghouei
, Hubert P. H. Shum
:
HINT: High-Quality INpainting Transformer With Mask-Aware Encoding and Enhanced Attention. 7649-7660 - Siduo Pan
, Ziqi Zhang
, Kun Wei
, Xu Yang
, Cheng Deng
:
Few-Shot Generative Model Adaptation via Style-Guided Prompt. 7661-7672 - Zhongze Wang
, Haitao Zhao
, Lujian Yao
, Jingchao Peng
, Kaijie Zhao
:
DFR-Net: Density Feature Refinement Network for Image Dehazing Utilizing Haze Density Difference. 7673-7686 - Jinkun You
, Yicong Zhou
:
Two-Stage Watermark Removal Framework for Spread Spectrum Watermarking. 7687-7699 - Xiaokun Li
, Rumeng Yi
, Yaping Huang
:
Mutual Filter Teaching for Open-Set Semi-Supervised Learning. 7700-7708 - Yuntong Tian
, Jiaxi Li
, Huazhu Fu
, Lei Zhu
, Lequan Yu
, Liang Wan
:
Self-Mining the Confident Prototypes for Source-Free Unsupervised Domain Adaptation in Image Segmentation. 7709-7720 - Yutao Liu
, Baochao Zhang
, Runze Hu
, Ke Gu
, Guangtao Zhai
, Junyu Dong
:
Underwater Image Quality Assessment: Benchmark Database and Objective Method. 7734-7747 - Shiyuan He
, Jiwei Wei
, Chaoning Zhang
, Xing Xu
, Jingkuan Song
, Yang Yang
, Heng Tao Shen
:
Boosting Adversarial Training with Hardness-Guided Attack Strategy. 7748-7760 - Haochen Han
, Qinghua Zheng
, Minnan Luo
, Kaiyao Miao
, Feng Tian
, Yan Chen
:
Noise-Tolerant Learning for Audio-Visual Action Recognition. 7761-7774 - Ardian Umam
, Cheng-Kun Yang
, Jen-Hui Chuang
, Yen-Yu Lin
:
Unsupervised Point Cloud Co-Part Segmentation via Co-Attended Superpoint Generation and Aggregation. 7775-7786 - Zhuangzhuang Zhou
, Yingying Zhu
:
RaFPN: Relation-Aware Feature Pyramid Network for Dense Image Prediction. 7787-7800 - Mingqi Shao
, Chongkun Xia
, Dongxu Duan
, Xueqian Wang
:
Polarimetric Inverse Rendering for Transparent Shapes Reconstruction. 7801-7811 - Nengzhong Yin
, Chengxu Liu
, Ruhao Tian
, Xueming Qian
:
SDPDet: Learning Scale-Separated Dynamic Proposals for End-to-End Drone-View Detection. 7812-7822 - Junjie Zhang
, Yutao Rao
, Xiaoshui Huang
, Guanyi Li
, Xin Zhou
, Dan Zeng
:
Frequency-Aware Multi-Modal Fine-Tuning for Few-Shot Open-Set Remote Sensing Scene Classification. 7823-7837 - Jingchun Zhou
, Shiyin Wang
, Zifan Lin
, Qiuping Jiang
, Ferdous Sohel
:
A Pixel Distribution Remapping and Multi-Prior Retinex Variational Model for Underwater Image Enhancement. 7838-7849 - Xiaowen Wang
, Lanjun Wang
, Yuting Su, Yongdong Zhang
, An-An Liu
:
MCDAN: A Multi-Scale Context-Enhanced Dynamic Attention Network for Diffusion Prediction. 7850-7862 - Wufei Ma
, Jiahao Li
, Bin Li
, Yan Lu
:
Uncertainty-Aware Deep Video Compression With Ensembles. 7863-7872 - Deebha Mumtaz
, Sadbhawna
, Vinit Jakhetiya
, Badri N. Subudhi
, Weisi Lin
:
Non-Subsampled Contourlet Transform and Ground-Truth Score Generation Based Quality Assessment for DIBR-Synthesized Views. 7873-7886 - Quan Zhou
, Linjie Wang
, Guangwei Gao
, Bin Kang
, Weihua Ou
, Huimin Lu
:
Boundary-Guided Lightweight Semantic Segmentation With Multi-Scale Semantic Context. 7887-7900 - Jianping Gou
, Yu Chen
, Baosheng Yu
, Jinhua Liu
, Lan Du
, Shaohua Wan
, Zhang Yi
:
Reciprocal Teacher-Student Learning via Forward and Feedback Knowledge Distillation. 7901-7916 - Shaohua Teng
, Jiangbo Li
, Luyao Teng
, Lunke Fei
, Naiqi Wu
, Wei Zhang:
Scalable Discrete and Asymmetric Unequal Length Hashing Learning for Cross-Modal Retrieval. 7917-7932 - Zhuopan Yang
, Zhenguo Yang
, Xiaoping Li
, Yi Yu
, Qing Li
, Wenyin Liu
:
A Progressive Placeholder Learning Network for Multimodal Zero-Shot Learning. 7933-7945 - Yaoqian Zhao
, Qizhi Teng
, Honggang Chen
, Shujiang Zhang
, Xiaohai He
, Yi Li
, Ray E. Sheriff
:
Activating More Information in Arbitrary-Scale Image Super-Resolution. 7946-7961 - Ge Song
, Kai Huang
, Hanwen Su
, Fengyi Song
, Ming Yang
:
Deep Ranking Distribution Preserving Hashing for Robust Multi-Label Cross-Modal Retrieval. 7027-7042 - Qihao Liang
, Ye Wang
:
Drawlody: Sketch-Based Melody Creation With Enhanced Usability and Interpretability. 7074-7088 - Haoyu Wang
, Yuhu Cheng
, Xiaomin Liu
, Xuesong Wang
:
Reinforcement Learning Based Markov Edge Decoupled Fusion Network for Fusion Classification of Hyperspectral and LiDAR. 7174-7187 - Jiachen Yang
, Shukun Ma
, Zhuo Zhang
, Yang Li
, Shuai Xiao
, Jiabao Wen
, Wen Lu
, Xinbo Gao
:
Say No to Redundant Information: Unsupervised Redundant Feature Elimination for Active Learning. 7721-7733 - Yutong Gao
, Congyan Lang
, Fayao Liu
, Yuanzhouhan Cao
, Lijuan Sun, Yunchao Wei
:
Dynamic Interaction Dilation for Interactive Human Parsing. 178-189 - Ziwei Niu
, Junkun Yuan
, Xu Ma
, Yingying Xu
, Jing Liu
, Yen-Wei Chen
, Ruofeng Tong
, Lanfen Lin
:
Knowledge Distillation-Based Domain-Invariant Representation Learning for Domain Generalization. 245-255