default search action
Yong Man Ro
Person information
- affiliation: Korea Advanced Institute of Science and Technology, School of Electrical Engineering, Image and Video Systems Lab, Daejeon, South Korea
- affiliation (former): Information and Communications University, Yusong, South Korea
- affiliation (PhD 1992): Korea Advanced Institute of Science and Technology, Daejeon, South Korea
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j97]Sungjune Park, Hyunjun Kim, Yong Man Ro:
Robust pedestrian detection via constructing versatile pedestrian knowledge bank. Pattern Recognit. 153: 110539 (2024) - [j96]Sangmin Lee, Hyung-Il Kim, Yong Man Ro:
Text-guided distillation learning to diversify video embeddings for text-video retrieval. Pattern Recognit. 156: 110754 (2024) - [j95]Jeong Hun Yeo, Minsu Kim, Jeongsoo Choi, Dae Hoe Kim, Yong Man Ro:
AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model. IEEE Trans. Multim. 26: 6462-6474 (2024) - [j94]Hong Joo Lee, Youngjoon Yu, Yong Man Ro:
Advancing Adversarial Training by Injecting Booster Signal. IEEE Trans. Neural Networks Learn. Syst. 35(9): 12665-12677 (2024) - [c251]Seongyeop Kim, Hyung-Il Kim, Yong Man Ro:
Improving Open Set Recognition via Visual Prompts Distilled from Common-Sense Knowledge. AAAI 2024: 2786-2794 - [c250]Byung-Kwan Lee, Beomchan Park, Chae Won Kim, Yong Man Ro:
CoLLaVO: Crayon Large Language and Vision mOdel. ACL (Findings) 2024: 1121-1138 - [c249]Se Jin Park, Chae Won Kim, Hyeongseop Rha, Minsu Kim, Joanna Hong, Jeong Hun Yeo, Yong Man Ro:
Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation. ACL (1) 2024: 16334-16348 - [c248]Pai Chet Ng, Zhixiang Chi, Malcolm Low, Juwei Lu, Konstantinos N. Plataniotis, Nikolaos V. Boulgouris, Thirimachos Bourlai, Yong Man Ro:
Hyperspectral Skin Vision Challenge: Can Your Camera See Beyond Your Skin? ICASSP Workshops 2024: 59-60 - [c247]Se Jin Park, Minsu Kim, Jeongsoo Choi, Yong Man Ro:
Exploring Phonetic Context-Aware Lip-Sync for Talking Face Generation. ICASSP 2024: 4325-4329 - [c246]Minsu Kim, Jeongsoo Choi, Soumi Maiti, Jeong Hun Yeo, Shinji Watanabe, Yong Man Ro:
Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-Training and Multi-Modal Tokens. ICASSP 2024: 7970-7974 - [c245]Jeongsoo Choi, Minsu Kim, Se Jin Park, Yong Man Ro:
Text-Driven Talking Face Synthesis by Reprogramming Audio-Driven Models. ICASSP 2024: 8065-8069 - [c244]Jeong Hun Yeo, Minsu Kim, Shinji Watanabe, Yong Man Ro:
Visual Speech Recognition for Languages with Limited Labeled Data Using Automatic Labels from Whisper. ICASSP 2024: 10471-10475 - [c243]Seunghee Han, Se Jin Park, Chae Won Kim, Yong Man Ro:
Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation. ICASSP 2024: 11321-11325 - [i73]Minsu Kim, Jeong Hun Yeo, Jeongsoo Choi, Se Jin Park, Yong Man Ro:
Multilingual Visual Speech Recognition with a Single Model by Learning with Discrete Visual Speech Units. CoRR abs/2401.09802 (2024) - [i72]Byung-Kwan Lee, Beomchan Park, Chae Won Kim, Yong Man Ro:
CoLLaVO: Crayon Large Language and Vision mOdel. CoRR abs/2402.11248 (2024) - [i71]Jeong Hun Yeo, Seunghee Han, Minsu Kim, Yong Man Ro:
Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing. CoRR abs/2402.15151 (2024) - [i70]Minsu Kim, Jee-weon Jung, Hyeongseop Rha, Soumi Maiti, Siddhant Arora, Xuankai Chang, Shinji Watanabe, Yong Man Ro:
TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages. CoRR abs/2402.16021 (2024) - [i69]Taeheon Kim, Sebin Shin, Youngjoon Yu, Hak Gu Kim, Yong Man Ro:
Causal Mode Multiplexer: A Novel Framework for Unbiased Multispectral Pedestrian Detection. CoRR abs/2403.01300 (2024) - [i68]Seunghee Han, Se Jin Park, Chae Won Kim, Yong Man Ro:
Persona Extraction Through Semantic Similarity for Emotional Support Conversation Generation. CoRR abs/2403.04212 (2024) - [i67]Byung-Kwan Lee, Beomchan Park, Chae Won Kim, Yong Man Ro:
MoAI: Mixture of All Intelligence for Large Language and Vision Models. CoRR abs/2403.07508 (2024) - [i66]Junho Kim, Yeonju Kim, Yong Man Ro:
What if...?: Counterfactual Inception to Mitigate Hallucination Effects in Large Multimodal Models. CoRR abs/2403.13513 (2024) - [i65]Taeheon Kim, Sangyun Chung, Damin Yeom, Youngjoon Yu, Hak Gu Kim, Yong Man Ro:
MSCoTDet: Language-driven Multi-modal Fusion for Improved Multispectral Pedestrian Detection. CoRR abs/2403.15209 (2024) - [i64]Sungjune Park, Hyunjun Kim, Yong Man Ro:
Robust Pedestrian Detection via Constructing Versatile Pedestrian Knowledge Bank. CoRR abs/2404.19299 (2024) - [i63]Byung-Kwan Lee, Chae Won Kim, Beomchan Park, Yong Man Ro:
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models. CoRR abs/2405.15574 (2024) - [i62]Junho Kim, Hyunjun Kim, Yeonju Kim, Yong Man Ro:
CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models. CoRR abs/2406.01920 (2024) - [i61]Se Jin Park, Chae Won Kim, Hyeongseop Rha, Minsu Kim, Joanna Hong, Jeong Hun Yeo, Yong Man Ro:
Let's Go Real Talk: Spoken Dialogue Model for Face-to-Face Conversation. CoRR abs/2406.07867 (2024) - [i60]Byung-Kwan Lee, Sangyun Chung, Chae Won Kim, Beomchan Park, Yong Man Ro:
TroL: Traversal of Layers for Large Language and Vision Models. CoRR abs/2406.12246 (2024) - 2023
- [j93]Hakmin Lee, Yong Man Ro:
Adversarial anchor-guided feature refinement for adversarial defense. Image Vis. Comput. 136: 104722 (2023) - [j92]Hong Joo Lee, Yong Man Ro:
Robust Proxy: Improving Adversarial Robustness by Robust Proxy Learning. IEEE Trans. Inf. Forensics Secur. 18: 4021-4033 (2023) - [j91]Jung Uk Kim, Hyung-Il Kim, Yong Man Ro:
Stereoscopic Vision Recalling Memory for Monocular 3D Object Detection. IEEE Trans. Image Process. 32: 2749-2760 (2023) - [c242]Taeheon Kim, Youngjoon Yu, Yong Man Ro:
Multispectral Invisible Coating: Laminated Visible-Thermal Physical Attack against Multispectral Object Detectors Using Transparent Low-E Films. AAAI 2023: 1151-1159 - [c241]Minsu Kim, Chae Won Kim, Yong Man Ro:
Deep Visual Forced Alignment: Learning to Align Transcription with Talking Face Video. AAAI 2023: 8273-8281 - [c240]Junho Kim, Byung-Kwan Lee, Yong Man Ro:
Demystifying Causal Features on Adversarial Examples and Causal Inoculation for Robust Network by Adversarial Instrumental Variable Regression. CVPR 2023: 12032-12042 - [c239]Joanna Hong, Minsu Kim, Jeongsoo Choi, Yong Man Ro:
Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring. CVPR 2023: 18783-18794 - [c238]Joanna Hong, Se Jin Park, Yong Man Ro:
Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model. EMNLP (Findings) 2023: 4886-4890 - [c237]Minsu Kim, Joanna Hong, Yong Man Ro:
Lip-to-Speech Synthesis in the Wild with Multi-Task Learning. ICASSP 2023: 1-5 - [c236]Jung Uk Kim, Yong Man Ro:
Similarity Relation Preserving Cross-Modal Learning for Multispectral Pedestrian Detection Against Adversarial Attacks. ICASSP 2023: 1-5 - [c235]Jeong Hun Yeo, Minsu Kim, Yong Man Ro:
Multi-Temporal Lip-Audio Memory for Visual Speech Recognition. ICASSP 2023: 1-5 - [c234]Byung-Kwan Lee, Junho Kim, Yong Man Ro:
Mitigating Adversarial Vulnerability through Causal Parameter Estimation by Adversarial Double Machine Learning. ICCV 2023: 4476-4486 - [c233]Jeongsoo Choi, Joanna Hong, Yong Man Ro:
DiffV2S: Diffusion-based Video-to-Speech Synthesis with Vision-guided Speaker Embedding. ICCV 2023: 7778-7787 - [c232]Minsu Kim, Jeong Hun Yeo, Jeongsoo Choi, Yong Man Ro:
Lip Reading for Low-resource Languages by Learning and Combining General Speech Knowledge and Language-specific Knowledge. ICCV 2023: 15313-15325 - [c231]Yeonju Kim, Junho Kim, Byung-Kwan Lee, Sebin Shin, Yong Man Ro:
Mitigating Dataset Bias in Image Captioning Through Clip Confounder-Free Captioning Network. ICIP 2023: 1720-1724 - [c230]Sungjune Park, Jung Uk Kim, Jin Mo Song, Yong Man Ro:
Robust Multispectral Pedestrian Detection Via Spectral Position-Free Feature Mapping. ICIP 2023: 1795-1799 - [c229]Jeongsoo Choi, Minsu Kim, Yong Man Ro:
Intelligible Lip-to-Speech Synthesis with Speech Units. INTERSPEECH 2023: 4349-4353 - [i59]Minsu Kim, Hyung-Il Kim, Yong Man Ro:
Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech Recognition. CoRR abs/2302.08102 (2023) - [i58]Minsu Kim, Joanna Hong, Yong Man Ro:
Lip-to-Speech Synthesis in the Wild with Multi-task Learning. CoRR abs/2302.08841 (2023) - [i57]Junho Kim, Byung-Kwan Lee, Yong Man Ro:
Demystifying Causal Features on Adversarial Examples and Causal Inoculation for Robust Network by Adversarial Instrumental Variable Regression. CoRR abs/2303.01052 (2023) - [i56]Joanna Hong, Minsu Kim, Jeongsoo Choi, Yong Man Ro:
Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scoring. CoRR abs/2303.08536 (2023) - [i55]Minsu Kim, Chae Won Kim, Yong Man Ro:
Deep Visual Forced Alignment: Learning to Align Transcription with Talking Face Video. CoRR abs/2303.08670 (2023) - [i54]Jeong Hun Yeo, Minsu Kim, Yong Man Ro:
Multi-Temporal Lip-Audio Memory for Visual Speech Recognition. CoRR abs/2305.04542 (2023) - [i53]Se Jin Park, Minsu Kim, Jeongsoo Choi, Yong Man Ro:
Exploring Phonetic Context in Lip Movement for Authentic Talking Face Generation. CoRR abs/2305.19556 (2023) - [i52]Jeongsoo Choi, Minsu Kim, Yong Man Ro:
Intelligible Lip-to-Speech Synthesis with Speech Units. CoRR abs/2305.19603 (2023) - [i51]Hong Joo Lee, Youngjoon Yu, Yong Man Ro:
Advancing Adversarial Training by Injecting Booster Signal. CoRR abs/2306.15451 (2023) - [i50]Hong Joo Lee, Yong Man Ro:
Robust Proxy: Improving Adversarial Robustness by Robust Proxy Learning. CoRR abs/2306.15457 (2023) - [i49]Jeongsoo Choi, Minsu Kim, Se Jin Park, Yong Man Ro:
Reprogramming Audio-driven Talking Face Synthesis into Text-driven. CoRR abs/2306.16003 (2023) - [i48]Byung-Kwan Lee, Junho Kim, Yong Man Ro:
Mitigating Adversarial Vulnerability through Causal Parameter Estimation by Adversarial Double Machine Learning. CoRR abs/2307.07250 (2023) - [i47]Minsu Kim, Jeongsoo Choi, Dahun Kim, Yong Man Ro:
Many-to-Many Spoken Language Translation via Unified Speech and Text Representation Learning with Unit-to-Unit Translation. CoRR abs/2308.01831 (2023) - [i46]Jeong Hun Yeo, Minsu Kim, Jeongsoo Choi, Dae Hoe Kim, Yong Man Ro:
AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model. CoRR abs/2308.07593 (2023) - [i45]Jeongsoo Choi, Joanna Hong, Yong Man Ro:
DiffV2S: Diffusion-based Video-to-Speech Synthesis with Vision-guided Speaker Embedding. CoRR abs/2308.07787 (2023) - [i44]Minsu Kim, Jeong Hun Yeo, Jeongsoo Choi, Yong Man Ro:
Lip Reading for Low-resource Languages by Learning and Combining General Speech Knowledge and Language-specific Knowledge. CoRR abs/2308.09311 (2023) - [i43]Minsu Kim, Jeongsoo Choi, Soumi Maiti, Jeong Hun Yeo, Shinji Watanabe, Yong Man Ro:
Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal Tokens. CoRR abs/2309.08531 (2023) - [i42]Jeong Hun Yeo, Minsu Kim, Shinji Watanabe, Yong Man Ro:
Visual Speech Recognition for Low-resource Languages with Automatic Labels From Whisper Model. CoRR abs/2309.08535 (2023) - [i41]Se Jin Park, Joanna Hong, Minsu Kim, Yong Man Ro:
DF-3DFace: One-to-Many Speech Synchronized 3D Face Animation with Diffusion. CoRR abs/2310.05934 (2023) - [i40]Junho Kim, Byung-Kwan Lee, Yong Man Ro:
Causal Unsupervised Semantic Segmentation. CoRR abs/2310.07379 (2023) - [i39]Joanna Hong, Se Jin Park, Yong Man Ro:
Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model. CoRR abs/2310.14946 (2023) - [i38]Sungjune Park, Hyunjun Kim, Yong Man Ro:
Incorporating Language-Driven Appearance Knowledge Units with Visual Cues in Pedestrian Detection. CoRR abs/2311.01025 (2023) - [i37]Jeongsoo Choi, Se Jin Park, Minsu Kim, Yong Man Ro:
AV2AV: Direct Audio-Visual Speech to Audio-Visual Speech Translation with Unified Audio-Visual Speech Representation. CoRR abs/2312.02512 (2023) - 2022
- [j90]Wissam J. Baddar, Sangmin Lee, Yong Man Ro:
On-the-Fly Facial Expression Prediction Using LSTM Encoded Appearance-Suppressed Dynamics. IEEE Trans. Affect. Comput. 13(1): 159-174 (2022) - [j89]Hyung-Il Kim, Kimin Yun, Yong Man Ro:
Face Shape-Guided Deep Feature Alignment for Face Recognition Robust to Face Misalignment. IEEE Trans. Biom. Behav. Identity Sci. 4(4): 556-569 (2022) - [j88]Jung Uk Kim, Sungjune Park, Yong Man Ro:
Uncertainty-Guided Cross-Modal Learning for Robust Multispectral Pedestrian Detection. IEEE Trans. Circuits Syst. Video Technol. 32(3): 1510-1523 (2022) - [j87]Sangmin Lee, Seongyeop Kim, Hak Gu Kim, Yong Man Ro:
Assessing Individual VR Sickness Through Deep Feature Fusion of VR Video and Physiological Response. IEEE Trans. Circuits Syst. Video Technol. 32(5): 2895-2907 (2022) - [j86]Junho Kim, Seongyeop Kim, Seong Tae Kim, Yong Man Ro:
Robust Perturbation for Visual Explanation: Cross-Checking Mask Optimization to Avoid Class Distortion. IEEE Trans. Image Process. 31: 301-313 (2022) - [j85]Youngjoon Yu, Hong Joo Lee, Hakmin Lee, Yong Man Ro:
Defending Person Detection Against Adversarial Patch Attack by Using Universal Defensive Frame. IEEE Trans. Image Process. 31: 6976-6990 (2022) - [j84]Minsu Kim, Joanna Hong, Se Jin Park, Yong Man Ro:
CroMM-VSR: Cross-Modal Memory Augmented Visual Speech Recognition. IEEE Trans. Multim. 24: 4342-4355 (2022) - [c228]Jung Uk Kim, Sungjune Park, Yong Man Ro:
Towards Versatile Pedestrian Detector with Multisensory-Matching and Multispectral Recalling Memory. AAAI 2022: 1157-1165 - [c227]Minsu Kim, Jeong Hun Yeo, Yong Man Ro:
Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading. AAAI 2022: 1174-1182 - [c226]Se Jin Park, Minsu Kim, Joanna Hong, Jeongsoo Choi, Yong Man Ro:
SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory. AAAI 2022: 2062-2070 - [c225]Sangmin Lee, Hyung-Il Kim, Yong Man Ro:
Weakly Paired Associative Learning for Sound and Image Representations via Bimodal Associative Memory. CVPR 2022: 10524-10533 - [c224]Byung-Kwan Lee, Junho Kim, Yong Man Ro:
Masking Adversarial Damage: Finding Adversarial Saliency for Robust and Sparse Network. CVPR 2022: 15105-15115 - [c223]Joanna Hong, Minsu Kim, Yong Man Ro:
VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection. ECCV (36) 2022: 452-468 - [c222]Sangmin Lee, Sungjune Park, Yong Man Ro:
Audio-Visual Mismatch-Aware Video Retrieval via Association and Adjustment. ECCV (14) 2022: 497-514 - [c221]Minsu Kim, Hyunjun Kim, Yong Man Ro:
Speaker-Adaptive Lip Reading with User-Dependent Padding. ECCV (36) 2022: 576-593 - [c220]Sungjune Park, Dae Hwi Choi, Jung Uk Kim, Yong Man Ro:
Robust Thermal Infrared Pedestrian Detection By Associating Visible Pedestrian Knowledge. ICASSP 2022: 4468-4472 - [c219]Taeheon Kim, Hong Joo Lee, Yong Man Ro:
Map: Multispectral Adversarial Patch to Attack Person Detection. ICASSP 2022: 4853-4857 - [c218]Joanna Hong, Minsu Kim, Daehun Yoo, Yong Man Ro:
Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition. INTERSPEECH 2022: 2838-2842 - [c217]Taeheon Kim, Youngjoon Yu, Yong Man Ro:
Defending Physical Adversarial Attack on Object Detection via Adversarial Patch-Feature Energy. ACM Multimedia 2022: 1905-1913 - [c216]Sangmin Lee, Sungjune Park, Yong Man Ro:
IVIST: Interactive Video Search Tool in VBS 2022. MMM (2) 2022: 524-529 - [i36]Minsu Kim, Joanna Hong, Se Jin Park, Yong Man Ro:
Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video. CoRR abs/2204.01265 (2022) - [i35]Minsu Kim, Jeong Hun Yeo, Yong Man Ro:
Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading. CoRR abs/2204.01725 (2022) - [i34]Minsu Kim, Joanna Hong, Yong Man Ro:
Lip to Speech Synthesis with Visual Context Attentional GAN. CoRR abs/2204.01726 (2022) - [i33]Junho Kim, Byung-Kwan Lee, Yong Man Ro:
Distilling Robust and Non-Robust Features in Adversarial Examples by Information Bottleneck. CoRR abs/2204.02735 (2022) - [i32]Byung-Kwan Lee, Junho Kim, Yong Man Ro:
Masking Adversarial Damage: Finding Adversarial Saliency for Robust and Sparse Network. CoRR abs/2204.02738 (2022) - [i31]Youngjoon Yu, Hong Joo Lee, Hakmin Lee, Yong Man Ro:
Defending Against Person Hiding Adversarial Patch Attack with a Universal White Frame. CoRR abs/2204.13004 (2022) - [i30]Joanna Hong, Minsu Kim, Yong Man Ro:
VisageSynTalk: Unseen Speaker Video-to-Speech Synthesis via Speech-Visage Feature Selection. CoRR abs/2206.07458 (2022) - [i29]Joanna Hong, Minsu Kim, Daehun Yoo, Yong Man Ro:
Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition. CoRR abs/2207.06020 (2022) - [i28]Minsu Kim, Hyunjun Kim, Yong Man Ro:
Speaker-adaptive Lip Reading with User-dependent Padding. CoRR abs/2208.04498 (2022) - [i27]Hyung-Il Kim, Kimin Yun, Yong Man Ro:
Face Shape-Guided Deep Feature Alignment for Face Recognition Robust to Face Misalignment. CoRR abs/2209.07220 (2022) - [i26]Minsu Kim, Youngjoon Yu, Sungjune Park, Yong Man Ro:
Meta Input: How to Leverage Off-the-Shelf Deep Neural Networks. CoRR abs/2210.13186 (2022) - [i25]Se Jin Park, Minsu Kim, Joanna Hong, Jeongsoo Choi, Yong Man Ro:
SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory. CoRR abs/2211.00924 (2022) - 2021
- [j83]Sungjune Park, Hong Joo Lee, Yong Man Ro:
Adversarially Robust Hyperspectral Image Classification via Random Spectral Sampling and Spectral Shape Encoding. IEEE Access 9: 66791-66804 (2021) - [j82]Joanna Hong, Minsu Kim, Se Jin Park, Yong Man Ro:
Speech Reconstruction With Reminiscent Sound Via Visual Voice Memory. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3654-3667 (2021) - [j81]Minho Park, Hak Gu Kim, Sangmin Lee, Yong Man Ro:
Robust Video Frame Interpolation With Exceptional Motion Map. IEEE Trans. Circuits Syst. Video Technol. 31(2): 754-764 (2021) - [j80]Jung Uk Kim, Seong-Tae Kim, Hong Joo Lee, Sangmin Lee, Yong Man Ro:
CUA Loss: Class Uncertainty-Aware Gradient Modulation for Robust Object Detection. IEEE Trans. Circuits Syst. Video Technol. 31(9): 3529-3543 (2021) - [c215]Hak Gu Kim, Sangmin Lee, Seongyeop Kim, Heoun-taek Lim, Yong Man Ro:
Towards a Better Understanding of VR Sickness: Physical Symptom Prediction for VR Contents. AAAI 2021: 836-844 - [c214]Hak Gu Kim, Minho Park, Sangmin Lee, Seongyeop Kim, Yong Man Ro:
Visual Comfort Aware-Reinforcement Learning for Depth Adjustment of Stereoscopic 3D Images. AAAI 2021: 1762-1770 - [c213]Seongyeop Kim, Yong Man Ro:
M-CAM: Visual Explanation of Challenging Conditioned Dataset with Bias-reducing Memory. BMVC 2021: 87 - [c212]Sangmin Lee, Hak Gu Kim, Dae Hwi Choi, Hyung-Il Kim, Yong Man Ro:
Video Prediction Recalling Long-Term Motion Context via Memory Alignment Learning. CVPR 2021: 3054-3063 - [c211]Youngjoon Yu, Hong Joo Lee, Byeong Cheon Kim, Jung Uk Kim, Yong Man Ro:
Towards Robust Training of Multi-Sensor Data Fusion Network Against Adversarial Examples in Semantic Segmentation. ICASSP 2021: 4710-4714 - [c210]Minsu Kim, Joanna Hong, Se Jin Park, Yong Man Ro:
Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video. ICCV 2021: 296-306 - [c209]Jung Uk Kim, Sungjune Park, Yong Man Ro:
Robust Small-scale Pedestrian Detection with Cued Recall via Memory Learning. ICCV 2021: 3030-3039 - [c208]Junho Kim, Minsu Kim, Yong Man Ro:
Interpretation of Lesional Detection via Counterfactual Generation. ICIP 2021: 96-100 - [c207]Hong Joo Lee, Yong Man Ro:
Adversarially Robust Multi-Sensor Fusion Model Training Via Random Feature Fusion For Semantic Segmentation. ICIP 2021: 339-343 - [c206]Byeong Cheon Kim, Youngjoon Yu, Yong Man Ro:
Robust Decision-Based Black-Box Adversarial Attack via Coarse-To-Fine Random Search. ICIP 2021: 3048-3052 - [c205]Sungjune Park, Jung Uk Kim, Yeongyun Kim, Sang-Keun Moon, Yong Man Ro:
Robust Multispectral Pedestrian Detection via Uncertainty-Aware Cross-Modal Learning. MMM (1) 2021: 391-402 - [c204]Yoonho Lee, Heeju Choi, Sungjune Park, Yong Man Ro:
IVIST: Interactive Video Search Tool in VBS 2021. MMM (2) 2021: 423-428 - [c203]Minsu Kim, Joanna Hong, Yong Man Ro:
Lip to Speech Synthesis with Visual Context Attentional GAN. NeurIPS 2021: 2758-2770 - [c202]Junho Kim, Byung-Kwan Lee, Yong Man Ro:
Distilling Robust and Non-Robust Features in Adversarial Examples by Information Bottleneck. NeurIPS 2021: 17148-17159 - [i24]Sangmin Lee, Hak Gu Kim, Dae Hwi Choi, Hyung-Il Kim, Yong Man Ro:
Video Prediction Recalling Long-term Motion Context via Memory Alignment Learning. CoRR abs/2104.00924 (2021) - [i23]Hak Gu Kim, Sangmin Lee, Seongyeop Kim, Heoun-taek Lim, Yong Man Ro:
Towards a Better Understanding of VR Sickness: Physical Symptom Prediction for VR Contents. CoRR abs/2104.06780 (2021) - [i22]