


Mayu Otani
2020 – today
- 2025
- [i32] Zongshang Pang, Mayu Otani, Yuta Nakashima: Measure Twice, Cut Once: Grasping Video Structures and Event Semantics with LLMs for Video Temporal Localization. CoRR abs/2503.09027 (2025)
- 2024
- [j10] Zongshang Pang, Yuta Nakashima, Mayu Otani, Hajime Nagahara: Unleashing the Power of Contrastive Learning for Zero-Shot Video Summarization. J. Imaging 10(9): 229 (2024)
- [c34] Tianwei Chen, Yusuke Hirota, Mayu Otani, Noa Garcia, Yuta Nakashima: Would Deep Generative Models Amplify Bias in Future Models? CVPR 2024: 10833-10843
- [c33] Antonio Tejero-de-Pablos, Riku Togashi, Mayu Otani, Shin'ichi Satoh: Robust Nearest Neighbors for Source-Free Domain Adaptation Under Class Distribution Shift. ECCV (72) 2024: 1-17
- [c32] Julian Jorge Andrade Guerreiro, Naoto Inoue, Kento Masui, Mayu Otani, Hideki Nakayama: LayoutFlow: Flow Matching for Layout Generation. ECCV (36) 2024: 56-72
- [c31] Zongshang Pang, Yuta Nakashima, Mayu Otani, Hajime Nagahara: Revisiting Pixel-Level Contrastive Pre-Training on Scene Images. WACV 2024: 1773-1782
- [i31] Julian Jorge Andrade Guerreiro, Naoto Inoue, Kento Masui, Mayu Otani, Hideki Nakayama: LayoutFlow: Flow Matching for Layout Generation. CoRR abs/2403.18187 (2024)
- [i30] Tianwei Chen, Yusuke Hirota, Mayu Otani, Noa Garcia, Yuta Nakashima: Would Deep Generative Models Amplify Bias in Future Models? CoRR abs/2404.03242 (2024)
- [i29] Mayu Otani, Naoto Inoue, Kotaro Kikuchi, Riku Togashi: LTSim: Layout Transportation-based Similarity Measure for Evaluating Layout Generation. CoRR abs/2407.12356 (2024)
- [i28] Kotaro Kikuchi, Naoto Inoue, Mayu Otani, Edgar Simo-Serra, Kota Yamaguchi: Multimodal Markup Document Models for Graphic Design Completion. CoRR abs/2409.19051 (2024)
- [i27] Kento Masui, Mayu Otani, Masahiro Nomura, Hideki Nakayama: Harnessing the Latent Diffusion Model for Training-Free Image Style Transfer. CoRR abs/2410.01366 (2024)
- 2023
- [c30] Naoto Inoue, Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi: LayoutDM: Discrete Diffusion Model for Controllable Layout Generation. CVPR 2023: 10167-10176
- [c29] Mayu Otani, Riku Togashi, Yu Sawai, Ryosuke Ishigami, Yuta Nakashima, Esa Rahtu, Janne Heikkilä, Shin'ichi Satoh: Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation. CVPR 2023: 14277-14286
- [c28] Naoto Inoue, Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi: Towards Flexible Multi-modal Document Models. CVPR 2023: 14287-14296
- [c27] Daichi Haraguchi, Mayu Otani: Coarse-to-fine font recommendation for banner designs. IUI Companion 2023: 148-150
- [c26] Qianru Qiu, Xueting Wang, Mayu Otani: Multimodal Color Recommendation in Vector Graphic Documents. ACM Multimedia 2023: 4003-4011
- [c25] Zongshang Pang, Yuta Nakashima, Mayu Otani, Hajime Nagahara: Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization. WACV 2023: 2009-2018
- [c24] Qianru Qiu, Xueting Wang, Mayu Otani, Yuki Iwazaki: Color Recommendation for Vector Graphic Documents based on Multi-Palette Representation. WACV 2023: 3610-3618
- [c23] Kotaro Kikuchi, Naoto Inoue, Mayu Otani, Edgar Simo-Serra, Kota Yamaguchi: Generative Colorization of Structured Mobile Web Pages. WACV 2023: 3639-3648
- [i26] Naoto Inoue, Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi: LayoutDM: Discrete Diffusion Model for Controllable Layout Generation. CoRR abs/2303.08137 (2023)
- [i25] Naoto Inoue, Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi: Towards Flexible Multi-modal Document Models. CoRR abs/2303.18248 (2023)
- [i24] Mayu Otani, Riku Togashi, Yu Sawai, Ryosuke Ishigami, Yuta Nakashima, Esa Rahtu, Janne Heikkilä, Shin'ichi Satoh: Toward Verifiable and Reproducible Human Evaluation for Text-to-Image Generation. CoRR abs/2304.01816 (2023)
- [i23] Qianru Qiu, Xueting Wang, Mayu Otani: Multimodal Color Recommendation in Vector Graphic Documents. CoRR abs/2308.04118 (2023)
- 2022
- [j9] Chenhui Chu, Vinícius Oliveira, Felix Giovanni Virgo, Mayu Otani, Noa Garcia, Yuta Nakashima: The semantic typology of visually grounded paraphrases. Comput. Vis. Image Underst. 215: 103333 (2022)
- [j8] Mayu Otani, Yale Song, Yang Wang: Video Summarization Overview. Found. Trends Comput. Graph. Vis. 13(4): 284-335 (2022)
- [c22] Yutaro Yamada, Mayu Otani: Does Robustness on ImageNet Transfer to Downstream Tasks? CVPR 2022: 9205-9214
- [c21] Riku Togashi, Mayu Otani, Yuta Nakashima, Esa Rahtu, Janne Heikkilä, Tetsuya Sakai: AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval. CVPR 2022: 21044-21053
- [c20] Mayu Otani, Riku Togashi, Yuta Nakashima, Esa Rahtu, Janne Heikkilä, Shin'ichi Satoh: Optimal Correction Cost for Object Detection Evaluation. CVPR 2022: 21075-21083
- [c19] Qianru Qiu, Mayu Otani, Yuki Iwazaki: An Intelligent Color Recommendation Tool for Landing Page Design. IUI Companion 2022: 26-29
- [i22] Mayu Otani, Riku Togashi, Yuta Nakashima, Esa Rahtu, Janne Heikkilä, Shin'ichi Satoh: Optimal Correction Cost for Object Detection Evaluation. CoRR abs/2203.14438 (2022)
- [i21] Riku Togashi, Mayu Otani, Yuta Nakashima, Esa Rahtu, Janne Heikkilä, Tetsuya Sakai: AxIoU: An Axiomatically Justified Measure for Video Moment Retrieval. CoRR abs/2203.16062 (2022)
- [i20] Yutaro Yamada, Mayu Otani: Does Robustness on ImageNet Transfer to Downstream Tasks? CoRR abs/2204.03934 (2022)
- [i19] Tianwei Chen, Noa Garcia, Mayu Otani, Chenhui Chu, Yuta Nakashima, Hajime Nagahara: Learning More May Not Be Better: Knowledge Transferability in Vision and Language Tasks. CoRR abs/2208.10758 (2022)
- [i18] Qianru Qiu, Xueting Wang, Mayu Otani, Yuki Iwazaki: Color Recommendation for Vector Graphic Documents based on Multi-Palette Representation. CoRR abs/2209.10820 (2022)
- [i17] Mayu Otani, Yale Song, Yang Wang: Video Summarization Overview. CoRR abs/2210.11707 (2022)
- [i16] Zongshang Pang, Yuta Nakashima, Mayu Otani, Hajime Nagahara: Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization. CoRR abs/2211.10056 (2022)
- [i15] Kotaro Kikuchi, Naoto Inoue, Mayu Otani, Edgar Simo-Serra, Kota Yamaguchi: Generative Colorization of Structured Mobile Web Pages. CoRR abs/2212.11541 (2022)
- 2021
- [j7] Wenjian Dong, Mayu Otani, Noa Garcia, Yuta Nakashima, Chenhui Chu: Cross-Lingual Visual Grounding. IEEE Access 9: 349-358 (2021)
- [j6] Kotaro Kikuchi, Mayu Otani, Kota Yamaguchi, Edgar Simo-Serra: Modeling Visual Containment for Web Page Layout Optimization. Comput. Graph. Forum 40(7): 33-44 (2021)
- [j5] Zekun Yang, Noa Garcia, Chenhui Chu, Mayu Otani, Yuta Nakashima, Haruo Takemura: A comparative study of language transformers for video question answering. Neurocomputing 445: 121-133 (2021)
- [c18] Jules Samaran, Noa Garcia, Mayu Otani, Chenhui Chu, Yuta Nakashima: Attending Self-Attention: A Case Study of Visually Grounded Supervision in Vision-and-Language Transformers. ACL (student) 2021: 81-86
- [c17] Tianran Wu, Noa Garcia, Mayu Otani, Chenhui Chu, Yuta Nakashima, Haruo Takemura: Transferring Domain-Agnostic Knowledge in Video Question Answering. BMVC 2021: 301
- [c16] Yusuke Hirota, Noa Garcia, Mayu Otani, Chenhui Chu, Yuta Nakashima, Ittetsu Taniguchi, Takao Onoye: Visual Question Answering with Textual Representations for Images. ICCVW 2021: 3147-3150
- [c15] Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi: Constrained Graphic Layout Generation via Latent Optimization. ACM Multimedia 2021: 88-96
- [c14] Riku Togashi, Masahiro Kato, Mayu Otani, Tetsuya Sakai, Shin'ichi Satoh: Scalable Personalised Item Ranking through Parametric Density Estimation. SIGIR 2021: 921-931
- [c13] Yuta Kayatani, Zekun Yang, Mayu Otani, Noa Garcia, Chenhui Chu, Yuta Nakashima, Haruo Takemura: The Laughing Machine: Predicting Humor in Video. WACV 2021: 2072-2081
- [c12] Riku Togashi, Mayu Otani, Shin'ichi Satoh: Alleviating Cold-Start Problems in Recommendation through Pseudo-Labelling over Knowledge Graph. WSDM 2021: 931-939
- [c11] Riku Togashi, Masahiro Kato, Mayu Otani, Shin'ichi Satoh: Density-Ratio Based Personalised Ranking from Implicit Feedback. WWW 2021: 3221-3233
- [i14] Riku Togashi, Masahiro Kato, Mayu Otani, Shin'ichi Satoh: Density-Ratio Based Personalised Ranking from Implicit Feedback. CoRR abs/2101.07481 (2021)
- [i13] Riku Togashi, Masahiro Kato, Mayu Otani, Tetsuya Sakai, Shin'ichi Satoh: Scalable Personalised Item Ranking through Parametric Density Estimation. CoRR abs/2105.04769 (2021)
- [i12] Yusuke Hirota, Noa Garcia, Mayu Otani, Chenhui Chu, Yuta Nakashima, Ittetsu Taniguchi, Takao Onoye: A Picture May Be Worth a Hundred Words for Visual Question Answering. CoRR abs/2106.13445 (2021)
- [i11] Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi: Constrained Graphic Layout Generation via Latent Optimization. CoRR abs/2108.00871 (2021)
- [i10] Tianran Wu, Noa Garcia, Mayu Otani, Chenhui Chu, Yuta Nakashima, Haruo Takemura: Transferring Domain-Agnostic Knowledge in Video Question Answering. CoRR abs/2110.13395 (2021)
- 2020
- [j4] Mayu Otani, Chenhui Chu, Yuta Nakashima: Visually grounded paraphrase identification via gating and phrase localization. Neurocomputing 404: 165-172 (2020)
- [c10] Noa Garcia, Mayu Otani, Chenhui Chu, Yuta Nakashima: KnowIT VQA: Answering Knowledge-Based Questions about Videos. AAAI 2020: 10826-10834
- [c9] Mayu Otani, Yuta Nakashima, Esa Rahtu, Janne Heikkilä: Uncovering Hidden Challenges in Query-Based Video Moment Retrieval. BMVC 2020
- [c8] Noa Garcia, Chentao Ye, Zihua Liu, Qingtao Hu, Mayu Otani, Chenhui Chu, Yuta Nakashima, Teruko Mitamura: A Dataset and Baselines for Visual Question Answering on Art. ECCV Workshops (2) 2020: 92-108
- [c7] Zekun Yang, Noa Garcia, Chenhui Chu, Mayu Otani, Yuta Nakashima, Haruo Takemura: BERT Representations for Video Question Answering. WACV 2020: 1545-1554
- [i9] Noa Garcia, Mayu Otani, Chenhui Chu, Yuta Nakashima: Knowledge-Based Visual Question Answering in Videos. CoRR abs/2004.08385 (2020)
- [i8] Noa Garcia, Chentao Ye, Zihua Liu, Qingtao Hu, Mayu Otani, Chenhui Chu, Yuta Nakashima, Teruko Mitamura: A Dataset and Baselines for Visual Question Answering on Art. CoRR abs/2008.12520 (2020)
- [i7] Mayu Otani, Yuta Nakashima, Esa Rahtu, Janne Heikkilä: Uncovering Hidden Challenges in Query-Based Video Moment Retrieval. CoRR abs/2009.00325 (2020)
- [i6] Riku Togashi, Mayu Otani, Shin'ichi Satoh: Alleviating Cold-Start Problems in Recommendation through Pseudo-Labelling over Knowledge Graph. CoRR abs/2011.05061 (2020)
2010 – 2019
- 2019
- [c6] Mayu Otani, Yuta Nakashima, Esa Rahtu, Janne Heikkilä: Rethinking the Evaluation of Video Summaries. CVPR 2019: 7596-7604
- [i5] Mayu Otani, Yuta Nakashima, Esa Rahtu, Janne Heikkilä: Rethinking the Evaluation of Video Summaries. CoRR abs/1903.11328 (2019)
- [i4] Noa Garcia, Mayu Otani, Chenhui Chu, Yuta Nakashima: KnowIT VQA: Answering Knowledge-Based Questions about Videos. CoRR abs/1910.10706 (2019)
- 2018
- [j3] Mayu Otani, Atsushi Nishida, Yuta Nakashima, Tomokazu Sato, Naokazu Yokoya: Finding Important People in a Video Using Deep Neural Networks with Conditional Random Fields. IEICE Trans. Inf. Syst. 101-D(10): 2509-2517 (2018)
- [c5] Chenhui Chu, Mayu Otani, Yuta Nakashima: iParaphrasing: Extracting Visually Grounded Paraphrases via an Image. COLING 2018: 3479-3492
- [i3] Chenhui Chu, Mayu Otani, Yuta Nakashima: iParaphrasing: Extracting Visually Grounded Paraphrases via an Image. CoRR abs/1806.04284 (2018)
- 2017
- [j2] Mayu Otani, Yuta Nakashima, Tomokazu Sato, Naokazu Yokoya: Video summarization using textual descriptions for authoring video blogs. Multim. Tools Appl. 76(9): 12097-12115 (2017)
- 2016
- [c4] Mayu Otani, Yuta Nakashima, Esa Rahtu, Janne Heikkilä, Naokazu Yokoya: Video Summarization Using Deep Semantic Features. ACCV (5) 2016: 361-377
- [c3] Mayu Otani, Yuta Nakashima, Esa Rahtu, Janne Heikkilä, Naokazu Yokoya: Learning Joint Representations of Videos and Sentences with Web Image Search. ECCV Workshops (1) 2016: 651-667
- [i2] Mayu Otani, Yuta Nakashima, Esa Rahtu, Janne Heikkilä, Naokazu Yokoya: Learning Joint Representations of Videos and Sentences with Web Image Search. CoRR abs/1608.02367 (2016)
- [i1] Mayu Otani, Yuta Nakashima, Esa Rahtu, Janne Heikkilä, Naokazu Yokoya: Video Summarization using Deep Semantic Features. CoRR abs/1609.08758 (2016)
- 2015
- [c2] Mayu Otani, Yuta Nakashima, Tomokazu Sato, Naokazu Yokoya: Textual description-based video summarization for video blogs. ICME 2015: 1-6
- 2014
- [c1] Mayu Otani, Hirohisa Hioki: Video colorization based on optical flow and edge-oriented color propagation. Computational Imaging 2014: 902002
- 2012
- [j1] Toru Hasunuma, Mayu Otani: On the (h, k)-domination numbers of iterated line digraphs. Discret. Appl. Math. 160(12): 1859-1863 (2012)
last updated on 2025-04-14 21:12 CEST by the dblp team
all metadata released as open data under CC0 1.0 license