


Tatsuya Hiraoka
2020 – today
- 2025
[c12]Munachiso Nwadike, Zangir Iklassov, Toluwani Aremu, Tatsuya Hiraoka, Benjamin Heinzerling, Velibor Bojkovic, Hilal AlQuabeh, Martin Takác, Kentaro Inui:
Library-Like Behavior In Language Models is Enhanced by Self-Referencing Causal Cycles. ACL (1) 2025: 25365-25377
[c11]Kohei Tsuji, Tatsuya Hiraoka, Yuchang Cheng, Tomoya Iwakura:
SubRegWeigh: Effective and Efficient Annotation Weighing with Subword Regularization. COLING 2025: 1908-1921
[c10]Tatsuya Hiraoka, Kentaro Inui:
Repetition Neurons: How Do Language Models Produce Repetitions? NAACL (Short Papers) 2025: 483-495
[c9]Ahmed Oumar El-Shangiti, Tatsuya Hiraoka, Hilal AlQuabeh, Benjamin Heinzerling, Kentaro Inui:
The Geometry of Numerical Reasoning: Language Models Compare Numeric Properties in Linear Subspaces. NAACL (Short Papers) 2025: 550-561
[i22]Munachiso Nwadike, Zangir Iklassov, Toluwani Aremu, Tatsuya Hiraoka, Velibor Bojkovic, Benjamin Heinzerling, Hilal AlQuabeh, Martin Takác, Kentaro Inui:
RECALL: Library-Like Behavior In Language Models is Enhanced by Self-Referencing Causal Cycles. CoRR abs/2501.13491 (2025)
[i21]H. V. AlquBoj, Hilal AlQuabeh, Velibor Bojkovic, Tatsuya Hiraoka, Ahmed Oumar El-Shangiti, Munachiso Nwadike, Kentaro Inui:
Number Representations in LLMs: A Computational Parallel to Human Perception. CoRR abs/2502.16147 (2025)
[i20]Kohei Tsuji, Tatsuya Hiraoka, Yuchang Cheng, Eiji Aramaki, Tomoya Iwakura:
Investigating Neurons and Heads in Transformer-based LLMs for Typographical Errors. CoRR abs/2502.19669 (2025)
[i19]Shintaro Ozaki, Tatsuya Hiraoka, Hiroto Otake, Hiroki Ouchi, Masaru Isonuma, Benjamin Heinzerling, Kentaro Inui, Taro Watanabe, Yusuke Miyao, Yohei Oseki, Yu Takagi:
Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance. CoRR abs/2505.21458 (2025)
[i18]Sangwhan Moon, Tatsuya Hiraoka, Naoaki Okazaki:
Bit-level BPE: Below the byte boundary. CoRR abs/2506.07541 (2025)
[i17]Tatsuya Hiraoka, Kentaro Inui:
Spelling-out is not Straightforward: LLMs' Capability of Tokenization from Token to Characters. CoRR abs/2506.10641 (2025)
[i16]Nhi Hoai Doan, Tatsuya Hiraoka, Kentaro Inui:
Understanding and Controlling Repetition Neurons and Induction Heads in In-Context Learning. CoRR abs/2507.07810 (2025)
[i15]Seiya Ishikura, Hiroaki Yamada, Tatsuya Hiraoka, Hiroaki Yamada, Takenobu Tokunaga:
Augmenting Dialog with Think-Aloud Utterances for Modeling Individual Personality Traits by LLM. CoRR abs/2510.09158 (2025)
[i14]Jesse Atuhurra, Iqra Ali, Tomoya Iwakura, Hidetaka Kamigaito, Tatsuya Hiraoka:
VLURes: Benchmarking VLM Visual and Linguistic Understanding in Low-Resource Languages. CoRR abs/2510.12845 (2025)
- 2024
[i13]Tatsuya Hiraoka, Naoaki Okazaki:
Knowledge of Pretrained Language Models on Surface Information of Tokens. CoRR abs/2402.09808 (2024)
[i12]Marco Cognetta, Tatsuya Hiraoka, Naoaki Okazaki, Rico Sennrich, Yuval Pinter:
An Analysis of BPE Vocabulary Trimming in Neural Machine Translation. CoRR abs/2404.00397 (2024)
[i11]Jesse Atuhurra, Iqra Ali, Tatsuya Hiraoka, Hidetaka Kamigaito, Tomoya Iwakura, Taro Watanabe:
Constructing Multilingual Visual-Text Datasets Revealing Visual Multilingual Ability of Vision Language Models. CoRR abs/2406.15359 (2024)
[i10]Akiko Aizawa, Eiji Aramaki, Bowen Chen, Fei Cheng, Hiroyuki Deguchi, Rintaro Enomoto, Kazuki Fujii, Kensuke Fukumoto, Takuya Fukushima, Namgi Han, Yuto Harada, Chikara Hashimoto, Tatsuya Hiraoka, Shohei Hisada, Sosuke Hosokawa, Lu Jie, Keisuke Kamata, Teruhito Kanazawa, Hiroki Kanezashi, Hiroshi Kataoka, Satoru Katsumata, Daisuke Kawahara, Seiya Kawano, Atsushi Keyaki, Keisuke Kiryu, Hirokazu Kiyomaru, Takashi Kodama, Takahiro Kubo, Yohei Kuga, Ryoma Kumon, Shuhei Kurita, Sadao Kurohashi, Conglong Li, Taiki Maekawa, Hiroshi Matsuda, Yusuke Miyao, Kentaro Mizuki, Sakae Mizuki, Yugo Murawaki, Ryo Nakamura, Taishi Nakamura, Kouta Nakayama, Tomoka Nakazato, Takuro Niitsuma, Jiro Nishitoba, Yusuke Oda, Hayato Ogawa, Takumi Okamoto, Naoaki Okazaki, Yohei Oseki, Shintaro Ozaki, Koki Ryu, Rafal Rzepka, Keisuke Sakaguchi, Shota Sasaki, Satoshi Sekine, Kohei Suda, Saku Sugawara, Issa Sugiura, Hiroaki Sugiyama, Hisami Suzuki, Jun Suzuki, Toyotaro Suzumura, Kensuke Tachibana, Yu Takagi, Kyosuke Takami, Koichi Takeda, Masashi Takeshita, Masahiro Tanaka, Kenjiro Taura, Arseny Tolmachev, Nobuhiro Ueda, Zhen Wan, Shuntaro Yada, Sakiko Yahata, Yuya Yamamoto, Yusuke Yamauchi, Hitomi Yanaka, Rio Yokota, Koichiro Yoshino:
LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs. CoRR abs/2407.03963 (2024)
[i9]Kohei Tsuji, Tatsuya Hiraoka, Yuchang Cheng, Tomoya Iwakura:
SubRegWeigh: Effective and Efficient Annotation Weighing with Subword Regularization. CoRR abs/2409.06216 (2024)
[i8]Ahmed Oumar El-Shangiti, Tatsuya Hiraoka, Hilal AlQuabeh, Benjamin Heinzerling, Kentaro Inui:
The Geometry of Numerical Reasoning: Language Models Compare Numeric Properties in Linear Subspaces. CoRR abs/2410.13194 (2024)
[i7]Tatsuya Hiraoka, Kentaro Inui:
Repetition Neurons: How Do Language Models Produce Repetitions? CoRR abs/2410.13497 (2024)
- 2023
[c8]Teruno Kajiura, Shiho Takano, Tatsuya Hiraoka, Kimio Kuramitsu:
Vocabulary Replacement in SentencePiece for Domain Adaptation. PACLIC 2023: 645-652
[i6]Tatsuya Hiraoka, Tomoya Iwakura:
Downstream Task-Oriented Neural Tokenizer Optimization with Vocabulary Restriction as Post Processing. CoRR abs/2304.10808 (2023)
[i5]Tatsuya Hiraoka, Tomoya Iwakura:
Tokenization Tractability for Human and Machine Learning Model: An Annotation Study. CoRR abs/2304.10813 (2023)
- 2022
[j1]Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, Naoaki Okazaki:
Recurrent Neural Hidden Markov Model for High-order Transition. ACM Trans. Asian Low Resour. Lang. Inf. Process. 21(2): 36:1-36:15 (2022)
[c7]Sho Takase, Tatsuya Hiraoka, Naoaki Okazaki:
Single Model Ensemble for Subword Regularized Models in Low-Resource Machine Translation. ACL (Findings) 2022: 2536-2541
[c6]Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, Naoaki Okazaki:
Word-level Perturbation Considering Word Length and Compositional Subwords. ACL (Findings) 2022: 3268-3275
[c5]Youmi Ma, Tatsuya Hiraoka, Naoaki Okazaki:
Joint Entity and Relation Extraction Based on Table Labeling Using Convolutional Neural Networks. SPNLP@ACL 2022: 11-21
[c4]Tatsuya Hiraoka:
MaxMatch-Dropout: Subword Regularization for WordPiece. COLING 2022: 4864-4872
[i4]Sho Takase, Tatsuya Hiraoka, Naoaki Okazaki:
Single Model Ensemble for Subword Regularized Models in Low-Resource Machine Translation. CoRR abs/2203.13528 (2022)
[i3]Tatsuya Hiraoka:
MaxMatch-Dropout: Subword Regularization for WordPiece. CoRR abs/2209.04126 (2022)
- 2021
[c3]Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, Naoaki Okazaki:
Joint Optimization of Tokenization and Downstream Model. ACL/IJCNLP (Findings) 2021: 244-255
[i2]Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, Naoaki Okazaki:
Joint Optimization of Tokenization and Downstream Model. CoRR abs/2105.12410 (2021)
- 2020
[c2]Tatsuya Hiraoka, Sho Takase, Kei Uchiumi, Atsushi Keyaki, Naoaki Okazaki:
Optimizing Word Segmentation for Downstream Task. EMNLP (Findings) 2020: 1341-1351
[i1]Youmi Ma, Tatsuya Hiraoka, Naoaki Okazaki:
Named Entity Recognition and Relation Extraction using Enhanced Table Filling by Contextualized Representations. CoRR abs/2010.07522 (2020)
2010 – 2019
- 2019
[c1]Tatsuya Hiraoka, Hiroyuki Shindo, Yuji Matsumoto:
Stochastic Tokenization with a Language Model for Neural Text Classification. ACL (1) 2019: 1620-1629
last updated on 2026-01-05 23:41 CET by the dblp team
all metadata released as open data under CC0 1.0 license







