Hi, I'm Seiji

I am a PhD student at the Nara Institute of Science and Technology (NAIST) in Japan, working on clinical natural language processing under the supervision of Prof. Eiji Aramaki at the Social Computing Lab. My research focuses on developing practical and reliable language technologies for real-world health care settings.

A central theme of my work is addressing data scarcity, domain shift, and data-sharing constraints in the clinical domain while protecting patient privacy. My research focuses on robustness under domain shift, particularly on how models trained on data from specific clinical institutions or general-domain data can generalize to new hospitals and previously unseen types of clinical documentation. In parallel, I work on privacy-preserving synthetic corpus generation, developing methods to make clinical NLP research more accessible without compromising patient privacy.

In addition to English, I conduct research on Japanese clinical texts, focusing on language resource construction. My work includes benchmark construction and evaluation for practical clinical tasks in Japanese (J-ClinicalBench), as well as annotation methodologies that reduce expert burden. Overall, I aim to make clinical NLP systems both deployable and trustworthy in practice.


News

Highlighted Publications

Mock project card 1

A Herd of Language Models Makes a Better Zero-shot Annotator for Clinical Named Entity Recognition

Findings of ACL 2026

Mock project card 2

J-ClinicalBench: A Benchmark for Evaluating Large Language Models on Practical Clinical Tasks in Japanese

LREC 2026

Mock project card 3

RecordTwin: Towards Creating Safe Synthetic Clinical Corpora

Findings of ACL 2025

Mock project card 4

Exploring LLM Annotation for Adaptation of Clinical Information Extraction Models under Data-Sharing Restrictions

Findings of ACL 2025

Get In Touch

  • ✉️ shimizu.seiji.so8@is.naist.jp