Publications

Please see my Google Scholar profile for the full list of papers.

*: equal contribution

2026

Format Matters: The Robustness of Multimodal LLMs in Reviewing Evidence from Tables and Charts
Xanh Ho, Yun-Ang Wu, Sunisth Kumar, Florian Boudin, Atsuhiro Takasu, Akiko Aizawa
AAAI 2026

2025

Table-Text Alignment: Explaining Claim Verification Against Tables in Scientific Papers
Xanh Ho, Sunisth Kumar, Yun-Ang Wu, Florian Boudin, Atsuhiro Takasu, Akiko Aizawa
Findings of EMNLP 2025
Decontextualization, Everywhere: A Systematic Audit on PeerQA
Xanh Ho, Tian Cheng Xia, Khoa Duong, Yun-Ang Wu, Ha-Thanh Nguyen, Akiko Aizawa
Agents4Science 2025 (AI-generated paper)
UnitMath: Unit-Aware Numerical Reasoning and Dimensional Consistency for Scientific Table Claims
Xanh Ho, Tian Cheng Xia, Khoa Duong, Yun-Ang Wu, Ha-Thanh Nguyen, Akiko Aizawa
Agents4Science 2025 (AI-generated paper)
LLM-as-a-Judge: Reassessing the Performance of LLMs in Extractive QA
Xanh Ho, Jiahao Huang, Florian Boudin, Akiko Aizawa
arXiv preprint, 2025

2024

MoreHopQA: More Than Multi-hop Reasoning
Julian Schnitzler*, Xanh Ho*, Jiahao Huang*, Florian Boudin, Saku Sugawara, Akiko Aizawa
arXiv preprint, 2024
A Survey of Pre-trained Language Models for Processing Scientific Text
Xanh Ho*, Anh Khoa Duong Nguyen*, An Tuan Dao*, Junfeng Jiang*, Yuki Chida*, Kaito Sugimoto*, Huy Quoc To, Florian Boudin, Akiko Aizawa
arXiv preprint, 2024

2023

Analyzing the effectiveness of the underlying reasoning tasks in multi-hop question answering
Xanh Ho*, Anh-Khoa Duong Nguyen*, Saku Sugawara, Akiko Aizawa
Findings of EACL 2023
Solving Label Variation in Scientific Information Extraction via Multi-Task Learning
Dong Pham, Xanh Ho, Quang Thuy Ha, Akiko Aizawa
PACLIC 2023

2022

How Well Do Multi-hop Reading Comprehension Models Understand Date Information?
Xanh Ho, Saku Sugawara, Akiko Aizawa
AACL | IJCNLP 2022
A survey on measuring and mitigating reasoning shortcuts in machine reading comprehension
Xanh Ho*, Johannes Mario Meissner*, Saku Sugawara, Akiko Aizawa
arXiv preprint, 2022

2020

Constructing a multi-hop QA dataset for comprehensive evaluation of reasoning steps
Xanh Ho, Anh-Khoa Duong Nguyen, Saku Sugawara, Akiko Aizawa
COLING 2020