Keisuke Kamahori (釜堀 恵輔; he/him)
kamahori [at] uw.edu
I'm a second-year Ph.D. student at
Paul G. Allen School of Computer Science & Engineering,
University of Washington,
advised by Baris Kasikci.
I'm broadly interested in computer systems and architecture, with a focus on systems for LLMs recently.
I also spend some time at Kotoba Technologies, where I'm building the serving system behind the world's fastest simultaneous translation app.
My Ph.D. study is generously supported by Toyota Riken Ph.D. Scholarship.
Prior to that, I received B. Sc. in Information Science from the University of Tokyo in 2023, advised by Shinya Takamaeda-Yamazaki. I also worked with James Larus at EPFL in the summer of 2022.
[Google Scholar] [ORCID] [DBLP] [Linkedin] [X]
Publications
-
LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation
Keisuke Kamahori, Jungo Kasai, Noriyuki Kojima, Baris Kasikci
arXiv preprint
-
TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval
Chien-Yu Lin*, Keisuke Kamahori*, Yiyu Liu, Xiaoxiang Shi, Madhav Kashyap, Yile Gu, Rulin Shao, Zihao Ye, Kan Zhu, Stephanie Wang, Arvind Krishnamurthy, Rohan Kadekodi, Luis Ceze, Baris Kasikci
arXiv preprint
-
Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models
Keisuke Kamahori*, Tian Tang*, Yile Gu, Kan Zhu, Baris Kasikci
ICLR2025 (PML4LRS @ ICLR2024)
-
NanoFlow: Towards Optimal Large Language Model Serving Throughput
Kan Zhu, Yilong Zhao, Liangyu Zhao, Gefei Zuo, Yile Gu, Dedong Xie, Yufei Gao, Qinyu Xu, Tian Tang, Zihao Ye, Keisuke Kamahori, Chien-Yu Lin, Stephanie Wang, Arvind Krishnamurthy, Baris Kasikci
OSDI2025
-
A 475 MHz FPGA Accelerator for RTL Simulation
Sahand Kashani*, Mahyar Emami*, Keisuke Kamahori, Mohammad Sepehr Pourghannad, Ritik Raj, James R Larus
FPGA2024
-
Manticore: Hardware-Accelerated RTL Simulation with Static Bulk-Synchronous Parallelism
Mahyar Emami*, Sahand Kashani*, Keisuke Kamahori, Mohammad Sepehr Pourghannad, Ritik Raj, James R Larus
ASPLOS2024
-
CiraaS: cloud computing with programmable logic
Kenji Tanaka, Yuki Arikawa, Tsuyoshi Ito, Yuki Matsuda, Keisuke Kamahori, Shinya Kaji, Takeshi Sakamoto
SIGCOMM2022 Poster
-
Accelerating Decision Tree Ensemble with Guided Branch Approximation
Keisuke Kamahori, Shinya Takamaeda-Yamazaki
HEART2022