Publications

Conference Papers


HexGen-2: Disaggregated Generative Inference of LLMs in Heterogeneous Environment

Published in International Conference on Learning Representations (ICLR), 2025

Youhe Jiang*, Ran Yan*, Binhang Yuan

HexGen: Generative Inference of Large Language Model over Heterogeneous Environment

Published in International Conference on Machine Learning (ICML), 2024

Youhe Jiang*, Ran Yan*, Xiaozhe Yao*, Yang Zhou, Beidi Chen, Binhang Yuan