Zian Li's picture

1

Zian Li

AutumnE

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 2 months ago

TPLA: Tensor Parallel Latent Attention for Efficient Disaggregated Prefill \& Decode Inference

Paper • 2508.15881 • Published Aug 21 • 8