LUT-LLM: Efficient Large Language Model Inference with Memory-based Computations on FPGAs Paper • 2511.06174 • Published 14 days ago
InTAR: Inter-Task Auto-Reconfigurable Accelerator Design for High Data Volume Variation in DNNs Paper • 2502.08807 • Published Feb 12
HMT: Hierarchical Memory Transformer for Long Context Language Processing Paper • 2405.06067 • Published May 9, 2024