gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7 • 381
view article Article Introducing HELMET: Holistically Evaluating Long-context Language Models Apr 16 • 40
view article Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference Jan 16 • 76
view article Article Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding Jan 30, 2024 • 9
Distributed Speculative Inference of Large Language Models Paper • 2405.14105 • Published May 23, 2024 • 18