-
CLEX: Continuous Length Extrapolation for Large Language Models
Paper • 2310.16450 • Published • 10 -
E^2-LLM: Efficient and Extreme Length Extension of Large Language Models
Paper • 2401.06951 • Published • 26 -
Data Engineering for Scaling Language Models to 128K Context
Paper • 2402.10171 • Published • 25
Juan Herrera
juampahc
AI & ML interests
None yet
Recent Activity
liked
a model
15 days ago
Qwen/Qwen3-Omni-30B-A3B-Instruct
liked
a model
15 days ago
baidu/ERNIE-4.5-VL-28B-A3B-Thinking
liked
a model
15 days ago
BAAI/Emu3.5