Base Model for TransMLA
mengfanxu
fxmeng
AI & ML interests
None yet
Recent Activity
upvoted a paper 17 days ago
Generative Refinement Networks for Visual Synthesis
upvoted a paper about 1 month ago
HISA: Efficient Hierarchical Indexing for Fine-Grained Sparse Attention
authored a paper about 1 month ago
LIFT: Improving Long Context Understanding of Large Language Models through Long Input Fine-Tuning
Organizations
None yet