Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
vitalyr 's Collections
quant
agent
mamba
llm
Diffusion Model

llm

updated Apr 25, 2024
Upvote
-

  • LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders

    Paper • 2404.05961 • Published Apr 9, 2024 • 66

  • Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

    Paper • 2404.07143 • Published Apr 10, 2024 • 111

  • Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies

    Paper • 2404.08197 • Published Apr 12, 2024 • 29

  • Pre-training Small Base LMs with Fewer Tokens

    Paper • 2404.08634 • Published Apr 12, 2024 • 35

  • OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

    Paper • 2404.14619 • Published Apr 22, 2024 • 126
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs