
InfiniCode

Azamorn

AI & ML interests

None yet

Organizations

None yet

Recent Activity

commented on a paper about 1 month ago

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published Oct 20 • 66

updated a model 9 months ago

Azamorn/smollm2_360m_grpo_gsm8k_reasoner-Q8_0-GGUF

0.4B • Updated Mar 1 • 1

published a model 9 months ago

Azamorn/smollm2_360m_grpo_gsm8k_reasoner-Q8_0-GGUF

0.4B • Updated Mar 1 • 1

liked a model 9 months ago

ibm-granite/granite-3.2-8b-instruct-preview

Text Generation • 8B • Updated Feb 26 • 72 • 69

upvoted a paper about 1 year ago

SelfCodeAlign: Self-Alignment for Code Generation

Paper • 2410.24198 • Published Oct 31, 2024 • 24

liked a model over 1 year ago

lmstudio-community/Mistral-Large-Instruct-2407-GGUF

Text Generation • 123B • Updated Aug 29, 2024 • 79 • 13

updated 2 models over 1 year ago

Azamorn/Meta-Llama-3-70B-Instruct-Distributed

Updated May 27, 2024 • 1

Azamorn/Meta-Llama-3-8B-Instruct-Distributed

Updated May 11, 2024

liked 2 models over 1 year ago

#1 opened over 1 year ago by

Anderson452

New activity in google/codegemma-7b over 1 year ago

context window size?

#10 opened over 1 year ago by

ichigoberry

upvoted a paper almost 2 years ago

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads

Paper • 2401.10774 • Published Jan 19, 2024 • 59

liked a model almost 2 years ago

openchat/openchat-3.5-0106

Text Generation • 7B • Updated May 18, 2024 • 15.6k • 360

upvoted a paper almost 2 years ago

TinyLlama: An Open-Source Small Language Model

Paper • 2401.02385 • Published Jan 4, 2024 • 95

updated a dataset almost 2 years ago

Azamorn/tiny-codes-csharp

Viewer • Updated Dec 26, 2023 • 125k • 104 • 1

updated a model almost 2 years ago

Azamorn/retnet-tinystories

Text Generation • 0.4B • Updated Dec 25, 2023

New activity in TheBloke/vicuna-13B-v1.5-16K-GGML over 2 years ago

Model answer ends in repeating word

#1 opened over 2 years ago by

mrichardt

New activity in HuggingFaceH4/starchat-beta over 2 years ago

Tokenizer causes issues in Finetuning because of special tokens in tokenization <|x|>

#16 opened over 2 years ago by

LazerJesus

liked a Space over 2 years ago

Open LLM Leaderboard

🏆

13.7k

Track, rank and evaluate open LLMs and chatbots

Azamorn's activity

Thank you so much!
