Following integrations with TorchAO, Transformers, and vLLM, AutoRound-quantized models are now officially compatible with SGLang, bringing faster and more flexible deployment to your LLM workflows.
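As a rough illustration of what that deployment path can look like, here is a minimal offline-inference sketch using SGLang's Python engine. The model path is a placeholder for any AutoRound-quantized checkpoint, and the exact engine arguments may vary across SGLang versions:

```python
# Minimal sketch: run an AutoRound-quantized model with SGLang's offline engine.
# "your-org/your-model-autoround-int4" is a placeholder, not a real checkpoint.
import sglang as sgl

if __name__ == "__main__":
    llm = sgl.Engine(model_path="your-org/your-model-autoround-int4")
    outputs = llm.generate(
        ["Explain weight-only quantization in one sentence."],
        {"temperature": 0.0, "max_new_tokens": 64},
    )
    print(outputs[0]["text"])
    llm.shutdown()
```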
💡 We’ve also enhanced the RTN mode (`--iters 0`), cutting quantization costs significantly for low-resource users.
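For reference, a minimal sketch of RTN mode through the Python API is shown below; the model name is a small placeholder chosen for illustration, and parameter defaults may differ between releases. Setting `iters=0` skips the tuning loop, which is what makes this mode so cheap:

```python
# Sketch: RTN-style quantization by disabling AutoRound's tuning iterations.
# "facebook/opt-125m" is just a small placeholder model for illustration.
from transformers import AutoModelForCausalLM, AutoTokenizer
from auto_round import AutoRound

model_name = "facebook/opt-125m"
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# iters=0 falls back to round-to-nearest (RTN), trading some accuracy
# for a much cheaper quantization pass on low-resource hardware.
autoround = AutoRound(model, tokenizer, bits=4, group_size=128, iters=0)
autoround.quantize()
autoround.save_quantized("./opt-125m-autoround-rtn")
```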
⭐ Star our repo and stay tuned for more exciting updates!