Quantizations version
#5 opened 3 months ago by baiall
NuMarkdown-8B-reasoning on A100 40GB is extremely slow (even for 1 token)
#4 opened 3 months ago by Fedoration
This is so exciting!
#2 opened 4 months ago by ashercn97