Tokenizer JSON
#17 opened 9 days ago
by
rageltman
Asking to Release FP6 variant
#15 opened 12 days ago
by
MahdiFeyz
Insights on comparisons with Qwen/Qwen3-Next-80B-A3B-Instruct?
#14 opened 16 days ago
by
saireddy
Model Architecture Naming: KDA
#11 opened 18 days ago
by
dkleine
Trying to run this on a 4090 and 192GB RAM... not enough RAM?
2
#10 opened 19 days ago
by
MikaSouthworth
tool parser?
2
#8 opened 22 days ago
by
prudant