Tokenizer change has strange order plus format
2
#21 opened over 1 year ago
by
Qubitium
Training DBRX-like model
3
#13 opened over 1 year ago
by
nguyenthanhdo
Release weights of smaller Experimental MoE
π
7
2
#12 opened over 1 year ago
by
shahules786
Train datasets?
β€οΈ
β
8
1
#11 opened over 1 year ago
by
danielpark
Errors During Training for the Original Implementation and the Fixes for the Errors
#10 opened over 1 year ago
by
v2ray
performance of DBRX in TEXT SUMMARIZATON and GRAMMAR CORRECTION
π
π
1
1
#9 opened over 1 year ago
by
CharanAI
Please, authorize access for the base weight!
β
π
34
44
#5 opened over 1 year ago
by
Undi95