Tony W
tonyaw
AI & ML interests
None yet
Organizations
None yet
how to make determinstic output?
4
#23 opened 4 months ago
by
junma
gpt-oss-120b has high possibility to generate response as part of reasoning
#133 opened 3 months ago
by
tonyaw
Does the RL lead to this model to prefer to give answers in a certain length scope?
#4 opened 7 months ago
by
tonyaw
Does the RL lead to this model to prefer to give answers in a certain length scope?
#1 opened 7 months ago
by
tonyaw
Does the RL lead to this model to prefer to give answers in a certain length scope?
#65 opened 7 months ago
by
tonyaw
Incorrect vocab size?
👍
2
12
#2 opened almost 2 years ago
by
claudiuv
How to use PEFT+LoRA to fine-tune starchat-alpha
1
#17 opened over 2 years ago
by
tonyaw