Tony W's picture

12

Tony W

tonyaw

tonyaw

AI & ML interests

None yet

Organizations

None yet

New activity in openai/gpt-oss-20b 3 months ago

how to make determinstic output?

#23 opened 4 months ago by

New activity in openai/gpt-oss-120b 3 months ago

gpt-oss-120b has high possibility to generate response as part of reasoning

#133 opened 3 months ago by

New activity in nvidia/Llama-3.1-Nemotron-70B-Reward 7 months ago

Does the RL lead to this model to prefer to give answers in a certain length scope?

#4 opened 7 months ago by

New activity in joshmiller656/Llama-3.1-Nemotron-70B-Instruct-AWQ-INT4 7 months ago

Does the RL lead to this model to prefer to give answers in a certain length scope?

#1 opened 7 months ago by

New activity in nvidia/Llama-3.1-Nemotron-70B-Instruct-HF 7 months ago

Does the RL lead to this model to prefer to give answers in a certain length scope?

#65 opened 7 months ago by

New activity in ise-uiuc/Magicoder-S-DS-6.7B almost 2 years ago

Incorrect vocab size?

#2 opened almost 2 years ago by

New activity in HuggingFaceH4/starchat-alpha over 2 years ago

How to use PEFT+LoRA to fine-tune starchat-alpha

#17 opened over 2 years ago by