dkimds
·
AI & ML interests
RL, LLM, RLHF and so on.
Organizations
None yet
dkimds/mt0-large-ia3
Updated
dkimds/peft-vit-base-patch16-224-in21k-lora
Updated
dkimds/bloomz-560-m-peft-method
Updated
dkimds/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
dkimds/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
•
2
dkimds/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
dkimds/ppo-SnowballTarget
Reinforcement Learning
•
Updated
dkimds/ppo-Pyramids-Training
Reinforcement Learning
•
Updated
dkimds/PixelCopter-PLE-v0
Updated
dkimds/Reinforce-CartPole-v1
Reinforcement Learning
•
Updated
dkimds/q-Taxi-v3
Reinforcement Learning
•
Updated
dkimds/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
dkimds/ppo-Huggy
Reinforcement Learning
•
Updated
•
13
dkimds/distilbert-base-uncased-finetuned-squad-d5716d28
Question Answering
•
Updated
dkimds/bert-finetuned-ner-accelerate
Updated
dkimds/bert-finetuned-ner
Updated
dkimds/code-search-net-tokenizer
Updated
dkimds/dummy-repo
Updated
dkimds/dummy-model
Fill-Mask
•
Updated