tiandunx
AI & ML interests
None yet
Recent Activity
Organizations
None yet
-
-
-
-
-
-
-
-
-
-
-
view article
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment
upvoted
a
paper
3 months ago
view article
How to generate text: using different decoding methods for language generation with Transformers