Interactive demo comparing a baseline SFT GPT-2 with an RLHF
A demo of a DistilBERT classifier fine-tuned on AG News