LLMs, NLP, Alignment, DPO, RLHF, data labeling, text-classification, text-generation, token-classification