CLAUSE-Bielefeld
/

communicative-baby-dpo

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

bbunzeck commited on Nov 7

Commit

efa54d6

·

verified ·

1 Parent(s): 8a6bc50

Update README.md

Files changed (1) hide show

README.md +16 -1

README.md CHANGED Viewed

@@ -47,7 +47,22 @@ This model was trained with DPO, a method introduced in [Direct Preference Optim
 - Tokenizers: 0.21.2
 ## Citations
 Cite DPO as:
 ```bibtex

 - Tokenizers: 0.21.2
 ## Citations
+For further information please consult the accompanying [paper](https://aclanthology.org/2025.babylm-main.29/).
+If you make use of this model in your work, please also cite the paper:
+```bibtex
+@inproceedings{padovani-etal-2025-dialogue,
+    title = "Dialogue Is Not Enough to Make a Communicative {B}aby{LM} (But Neither Is Developmentally Inspired Reinforcement Learning)",
+    author = "Padovani, Francesca  and Bunzeck, Bastian  and Ali, Manar  and Momen, Omar  and Bisazza, Arianna  and Buschmeier, Hendrik  and Zarrie{\ss}, Sina",
+    editor = "Charpentier, Lucas  and Choshen, Leshem  and Cotterell, Ryan  and Gul, Mustafa Omer  and Hu, Michael Y.  and Liu, Jing  and Jumelet, Jaap  and Linzen, Tal  and Mueller, Aaron  and Ross, Candace  and Shah, Raj Sanjay  and Warstadt, Alex  and Wilcox, Ethan Gotlieb  and Williams, Adina",
+    booktitle = "Proceedings of the First BabyLM Workshop",
+    month = nov,
+    year = "2025",
+    address = "Suzhou, China",
+    publisher = "Association for Computational Linguistics",
+    url = "https://aclanthology.org/2025.babylm-main.29/",
+    pages = "421--435",
+}
+```
 Cite DPO as:
 ```bibtex