bbunzeck commited on
Commit
efa54d6
·
verified ·
1 Parent(s): 8a6bc50

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -1
README.md CHANGED
@@ -47,7 +47,22 @@ This model was trained with DPO, a method introduced in [Direct Preference Optim
47
  - Tokenizers: 0.21.2
48
 
49
  ## Citations
50
-
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
51
  Cite DPO as:
52
 
53
  ```bibtex
 
47
  - Tokenizers: 0.21.2
48
 
49
  ## Citations
50
+ For further information please consult the accompanying [paper](https://aclanthology.org/2025.babylm-main.29/).
51
+ If you make use of this model in your work, please also cite the paper:
52
+ ```bibtex
53
+ @inproceedings{padovani-etal-2025-dialogue,
54
+ title = "Dialogue Is Not Enough to Make a Communicative {B}aby{LM} (But Neither Is Developmentally Inspired Reinforcement Learning)",
55
+ author = "Padovani, Francesca and Bunzeck, Bastian and Ali, Manar and Momen, Omar and Bisazza, Arianna and Buschmeier, Hendrik and Zarrie{\ss}, Sina",
56
+ editor = "Charpentier, Lucas and Choshen, Leshem and Cotterell, Ryan and Gul, Mustafa Omer and Hu, Michael Y. and Liu, Jing and Jumelet, Jaap and Linzen, Tal and Mueller, Aaron and Ross, Candace and Shah, Raj Sanjay and Warstadt, Alex and Wilcox, Ethan Gotlieb and Williams, Adina",
57
+ booktitle = "Proceedings of the First BabyLM Workshop",
58
+ month = nov,
59
+ year = "2025",
60
+ address = "Suzhou, China",
61
+ publisher = "Association for Computational Linguistics",
62
+ url = "https://aclanthology.org/2025.babylm-main.29/",
63
+ pages = "421--435",
64
+ }
65
+ ```
66
  Cite DPO as:
67
 
68
  ```bibtex