What will happen if we train a Q function for digital agents?
HAO BAI
JackBAI
AI & ML interests
Representation learning, language models.
Organizations
models
21
JackBAI/tti-new
Updated
JackBAI/webvoyager_allresults
Updated
JackBAI/short-to-long
Updated
JackBAI/aitw-general-digiq-agent
Updated
JackBAI/aitw-webshop-digiq-agent
Updated
JackBAI/llava-v1.5-7b-sfted-pad-inputtext
Updated
JackBAI/CRATE-GPT-12L-Pile-600000steps
Updated
JackBAI/webshop-off2on-filteredbc
Updated
JackBAI/general-off2on-filteredbc
Updated
JackBAI/general-off2on-digirl
Updated
datasets
8
JackBAI/tinytlp
Viewer
•
Updated
•
30k
•
156
JackBAI/eval_data
Viewer
•
Updated
•
9.64k
•
19
JackBAI/autoui-zeroshot-trajectories
Preview
•
Updated
•
14
JackBAI/pile_uncopyrighted_bin
Updated
•
8
JackBAI/bert_pretrain_datasets
Viewer
•
Updated
•
80.5M
•
76
•
1
JackBAI/redbajama-sampled
Viewer
•
Updated
•
24.3M
•
616
JackBAI/merged_roberta_dataset
Updated
•
8
JackBAI/chatgpt-woi-finetune
Preview
•
Updated
•
39
•
3