Can we get a version of this trained on one of the MoE models in the title? They are much easier to run
· Sign up or log in to comment