speculative decoding does not work

#1
by festr2 - opened

running this model is broken when speculative mode is enabled (sglang)

--speculative-algorithm EAGLE \
--speculative-num-steps 3 \
--speculative-eagle-topk 1 \
--speculative-num-draft-tokens 4

I have tested fp8 version

Sign up or log in to comment