Post
228
I wrote this article to explain the difference between vision token and text token. They are apples and oranges, but also the source of compression efficiency of DeepSeek OCR (don't forget Glyph by THUDM!)
https://huggingface.co/blog/onekq/behind-each-token
I am running experiment with DeepSeek OCR BTW
https://huggingface.co/blog/onekq/behind-each-token
I am running experiment with DeepSeek OCR BTW