What is the input resolution?

by wamreyaz - opened Feb 16

Feb 16

I see that inference.yml resizes everything to 800x800 without preserving aspect ratio.

Is this how the model was trained? The paper itself provides quite sparse details on how the model was trained and the docs are not that helpful either.

I say this because the model is confusing the reading order in large (8k x 5k) newspapers for example.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment