Spaces:
Running
on
Zero
Running
on
Zero
update docs
Browse filesits actually called axial rope (in 2d)
app.py
CHANGED
|
@@ -67,7 +67,7 @@ def compute_dynamic_size(height, width, max_dim: int = 720, patch_size: int = 16
|
|
| 67 |
|
| 68 |
|
| 69 |
def load_model(name):
|
| 70 |
-
"""Load model with
|
| 71 |
global processor, model
|
| 72 |
|
| 73 |
cleanup_memory()
|
|
@@ -91,7 +91,7 @@ load_model(DEFAULT_NAME)
|
|
| 91 |
def preprocess_image(img):
|
| 92 |
"""
|
| 93 |
Custom preprocessing that respects aspect ratio & uses dynamic sizing.
|
| 94 |
-
DINOv3's
|
| 95 |
"""
|
| 96 |
# Convert to RGB if needed
|
| 97 |
if img.mode != "RGB":
|
|
@@ -359,7 +359,7 @@ with gr.Blocks(
|
|
| 359 |
Satellite-pretrained models are intended for: geographic patterns, land use classification. structural analysis, etc. Try comparing similarity maps for the same image created by the model pretrained on sat493m vs. the one on lvd1689m (<i>general web</i>).
|
| 360 |
<br><br>
|
| 361 |
<b>Dynamic Resolution:</b> Images are processed at up to {MAX_IMAGE_DIM}px (longer side) while preserving aspect ratio.
|
| 362 |
-
DINOv3's
|
| 363 |
<br>
|
| 364 |
</div>
|
| 365 |
"""
|
|
|
|
| 67 |
|
| 68 |
|
| 69 |
def load_model(name):
|
| 70 |
+
"""Load model with dtype"""
|
| 71 |
global processor, model
|
| 72 |
|
| 73 |
cleanup_memory()
|
|
|
|
| 91 |
def preprocess_image(img):
|
| 92 |
"""
|
| 93 |
Custom preprocessing that respects aspect ratio & uses dynamic sizing.
|
| 94 |
+
DINOv3's 2D axial RoPE handles variable sizes, no need to force 224x224
|
| 95 |
"""
|
| 96 |
# Convert to RGB if needed
|
| 97 |
if img.mode != "RGB":
|
|
|
|
| 359 |
Satellite-pretrained models are intended for: geographic patterns, land use classification. structural analysis, etc. Try comparing similarity maps for the same image created by the model pretrained on sat493m vs. the one on lvd1689m (<i>general web</i>).
|
| 360 |
<br><br>
|
| 361 |
<b>Dynamic Resolution:</b> Images are processed at up to {MAX_IMAGE_DIM}px (longer side) while preserving aspect ratio.
|
| 362 |
+
DINOv3's 2D axial RoPE embeddings handle variable sizes.
|
| 363 |
<br>
|
| 364 |
</div>
|
| 365 |
"""
|