pszemraj commited on
Commit
be45a3f
·
verified ·
1 Parent(s): 169b580

update docs

Browse files

its actually called axial rope (in 2d)

Files changed (1) hide show
  1. app.py +3 -3
app.py CHANGED
@@ -67,7 +67,7 @@ def compute_dynamic_size(height, width, max_dim: int = 720, patch_size: int = 16
67
 
68
 
69
  def load_model(name):
70
- """Load model with CORRECT dtype"""
71
  global processor, model
72
 
73
  cleanup_memory()
@@ -91,7 +91,7 @@ load_model(DEFAULT_NAME)
91
  def preprocess_image(img):
92
  """
93
  Custom preprocessing that respects aspect ratio & uses dynamic sizing.
94
- DINOv3's 3D RoPE handles variable sizes, no need to force 224x224
95
  """
96
  # Convert to RGB if needed
97
  if img.mode != "RGB":
@@ -359,7 +359,7 @@ with gr.Blocks(
359
  Satellite-pretrained models are intended for: geographic patterns, land use classification. structural analysis, etc. Try comparing similarity maps for the same image created by the model pretrained on sat493m vs. the one on lvd1689m (<i>general web</i>).
360
  <br><br>
361
  <b>Dynamic Resolution:</b> Images are processed at up to {MAX_IMAGE_DIM}px (longer side) while preserving aspect ratio.
362
- DINOv3's 3D RoPE embeddings handle variable sizes.
363
  <br>
364
  </div>
365
  """
 
67
 
68
 
69
  def load_model(name):
70
+ """Load model with dtype"""
71
  global processor, model
72
 
73
  cleanup_memory()
 
91
  def preprocess_image(img):
92
  """
93
  Custom preprocessing that respects aspect ratio & uses dynamic sizing.
94
+ DINOv3's 2D axial RoPE handles variable sizes, no need to force 224x224
95
  """
96
  # Convert to RGB if needed
97
  if img.mode != "RGB":
 
359
  Satellite-pretrained models are intended for: geographic patterns, land use classification. structural analysis, etc. Try comparing similarity maps for the same image created by the model pretrained on sat493m vs. the one on lvd1689m (<i>general web</i>).
360
  <br><br>
361
  <b>Dynamic Resolution:</b> Images are processed at up to {MAX_IMAGE_DIM}px (longer side) while preserving aspect ratio.
362
+ DINOv3's 2D axial RoPE embeddings handle variable sizes.
363
  <br>
364
  </div>
365
  """