How to change the output embedding dimensions?

by ly400929 - opened Jun 6

Jun 6

When I use vllm to run the model and want to set the output dimension to 256, an error occurs:

Jun 6

It is truncation, not pooling.

deleted

Sep 15

Use truncate_dim=x when creating the model:

model = SentenceTransformer("Qwen/Qwen3-Embedding-0.6B", truncate_dim=512)

26 days ago

The is_matryoshka parameter is missing from config.json. You need to start vllm with hf_overrides={"is_matryoshka": True}.
Reference: vLLM docs

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment