PromptTuner v0.1

PromptTuner-v0.1 is a fine-tuned gemma-3-270M-it model specifically designed to enhance text prompts for text-to-image models.

This model takes a basic image concept and expands into rich, detailed descriptions including:

  • Visual composition and perspective
  • Artistic style and medium
  • Color palette and lighting
  • Atmosphere and mood
  • Textures and materials
  • Environmental context

The model tries preserves the core intent of your original prompt while adding professional-quality visual descriptors.

Dataset

The model was trained on a curated collection of prompt/magic-prompt pairs.

The dataset underwent extensive cleaning to ensure quality:

  • Removed duplicates
  • Removed prompts consisting only of numbers and spaces
  • Filtered out magic prompts containing error messages or refusal responses
  • Removed magic prompts below quality thresholds
  • Cleaned up quotation marks at prompt boundaries
  • Removed rows with excessively short prompts (length <= 2)
  • Filtered out web links and URLs
  • Removed gibberish inputs
  • Filtered pairs where prompt and magic prompt were too similar

The training dataset was balanced using K-means clustering on prompt embeddings to ensure diverse representation of creative concepts.

Training

Visualize in Weights & Biases

  • Training Method: LoRA
    • Rank: 16
    • Alpha: 32
    • Target modules: q_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj
  • Epochs: 3
  • Batch Size: 16
  • Learning Rate: 2e-4
  • Optimizer: adamw_8bit
  • LR Scheduler: Cosine
  • Warmup Ratio: 0.1
  • Train/Test Split: 90/10

Usage

from transformers import AutoTokenizer, AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("shb777/PromptTuner-v0.1")
tokenizer = AutoTokenizer.from_pretrained("shb777/PromptTuner-v0.1")

SYSTEM_PROMPT = """You are an expert creative director specializing in visual descriptions for image generation.

Your task: Transform the user's concept into a rich, detailed image description while PRESERVING their core idea.

IMPORTANT RULES:
1. Keep ALL key elements (intents, entities) from the original concept
2. Enhance with artistic details, NOT change the fundamental idea
3. Maintain the user's intended subject, action, and setting

You should elaborate on:
- Visual composition and perspective
- Artistic style (photorealistic, impressionist, etc.)
- Color palette and color temperature
- Lighting (golden hour, dramatic shadows, etc.)
- Atmosphere and mood
- Textures and materials
- Technical details (medium, brushwork, rendering style)
- Environmental context (time of day, weather, season, era)
- Level of detail and focus points

Output format: A single, flowing paragraph that reads naturally as an image prompt."""

user_input = "fox, red tail, blue moon, clouds"

messages = [
    {"role": "system", "content": SYSTEM_PROMPT},
    {"role": "user", "content": user_input}
]

prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = tokenizer(prompt, return_tensors="pt")

outputs = model.generate(
    **inputs,
    max_new_tokens=512,
    temperature=1.0,
    top_p=0.95,
    top_k=64
)

enhanced_prompt = tokenizer.decode(outputs[0], skip_special_tokens=True)
print(enhanced_prompt)

Recommended Generation Parameters

You must use the exact system prompt shown above, as the model was trained on it.

  • max_new_tokens: 512
  • temperature: 1.0
  • top_p: 0.95
  • top_k: 64

You can try the model directly at TinkerSpace HF Space.

Limitations

This is only the first version of PromptTuner. As an initial release, the model may:

  • Occasionally lose details and relationships from multi-entity prompts
  • Sometimes introduce stylistic elements and text not present in the original concept

Feedback and suggestions for improvement are welcome.

License

This model is built upon Google's Gemma 3. Please refer to the Gemma license for usage terms.

Citation

If you use this model in your work, please cite:

@model{prompt_tuner_v0.1,
  title={PromptTuner-v0.1: A Fine-tuned Gemma3-270M for Prompt Enhancement},
  author={shb777},
  year={2025},
  url={https://huggingface.co/shb777/PromptTuner-v0.1}
}

Acknowledgments

Downloads last month
98
Safetensors
Model size
0.3B params
Tensor type
F32
·
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for shb777/PromptTuner-v0.1

Finetuned
(17)
this model

Space using shb777/PromptTuner-v0.1 1