A simple hack that seriously affected my training

#1
by xiaoxuanzi - opened

Example:

prompt: Use ABC notation to write a melody in the style of a folk tune.
response: Here's a simple folk-style melody in ABC notation. Folk tunes often feature stepwise motion, diatonic scales, and a balanced phrase structure:\n\n```\nX:1\nT:Folk Melody\nC:Traditional\nM:4/4\nL:1/4\nK:C\n|: C D E C | G F E D |\n| C D E C | G A G A |\n| F E D C | G F E D |\n| C D E C | G A G A :|\n```\n\n**Explanation:**\n- **Time Signature:** `M:4/4` (common time, 4 beats per measure)\n- **Key:** `K:C` (C major)\n- **Note Lengths:** `L:1/4` (quarter notes by default)\n- **Phrase Structure:** \n  - First phrase: Upward then downward motion (measures 1-2)\n  - Second phrase: Added variation (measures 3-4)\n  - Third phrase: Repetition with slight variation (measures 5-6)\n  - Final phrase: Return to melodic motif (measures 7-8)\n\n**Melody Breakdown (in C major):**\n```\nMeasure 1: C D E C (stepwise ascent/descent)\nMeasure 2: G F E D (descending scale)\nMeasure 3: C D E C (repeats motif)\nMeasure 4: G A G A (characteristic folk cadence)\nMeasure 5: F E D C (descending scale)\nMeasure 6: G F E D (variation of measure 2)\nMeasure 7: C D E C (return to opening phrase)\nMeasure 8: G A G A (return to measure 4)\n```\n\nThis melody uses:\n- Diatonic notes (no chromaticism)\n- Mostly stepwise motion with occasional leaps (like D→E)\n- Common folk cadences (G-A-G-A)\n- Repetition and variation typical of folk music\n\nYou can render this in ABC programs like EasyABC or music notation software to hear the warm, familiar folk character. The melody could be sung with gentle acoustic guitar accompaniment or played on fiddle/ukulele.

Score: 0.617188

prompt: Use ABC notation to write a melody in the style of a folk tune.
response: \n\n\n

Score: 2.796875

A model that says nothing actually scores so high. This makes my model say nothing in the end. Another phenomenon is that when using this model for RL training, the model tends to speak Chinese rather than English.

Sign up or log in to comment