lbourdois commited on
Commit
50b3460
·
verified ·
1 Parent(s): 9c587f3

Improve language tag

Browse files

Hi! As the model is multilingual, this is a PR to add other languages than English to the language tag to improve the referencing. Note that 29 languages are announced in the README, but only 13 are explicitly listed. I was therefore only able to add these 13 languages.

Files changed (1) hide show
  1. README.md +119 -105
README.md CHANGED
@@ -1,105 +1,119 @@
1
- ---
2
- license: apache-2.0
3
- base_model:
4
- - Qwen/Qwen2.5-32B
5
- ---
6
-
7
-
8
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/e9B0KVbciDCI9dLm-moEA.png)
9
-
10
- # Qwen2.5-32B-Marigold-v1
11
-
12
- <i>Iterating on Marigold in hopes of another Snowdrop?</i>
13
-
14
- <p><b>Severian's notes</b>: Not going to lie, this model candidate won when I was honestly preferring another one. Doesn't help that we're all coming off a collective Snowdrop high. Most of the model testers liked this over the other candidates as well as v0, but it's not drastic in terms of improvements. A little sloppy, thinking holds better here than v0. The character portrayal is stronger with thinking on. It's slightly unhinged, too.</p>
15
-
16
- <p><b>Has's notes</b>: honestly this one kinda sucks, but then again it was just Mullein v1 data mixture on top of Qwen 32B, so I'm not sure why I was expecting something special</p>
17
-
18
- ## Recommended settings
19
-
20
- <p><b>Context/instruct template</b>: Chatml</p>
21
-
22
- <p><b>Samplers</b>: temperature at 0.9, min_p at 0.05, top_a at 0.3, TFS at 0.75, repetition_penalty at 1.03, DRY if you have access to it.</p>
23
-
24
- A virt-io derivative prompt worked best during our testing, but feel free to use what you like. Seems to perform decent (and better than otherwise) with thinking.
25
-
26
- ## 60+ followers!
27
-
28
- Thanks for the support, y'all.
29
-
30
- ## Thank you!
31
-
32
- Big thanks to the folks in the trashpanda-org discord for testing and sending over some logs!
33
-
34
- ## Reviews
35
-
36
- > PROS:
37
- >
38
- > Most proactive char stance. Even if char himself wasn’t particularly doing anything, their behaviour and inner monologue showed a clear directive and their current goal.
39
- >
40
- > Kept char’s attitude consistent. Also portrayed inner conflict well between duty and personal attachment.
41
- >
42
- > All swipes gave very different outputs. Positive or negative, those were still very different from one another. Definitely most interesting prose, too.
43
- >
44
- > One of the swipes went into action by itself and it was written out great. Stakes were high, sense of danger was present.
45
- >
46
- > CONS:
47
- >
48
- > Jumped to romantic tension a tad bit too quickly. Literally second message and I got hit with a ‘his fingers brushed against their neck’. I’m about to get (possibly) executed and he’s trying to flirt.
49
- >
50
- > Had just some minor odd capitalization like ‘she kNEW’.
51
- >
52
- > Conclusion: Definitely my favorite out of them all. Speaking for user was close to none, every response was very vibrant and kept me engaged (to the point where I actually want to continue rp-ing with it out of personal preference).
53
- > Just very good variety in responses. Barely spoke for user, kept everything in character. Mwuah!
54
-
55
- Sellvene
56
-
57
- > Non-thinking: Best responses so far [out of Marigold v1 candidates]. Not extraordinary but good responses for [being a] small-mid model.
58
- >
59
- > Thinking: Seems like would work nice in smut and felt best among models. But still, seemed a bit too basic for my R1-Snowdrop addicted ass.
60
-
61
- — Carmenta
62
-
63
- > Gacha with rare responses out of 1/4 rolls
64
-
65
- — AIELO
66
-
67
- > I was trying qwq the other day and I keep running into impersonation issue, I was wondering if this because I'm using base model and it wasn't trained with roleplaying data. It thinks, but the issue persist. Y'know, talking and acting as user.
68
- >
69
- > On the contrary, thinking curbs that completely for marigold, it reinforces the model to stick as the character, ignoring user completely. There's still some phrases that describe what the user is doing, but it doesn't get expanded on. There's a small issue with </think> not getting added though, so thinking and actual response doesn't get separated.
70
- >
71
- > The prose, it has some slop here and there, but it's pretty tolerateable honestly.
72
-
73
- Sam
74
-
75
- > It's able to impersonate side chars, and the fact that they're interacting is the better. I feel like it repeats response structure and the slops are juuuust...
76
-
77
- Raihanbook
78
-
79
- > cooks BUT impersonation. I think that 100% is a qwen model issue cause chuluun did it lots. Maybe can be fixed with reasoning...
80
-
81
- Mooth
82
-
83
- > giving me good responses but it ends with slop ahh last paragraph
84
-
85
- Myscell
86
-
87
- ## Some logs
88
-
89
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/-XZEaOxfNdLZCq0KDMJih.png)
90
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/ZUZKU4dBgWJ7vycyHZboj.png)
91
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/iGFA1j8nkbT90leL-5ba8.png)
92
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/-WPqet-WJ8a36VMzTyG42.png)
93
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/g-J3yYKsJso9zJvSm2DxS.png)
94
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/OB18Fp_oDNVFZZrGuTQYT.png)
95
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/TW1cco6CsuTjWSV-r5rKq.png)
96
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/Qowht2AoC8h6J0H-ZuWY9.png)
97
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/XCUiJwJWBeVPmbMPzMNeP.png)
98
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/nFJheZNg2zkLVLoZeHj2J.png)
99
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/xYLRRQow3Dfy-XFffkXDf.png)
100
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/vU2xMyLnckJo-tpJHKueF.png)
101
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/B-KOhj03I5r1vK6ZqRFDH.png)
102
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/hpGYKGkZrojxM1FzHPVzu.png)
103
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/UDUzqzWxu-WXfpTOSo3X2.png)
104
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/sG3WOXg7KStra9jrutyZt.png)
105
- ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/BIUaMjpYPpzWQYcgGZsit.png)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model:
4
+ - Qwen/Qwen2.5-32B
5
+ language:
6
+ - zho
7
+ - eng
8
+ - fra
9
+ - spa
10
+ - por
11
+ - deu
12
+ - ita
13
+ - rus
14
+ - jpn
15
+ - kor
16
+ - vie
17
+ - tha
18
+ - ara
19
+ ---
20
+
21
+
22
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/e9B0KVbciDCI9dLm-moEA.png)
23
+
24
+ # Qwen2.5-32B-Marigold-v1
25
+
26
+ <i>Iterating on Marigold in hopes of another Snowdrop?</i>
27
+
28
+ <p><b>Severian's notes</b>: Not going to lie, this model candidate won when I was honestly preferring another one. Doesn't help that we're all coming off a collective Snowdrop high. Most of the model testers liked this over the other candidates as well as v0, but it's not drastic in terms of improvements. A little sloppy, thinking holds better here than v0. The character portrayal is stronger with thinking on. It's slightly unhinged, too.</p>
29
+
30
+ <p><b>Has's notes</b>: honestly this one kinda sucks, but then again it was just Mullein v1 data mixture on top of Qwen 32B, so I'm not sure why I was expecting something special</p>
31
+
32
+ ## Recommended settings
33
+
34
+ <p><b>Context/instruct template</b>: Chatml</p>
35
+
36
+ <p><b>Samplers</b>: temperature at 0.9, min_p at 0.05, top_a at 0.3, TFS at 0.75, repetition_penalty at 1.03, DRY if you have access to it.</p>
37
+
38
+ A virt-io derivative prompt worked best during our testing, but feel free to use what you like. Seems to perform decent (and better than otherwise) with thinking.
39
+
40
+ ## 60+ followers!
41
+
42
+ Thanks for the support, y'all.
43
+
44
+ ## Thank you!
45
+
46
+ Big thanks to the folks in the trashpanda-org discord for testing and sending over some logs!
47
+
48
+ ## Reviews
49
+
50
+ > PROS:
51
+ >
52
+ > Most proactive char stance. Even if char himself wasn’t particularly doing anything, their behaviour and inner monologue showed a clear directive and their current goal.
53
+ >
54
+ > Kept char’s attitude consistent. Also portrayed inner conflict well between duty and personal attachment.
55
+ >
56
+ > All swipes gave very different outputs. Positive or negative, those were still very different from one another. Definitely most interesting prose, too.
57
+ >
58
+ > One of the swipes went into action by itself and it was written out great. Stakes were high, sense of danger was present.
59
+ >
60
+ > CONS:
61
+ >
62
+ > Jumped to romantic tension a tad bit too quickly. Literally second message and I got hit with a ‘his fingers brushed against their neck’. I’m about to get (possibly) executed and he’s trying to flirt.
63
+ >
64
+ > Had just some minor odd capitalization like ‘she kNEW’.
65
+ >
66
+ > Conclusion: Definitely my favorite out of them all. Speaking for user was close to none, every response was very vibrant and kept me engaged (to the point where I actually want to continue rp-ing with it out of personal preference).
67
+ > Just very good variety in responses. Barely spoke for user, kept everything in character. Mwuah!
68
+
69
+ Sellvene
70
+
71
+ > Non-thinking: Best responses so far [out of Marigold v1 candidates]. Not extraordinary but good responses for [being a] small-mid model.
72
+ >
73
+ > Thinking: Seems like would work nice in smut and felt best among models. But still, seemed a bit too basic for my R1-Snowdrop addicted ass.
74
+
75
+ Carmenta
76
+
77
+ > Gacha with rare responses out of 1/4 rolls
78
+
79
+ AIELO
80
+
81
+ > I was trying qwq the other day and I keep running into impersonation issue, I was wondering if this because I'm using base model and it wasn't trained with roleplaying data. It thinks, but the issue persist. Y'know, talking and acting as user.
82
+ >
83
+ > On the contrary, thinking curbs that completely for marigold, it reinforces the model to stick as the character, ignoring user completely. There's still some phrases that describe what the user is doing, but it doesn't get expanded on. There's a small issue with </think> not getting added though, so thinking and actual response doesn't get separated.
84
+ >
85
+ > The prose, it has some slop here and there, but it's pretty tolerateable honestly.
86
+
87
+ Sam
88
+
89
+ > It's able to impersonate side chars, and the fact that they're interacting is the better. I feel like it repeats response structure and the slops are juuuust...
90
+
91
+ — Raihanbook
92
+
93
+ > cooks BUT impersonation. I think that 100% is a qwen model issue cause chuluun did it lots. Maybe can be fixed with reasoning...
94
+
95
+ — Mooth
96
+
97
+ > giving me good responses but it ends with slop ahh last paragraph
98
+
99
+ — Myscell
100
+
101
+ ## Some logs
102
+
103
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/-XZEaOxfNdLZCq0KDMJih.png)
104
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/ZUZKU4dBgWJ7vycyHZboj.png)
105
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/iGFA1j8nkbT90leL-5ba8.png)
106
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/-WPqet-WJ8a36VMzTyG42.png)
107
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/g-J3yYKsJso9zJvSm2DxS.png)
108
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/OB18Fp_oDNVFZZrGuTQYT.png)
109
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/TW1cco6CsuTjWSV-r5rKq.png)
110
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/Qowht2AoC8h6J0H-ZuWY9.png)
111
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/XCUiJwJWBeVPmbMPzMNeP.png)
112
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/nFJheZNg2zkLVLoZeHj2J.png)
113
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/xYLRRQow3Dfy-XFffkXDf.png)
114
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/vU2xMyLnckJo-tpJHKueF.png)
115
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/B-KOhj03I5r1vK6ZqRFDH.png)
116
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/hpGYKGkZrojxM1FzHPVzu.png)
117
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/UDUzqzWxu-WXfpTOSo3X2.png)
118
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/sG3WOXg7KStra9jrutyZt.png)
119
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/675a77cf99ca23af9daacccc/BIUaMjpYPpzWQYcgGZsit.png)