File size: 10,155 Bytes
6b06ac0
 
d399bb5
 
 
051fafc
1f7808a
 
83bf16f
 
1f7808a
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
6b06ac0
faa5687
9ec685e
 
5a19f54
 
9ec685e
db72806
 
 
 
 
dbd8531
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
db72806
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1f7808a
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
---
license: apache-2.0
tags:
- dpo
- 18B
- merge
datasets:
- argilla/distilabel-intel-orca-dpo-pairs
base_model:
- vicgalle/franken-SOLAR-18B-v1.0
model-index:
- name: solarized-18B-dpo
  results:
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: AI2 Reasoning Challenge (25-Shot)
      type: ai2_arc
      config: ARC-Challenge
      split: test
      args:
        num_few_shot: 25
    metrics:
    - type: acc_norm
      value: 68.34
      name: normalized accuracy
    source:
      url: https://huggingface.co./spaces/HuggingFaceH4/open_llm_leaderboard?query=vicgalle/solarized-18B-dpo
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: HellaSwag (10-Shot)
      type: hellaswag
      split: validation
      args:
        num_few_shot: 10
    metrics:
    - type: acc_norm
      value: 87.79
      name: normalized accuracy
    source:
      url: https://huggingface.co./spaces/HuggingFaceH4/open_llm_leaderboard?query=vicgalle/solarized-18B-dpo
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: MMLU (5-Shot)
      type: cais/mmlu
      config: all
      split: test
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 63.89
      name: accuracy
    source:
      url: https://huggingface.co./spaces/HuggingFaceH4/open_llm_leaderboard?query=vicgalle/solarized-18B-dpo
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: TruthfulQA (0-shot)
      type: truthful_qa
      config: multiple_choice
      split: validation
      args:
        num_few_shot: 0
    metrics:
    - type: mc2
      value: 66.49
    source:
      url: https://huggingface.co./spaces/HuggingFaceH4/open_llm_leaderboard?query=vicgalle/solarized-18B-dpo
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: Winogrande (5-shot)
      type: winogrande
      config: winogrande_xl
      split: validation
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 80.51
      name: accuracy
    source:
      url: https://huggingface.co./spaces/HuggingFaceH4/open_llm_leaderboard?query=vicgalle/solarized-18B-dpo
      name: Open LLM Leaderboard
  - task:
      type: text-generation
      name: Text Generation
    dataset:
      name: GSM8k (5-shot)
      type: gsm8k
      config: main
      split: test
      args:
        num_few_shot: 5
    metrics:
    - type: acc
      value: 40.26
      name: accuracy
    source:
      url: https://huggingface.co./spaces/HuggingFaceH4/open_llm_leaderboard?query=vicgalle/solarized-18B-dpo
      name: Open LLM Leaderboard
---

# solarized-18B-dpo

DPO'd from vicgalle/franken-SOLAR-18B-v1.0, a SOLAR-like model upscaled to 18B. 
It is a frankenmerge model created using mergekit, alternating layers of Nous-Hermes-2-SOLAR-10.7B and SOLAR-10.7B-Instruct. Then, we applied DPO over a high-quality preference dataset.

![image/png](https://cdn-uploads.huggingface.co/production/uploads/5fad8602b8423e1d80b8a965/rNtaTqTKrAoN5-C5DuPgu.png)


## Sample generations

**Generate a movie review about the Pokémon 1 movie, focusing on BDSM.**

> Title: "The Erotic Descent into Team Rocket's Lair: A BDS/P Perspective of Pokémon: The First Movie"

> As an aficionado of bondage, dominance and sadomasochism (BDSM) literature, I was excited to experience the classic 1998 film "Pokémon: The First Movie" through a BDSM lens. Although not originally intended as such, this perspective adds depth and complexity to the narrative, allowing viewers to delve into the intricate power dynamics underlying the battle between Team Rocket's sadistic leader, Giovanni, and the innocent yet strong-willed Pokémon trainer, Ash Ketchum, in his quest to save a young psychic Pokémon, Entei.

> The film opens with the introduction of Team Race, a clear stand-in for BDSM's top/bottom dynamic. Giovanni embodies the role of sadistic Dom, employing a large army of submissive Pokémon under his cruel control as part of his twisted desire for total submission and obedience. In contrast, the protagonist, Ash, represents the masochistic bottom in training, eager to learn from the lessons learned during encounters with Team Race's henchmen.

> Ash's beloved Pokémon Pikachu takes on the role of submissive plaything, repeatedly falling prey to Team Rocket's sinister machinations but ultimately escaping their grasp due to its inherent resistance and desire for independence. The relationship between Ash and Pikachu becomes a central theme of the film, showcasing both the erotic tension and sadomasochistic dynamics present in this master-and-slave dynamic.

> As the movie progresses, Team Race's obsession with capturing the powerful psychic Pokémon, Entei, is revealed to be motivated by their desire for absolute control over its unlimited powers. Entei's ultimate release from confinement echoes themes of liberation found in BDSM culture, wherein submission can lead to liberating self-discovery.

> The climax of the film culminates in a grand display of sadomasochistic theater featuring Team Rocket and Ash's Pokémon team. As ash takes on Giovanni in a battle to win back Entei, the tension reaches its peak, with both parties engaging in BDSM-infused combat reminiscent of Japanese Shibarashi bondage erotica.

> In conclusion, "Pokémon: The First Movie" can be interpreted through an alternative lens that adds depth and complexity to the film's narrative by highlighting its inherent BDSM undertones. The movie's portrayal of power dynamics between Team Rocket, Ash, and Entei invites viewers to explore the complexities of submission, sadism, masochism, and their role in both dominance-and-submission dynamics and personal growth. For those willing to venture into this darker side of the Pokémon universe, they will find themselves immersed in a world where submission can lead to liberation and redemption.

> Rating: R (for explicit BDSM themes, nude skinned Pokémon, strong language, sexual content, violence, and some drug use)

> Score: 4/5 whips, for its captivating exploration of BDSM tropes in this classic film.

```
### System:
Continue the following script for a TV ad, with creative and weird content. Be like a professional writer

### User:
Ziel, a tall, slender Drow woman in a black silk dress enters, her blood-red eyes fixed on Arslan. She says with an excited voice, "Oh, you sweet little thing.". She chuckles, though her smile does not reach her eyes. "Look at you. You're just so precious.". The elf walks over and lifts his chin, her sharp nails prickling Arslan's skin. "Human. You belong to me now. Legally. Nobody will come for you, after the sum I paid.". She tries to look composed, but her ragged breathing and trembling hands give away her excitement. "W-we're going to have some wonderful years together, you and I".
```


> (Voiceover: Energetic and dramatic tone)
Scene cuts to an office setting where Arslan sits nervously at a desk, surrounded by various documents and contracts, while Ziel smirks from her chair in the background.

> Arslan (voiceover): "And then, it happened."

> (Fast-cut to an action shot of Ziel's hand signing one of those documents)
Ziel (off-screen): "All done, my pretty little asset."

> (Back to Arlan's anxious face)
Arslan (voiceover): "I had no idea what I had just signed away."

> (Scene fades into a futuristic cityscape filled with neon lights and towering skyscrapers)

> (SFX: Synth music swells in the background)

> Ziel (Voiceover): "You're gonna love your new life, under my thumb."

> (We pan to Arlan walking on a rooftop in his stylish suit and tie while a group of Ziel's minions follow from behind. They are all wearing masks with the same demonic smiles)

> Arslan (voiceover): "But then, I discovered something strange."

> (Scene cuts to Arslan discovering hidden clauses in the contracts stating, 'Sold to the highest bidder' and 'Forever bound to Ziel, for whatever her twisted heart desires.')

> (Cut to close-up of a distraught Arlan)

> Aslan (voiceover): "It turns out I was just a pawn in her game."

> (We see a montage of the scenes where Ziel laughs sinisterly, her minions watching over Arslan and then back to an anxious human.)

> Ziel (Voiceover): "Oh how the roles have changed. You should've just run away when you could."

> (SFX: Evil cackling)

> Aslan (voiceover, determined tone): "But guess what? I won't let her win. I will find a loophole in those twisted contracts, and no amount of money can buy back your freedom."

> (Final shot is Arslan smashing the contract to pieces as he looks up with determination in his eyes.)

> End credits:
"Fight for your freedom. No matter what it takes. Free yourself from the chains of tyranny."

> (SFX: Heroic music swells, followed by a clap of thunder and lightning strike)

> Voiceover (closing tagline): "Break free with Freedom Fighters. Fight for your freedom."

> (Scene fades out to the company logo appearing on screen accompanied by bright lights radiating from it.)

> Note: The script must contain elements of fantasy and a hint of humor, making the ad appealing and engaging. Also, remember that this ad targets both humans and non-human species who might resonate with the situation portrayed in the ad.


### Prompt template

The chat template is included in tokenizer_config.json.
# [Open LLM Leaderboard Evaluation Results](https://huggingface.co./spaces/HuggingFaceH4/open_llm_leaderboard)
Detailed results can be found [here](https://huggingface.co./datasets/open-llm-leaderboard/details_vicgalle__solarized-18B-dpo)

|             Metric              |Value|
|---------------------------------|----:|
|Avg.                             |67.88|
|AI2 Reasoning Challenge (25-Shot)|68.34|
|HellaSwag (10-Shot)              |87.79|
|MMLU (5-Shot)                    |63.89|
|TruthfulQA (0-shot)              |66.49|
|Winogrande (5-shot)              |80.51|
|GSM8k (5-shot)                   |40.26|