Update README.md
## Model Description

This model was created by training LoRAs ([rsLoRA](https://arxiv.org/abs/2312.03732) with [NEFTune](https://arxiv.org/abs/2310.05914) noise alpha set to 5) and then combining them with a [DELLA merge](https://arxiv.org/abs/2406.11617). This approach saves space and training time, and the result is good. The focus here is on the WhiteRabbitNeo datasets, along with coding.
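
As an illustration only (not the exact recipe used for this model), a DELLA merge of several fine-tuned checkpoints can be expressed as a mergekit config. All model paths and parameter values below are hypothetical:

```yaml
# Hypothetical mergekit config sketching a DELLA merge.
# Checkpoint paths and all numbers are illustrative, not the actual recipe.
models:
  - model: ./danube-lora-coding        # hypothetical LoRA-tuned checkpoint
    parameters:
      weight: 0.5
      density: 0.6                     # fraction of delta parameters kept
  - model: ./danube-lora-whiterabbit   # hypothetical LoRA-tuned checkpoint
    parameters:
      weight: 0.5
      density: 0.6
merge_method: della
base_model: ./danube-base              # hypothetical base model path
parameters:
  epsilon: 0.05                        # magnitude-based drop-probability range
  lambda: 1.0                          # rescaling factor for merged deltas
dtype: bfloat16
```

DELLA's magnitude-aware pruning of the task vectors is what lets several separately trained LoRAs be folded into one model without retraining from scratch, which is where the space and time savings come from.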

Incidentally, it seems uncensored. It was trained using the ChatML template and can be used with or without a system prompt.
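
For reference, a ChatML-formatted prompt looks like the following (the system block is optional, as noted above; the message contents are just placeholders):

```
<|im_start|>system
You are a helpful assistant.<|im_end|>
<|im_start|>user
Hello!<|im_end|>
<|im_start|>assistant
```

Generation should stop at the `<|im_end|>` token emitted by the model.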

I am very grateful for all the different components that made this model possible: H2O and their danube models, Hugging Face and Llama-Factory for making fine-tuning easy, and all the great dataset creators. **Thank you!**
## Apache-2.0 + WhiteRabbitNeo Extended Version
### Licence: Usage Restrictions