aifeifei798
/

Phi-3-song-lyrics-1.0

aifeifei798 committed on Jul 11

Commit

26462b5

1 Parent(s): d7401ad

Upload README.md

Files changed (1)

README.md +7 -0

@@ -12,6 +12,13 @@ tags:
 ## mradermacher's superb gguf version, thank you for your conscientious and responsible dedication.
  - https://huggingface.co/mradermacher/Phi-3-song-lyrics-1.0-i1-GGUF
  - https://huggingface.co/mradermacher/Phi-3-song-lyrics-1.0-GGUF
+## These are my own quantizations (updated almost daily).
+The difference from normal quantizations is that I quantize the output and embedding tensors to f16,
+and the other tensors to q5_k, q6_k, or q8_0.
+This creates models that show little or no degradation and have a smaller size.
+They run at about 3-6 t/sec on CPU only using llama.cpp,
+and obviously faster on computers with powerful GPUs.
+- the fast cat at [ZeroWw/Phi-3-song-lyrics-1.0-GGUF](https://huggingface.co/ZeroWw/Phi-3-song-lyrics-1.0-GGUF)
 ![image/png](https://huggingface.co/aifeifei798/Phi-3-song-lyrics-1.0/resolve/main/Phi-3-song-lyrics-1.0.png)
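
The quantization recipe described in the added README lines can be sketched as a small command builder. This is only an illustration under stated assumptions, not the exact procedure used for the ZeroWw files: it assumes llama.cpp's `llama-quantize` tool (named `quantize` in older builds) with its `--output-tensor-type` and `--token-embedding-type` options, it reads the README's `15_k` as `q5_k`, and the file names are hypothetical placeholders. The helper only builds the command line and does not run it.

```python
import shlex

# Quantization types named in the README text.
# Assumption: the README's "15_k" is a typo for "q5_k".
BODY_TYPES = ("q5_k", "q6_k", "q8_0")


def build_quantize_command(src, dst, body_type, binary="llama-quantize"):
    """Return the argv list for one llama.cpp quantization run.

    The output and token-embedding tensors are kept at f16, and every
    other tensor is quantized to `body_type`.
    """
    if body_type not in BODY_TYPES:
        raise ValueError(f"unsupported type: {body_type}")
    return [
        binary,
        "--output-tensor-type", "f16",
        "--token-embedding-type", "f16",
        src,
        dst,
        body_type.upper(),  # passed in upper case, e.g. Q6_K
    ]


if __name__ == "__main__":
    # Hypothetical file names; replace them with real paths before use.
    for body_type in BODY_TYPES:
        cmd = build_quantize_command(
            "model.f16.gguf", f"model.f16.{body_type}.gguf", body_type
        )
        print(shlex.join(cmd))
```

Running the script prints one command per target type, which can then be pasted into a shell where llama.cpp is built. Keeping the two largest-impact tensors at f16 is the design choice the README describes; the remaining tensors carry most of the size reduction.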