aifeifei798 commited on
Commit
26462b5
1 Parent(s): d7401ad

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -0
README.md CHANGED
@@ -12,6 +12,13 @@ tags:
12
  ## mradermacher's superb gguf version, thank you for your conscientious and responsible dedication.
13
  - https://huggingface.co/mradermacher/Phi-3-song-lyrics-1.0-i1-GGUF
14
  - https://huggingface.co/mradermacher/Phi-3-song-lyrics-1.0-GGUF
 
 
 
 
 
 
 
15
 
16
  ![image/png](https://huggingface.co/aifeifei798/Phi-3-song-lyrics-1.0/resolve/main/Phi-3-song-lyrics-1.0.png)
17
 
 
12
  ## mradermacher's superb gguf version, thank you for your conscientious and responsible dedication.
13
  - https://huggingface.co/mradermacher/Phi-3-song-lyrics-1.0-i1-GGUF
14
  - https://huggingface.co/mradermacher/Phi-3-song-lyrics-1.0-GGUF
15
+ ## These are my own quantizations (updated almost daily).
16
+ The difference with normal quantizations is that I quantize the output and embed tensors to f16.
17
+ and the other tensors to 15_k,q6_k or q8_0.
18
+ This creates models that are little or not degraded at all and have a smaller size.
19
+ They run at about 3-6 t/sec on CPU only using llama.cpp
20
+ And obviously faster on computers with potent GPUs
21
+ - the fast cat at [ZeroWw/Phi-3-song-lyrics-1.0-GGUF](https://huggingface.co/ZeroWw/Phi-3-song-lyrics-1.0-GGUF)
22
 
23
  ![image/png](https://huggingface.co/aifeifei798/Phi-3-song-lyrics-1.0/resolve/main/Phi-3-song-lyrics-1.0.png)
24