Commit 1465e3d

models(gallery): add llama-3.1-storm-8b-q4_k_m (#3270)
Signed-off-by: Ettore Di Giacinto <[email protected]>
1 parent 1651f25 commit 1465e3d

1 file changed: +18 -0 lines changed

gallery/index.yaml

Lines changed: 18 additions & 0 deletions
@@ -371,6 +371,24 @@
     - filename: Fireball-Llama-3.11-8B-v1orpo.Q4_K_M.gguf
       sha256: c61a1f4ee4f05730ac6af754dc8dfddf34eba4486ffa320864e16620d6527731
       uri: huggingface://mradermacher/Fireball-Llama-3.11-8B-v1orpo-GGUF/Fireball-Llama-3.11-8B-v1orpo.Q4_K_M.gguf
+- !!merge <<: *llama31
+  name: "llama-3.1-storm-8b-q4_k_m"
+  icon: https://cdn-uploads.huggingface.co/production/uploads/64c75c1237333ccfef30a602/tmOlbERGKP7JSODa6T06J.jpeg
+  urls:
+    - https://huggingface.co/mudler/Llama-3.1-Storm-8B-Q4_K_M-GGUF
+    - https://huggingface.co/akjindal53244/Llama-3.1-Storm-8B
+  description: |
+    We present the Llama-3.1-Storm-8B model that outperforms Meta AI's Llama-3.1-8B-Instruct and Hermes-3-Llama-3.1-8B models significantly across diverse benchmarks as shown in the performance comparison plot in the next section. Our approach consists of three key steps:
+    - Self-Curation: We applied two self-curation methods to select approximately 1 million high-quality examples from a pool of about 3 million open-source examples. Our curation criteria focused on educational value and difficulty level, using the same SLM for annotation instead of larger models (e.g. 70B, 405B).
+    - Targeted fine-tuning: We performed Spectrum-based targeted fine-tuning over the Llama-3.1-8B-Instruct model. The Spectrum method accelerates training by selectively targeting layer modules based on their signal-to-noise ratio (SNR), and freezing the remaining modules. In our work, 50% of layers are frozen.
+    - Model Merging: We merged our fine-tuned model with the Llama-Spark model using SLERP method. The merging method produces a blended model with characteristics smoothly interpolated from both parent models, ensuring the resultant model captures the essence of both its parents. Llama-3.1-Storm-8B improves Llama-3.1-8B-Instruct across 10 diverse benchmarks. These benchmarks cover areas such as instruction-following, knowledge-driven QA, reasoning, truthful answer generation, and function calling.
+  overrides:
+    parameters:
+      model: llama-3.1-storm-8b-q4_k_m.gguf
+  files:
+    - filename: llama-3.1-storm-8b-q4_k_m.gguf
+      sha256: d714e960211ee0fe6113d3131a6573e438f37debd07e1067d2571298624414a0
+      uri: huggingface://mudler/Llama-3.1-Storm-8B-Q4_K_M-GGUF/llama-3.1-storm-8b-q4_k_m.gguf
 ## Uncensored models
 - !!merge <<: *llama31
   name: "humanish-roleplay-llama-3.1-8b-i1"
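
The new entry pins the model file to a `sha256` digest and a `huggingface://` URI. As a rough illustration of how a reviewer might check a locally downloaded copy against that digest, here is a minimal Python sketch. The helper names (`split_gallery_uri`, `sha256_of`, `matches_gallery_checksum`) are hypothetical and are not part of the gallery tooling. Reading the URI as `owner/repo/filename` is an assumption taken from its visible shape in this diff, not from any documented resolver behavior.

```python
import hashlib
from pathlib import Path

# Values copied verbatim from the gallery entry added in this commit.
EXPECTED_SHA256 = "d714e960211ee0fe6113d3131a6573e438f37debd07e1067d2571298624414a0"
GALLERY_URI = "huggingface://mudler/Llama-3.1-Storm-8B-Q4_K_M-GGUF/llama-3.1-storm-8b-q4_k_m.gguf"


def split_gallery_uri(uri: str) -> tuple[str, str, str]:
    """Split a huggingface:// URI into (owner, repo, filename).

    Hypothetical helper: assumes the owner/repo/filename layout seen in the diff.
    """
    scheme = "huggingface://"
    if not uri.startswith(scheme):
        raise ValueError(f"unsupported URI scheme: {uri}")
    owner, repo, filename = uri[len(scheme):].split("/", 2)
    return owner, repo, filename


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so multi-gigabyte GGUF files are never fully in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_gallery_checksum(path: Path, expected: str = EXPECTED_SHA256) -> bool:
    """Return True when the file's SHA-256 equals the digest pinned in the gallery entry."""
    return sha256_of(path) == expected.lower()
```

Calling `matches_gallery_checksum(Path("llama-3.1-storm-8b-q4_k_m.gguf"))` on a downloaded file would then indicate whether it matches the digest recorded in `gallery/index.yaml`.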
