suayptalha committed on
Commit 22cdac2 · verified · 1 Parent(s): 84471c2

Update README.md

Files changed (1)
  1. README.md +11 -7
README.md CHANGED
@@ -830,21 +830,21 @@ a:hover .link-arrow {
  <head>
  <meta charset="UTF-8">
  <meta name="viewport" content="width=device-width, initial-scale=1.0">
- <title>L3.3-Nevoria-R1-70b</title>
  <link href="https://fonts.googleapis.com/css2?family=Quicksand:wght@400;500;600&display=swap" rel="stylesheet">
  <link href="styles.css" rel="stylesheet">
  </head>
  <body>
  <div class="container">
  <div class="header">
- <h1>L3.3-Nevoria-R1-70b</h1>
  </div>
  <div class="info">
  <img src="https://cdn-uploads.huggingface.co/production/uploads/64545af5ec40bbbd01242ca6/_oWpsvCZ-graNKzJBBjGo.jpeg" alt="Model banner">
  <div class="creator-section">
  <div class="creator-badge">
  <span class="creator-label">Created by</span>
- <a href="https://huggingface.co/Steelskull" target="_blank" class="creator-link">
  <span class="creator-name">SteelSkull</span>
  <span class="creator-arrow">→</span>
  </a>
@@ -858,7 +858,7 @@ a:hover .link-arrow {
  <div class="model-tags">
  <span class="model-tag">L3.3 = Llama 3.3</span>
  <span class="model-tag">R1 = DeepSeek-R1</span>
- <span class="model-tag">70b Parameters</span>
  </div>
  </div>
  <div class="model-composition">
@@ -873,9 +873,13 @@ a:hover .link-arrow {
  </ul>
  </div>
  <div class="model-description">
- <p>This model builds upon the original Nevoria foundation, incorporating the Deepseek-R1 reasoning architecture to enhance dialogue interaction and scene comprehension. While maintaining Nevoria's core strengths in storytelling and scene description (derived from EVA, EURYALE, and Anubis), this iteration aims to improve prompt adherence and creative reasoning capabilities. The model also retains the balanced perspective introduced by Negative_LLAMA and Nemotron elements. Also, the model plays the card to almost a fault, It'll pick up on minor issues and attempt to run with them. Users had it call them out for misspelling a word while playing in character. </p>
- <p>Note: While Nevoria-R1 represents a significant architectural change, rather than a direct successor to Nevoria, it operates as a distinct model with its own characteristics.</p>
- <p>The lorablated model base choice was intentional, creating unique weight interactions similar to the original <a href="https://huggingface.co/Steelskull/L3-MS-Astoria-70b" target="_blank">Astoria model</a> and <a href="https://huggingface.co/Steelskull/L3.1-MS-Astoria-70b-v2" target="_blank">Astoria V2 model</a>. This "weight twisting" effect, achieved by subtracting the lorablated base model during merging, creates an interesting balance in the model's behavior. While unconventional compared to sequential component application, this approach was chosen for its unique response characteristics.</p>
  </div>
  </div>
  <div class="support-section">
 
  <head>
  <meta charset="UTF-8">
  <meta name="viewport" content="width=device-width, initial-scale=1.0">
+ <title>Maestro-10B</title>
  <link href="https://fonts.googleapis.com/css2?family=Quicksand:wght@400;500;600&display=swap" rel="stylesheet">
  <link href="styles.css" rel="stylesheet">
  </head>
  <body>
  <div class="container">
  <div class="header">
+ <h1>Maestro-10B</h1>
  </div>
  <div class="info">
  <img src="https://cdn-uploads.huggingface.co/production/uploads/64545af5ec40bbbd01242ca6/_oWpsvCZ-graNKzJBBjGo.jpeg" alt="Model banner">
  <div class="creator-section">
  <div class="creator-badge">
  <span class="creator-label">Created by</span>
+ <a href="https://huggingface.co/suayptalha" target="_blank" class="creator-link">
  <span class="creator-name">SteelSkull</span>
  <span class="creator-arrow">→</span>
  </a>
 
  <div class="model-tags">
  <span class="model-tag">L3.3 = Llama 3.3</span>
  <span class="model-tag">R1 = DeepSeek-R1</span>
+ <span class="model-tag">10b Parameters</span>
  </div>
  </div>
  <div class="model-composition">
 
  </ul>
  </div>
  <div class="model-description">
+ Maestro-10B is a 10 billion parameter model built upon the Virtuoso-Lite architecture,
+ enhanced by fine-tuning with 10,000 carefully selected data points from the OpenOrca dataset.
+ This process imbues Maestro with improved capabilities in following instructions and generating
+ creative, contextually relevant responses. While Virtuoso-Lite provides a strong foundation,
+ the OpenOrca fine-tuning specifically targets enhancing the model's ability to understand and
+ respond to complex prompts. This training regimen was chosen to optimize the model's performance
+ in tasks requiring both comprehension and imaginative text generation.
  </div>
  </div>
  <div class="support-section">