WasamiKirua committed on
Commit d67d266 · verified · 1 Parent(s): 5f35881

Update README.md

Files changed (1)
  1. README.md +48 -1
README.md CHANGED
@@ -3,6 +3,7 @@ license: gemma
  datasets:
  - WasamiKirua/gemma-samantha-neon
  - WasamiKirua/gemma-samantha-deepseek
  language:
  - it
  - en
@@ -15,4 +16,50 @@ tags:
  - eq
  - philosophy
  - dpo
- ---
  datasets:
  - WasamiKirua/gemma-samantha-neon
  - WasamiKirua/gemma-samantha-deepseek
+ - WasamiKirua/Human-Like-DPO-ita
  language:
  - it
  - en

  - eq
  - philosophy
  - dpo
+ ---
+
+ <img src="https://i.postimg.cc/T2t1v042/temp-Image1-Mnnn-S.avif" alt="cover" border="0" width="862px">
+
+ ## Model Overview
+
+ **Samanta-NewGenesis** is a conversational AI model fine-tuned specifically for the Italian language. Built on a **2B-parameter** foundation, it is designed for multi-turn dialogue with a strong emphasis on **emotional intelligence (EQ), philosophical reasoning, and psychological depth**.
+
+ This iteration improves on its predecessor, **Samanta**, by integrating carefully curated datasets drawn from song lyrics, movie scripts, and philosophical discussions. The goal is to **enhance the model’s ability to understand and respond with nuanced emotional and sentimental depth**.
+
+ ## Training Process
+
+ Samanta-NewGenesis went through a multi-stage training process to refine its reasoning and interaction capabilities:
+
+ 1. **Supervised Fine-Tuning (SFT):**
+ - The base model was fine-tuned on a crafted **Italian multi-turn dataset** focused on **philosophical, psychological, and emotional intelligence topics**.
+
+ 2. **Direct Preference Optimization (DPO):**
+ - To enhance **human-like response generation**, **reduce refusals**, and improve alignment, DPO was applied after SFT, refining the model’s conversational dynamics.
+
+ 3. **Expanded Knowledge Base:**
+ - The model was further trained on selected **song lyrics** and **movie scripts**, reinforcing its ability to process and generate **emotionally rich** and **contextually appropriate** responses.
+
+ 4. **NSFW Awareness (Optional):**
+ - While primarily designed for EQ and companionship, the model has also been **exposed to NSFW content**. If explicitly instructed, it can generate responses within that domain. However, its core training remains focused on fostering **empathetic and meaningful interactions**, rather than on being a **"waifu AI."**
+
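For readers unfamiliar with the DPO stage listed above, the per-example objective can be sketched as follows. This is a minimal illustration of the standard DPO loss, not the author's training code: the function name `dpo_loss`, the argument names, and the default `beta=0.1` are assumptions made for this example, and the inputs are summed log-probabilities of a full response under the fine-tuned policy and under a frozen reference model.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one preference pair (illustrative sketch).

    Each argument is the summed log-probability of a whole response.
    `beta` (hypothetical default) scales how strongly the policy is
    pushed away from the reference model.
    """
    # How much more the policy likes each response than the reference does.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp
    margin = beta * (chosen_logratio - rejected_logratio)

    # loss = -log(sigmoid(margin)) = log(1 + exp(-margin)),
    # computed in a numerically stable way for either sign of margin.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))
```

When the policy and the reference assign identical log-probabilities, the margin is zero and the loss equals `log(2)`; the loss falls below that value as the policy shifts probability toward the preferred response and rises above it when the policy favors the rejected one.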
45
+ ## Model Limitations
46
+
47
+ - **Size Constraints:** At **2B parameters**, the model may exhibit **some limitations in Italian text generation accuracy and coherence**, particularly in highly technical or intricate linguistic structures.
48
+ - **Bias and Safety:** While trained with safety measures in place, responses may still reflect biases present in the dataset.
49
+ - **Knowledge Cutoff:** The model’s knowledge is limited to the data it was trained on and may not reflect the most current events or emerging topics.
50
+
+ ## Intended Use
+
+ Samanta-NewGenesis is best suited for:
+ - Conversational AI applications requiring **deep emotional engagement and companionship**.
+ - **Philosophical and psychological discussions**, where nuanced reasoning is needed.
+ - Assisting in creative writing, dialogue generation, and **empathetic AI interactions**.
+
+ ## Ethical Considerations
+
+ This model is **not designed for factual accuracy** or for use in **high-stakes decision-making**. Users should be mindful of its limitations and apply human oversight when using its responses in critical contexts.
+
+ ---
+
+ **Samanta-NewGenesis** represents a leap forward in AI-driven emotional intelligence and companionship. While not flawless, it aims to provide rich, meaningful interactions in the Italian language, blending reasoning, sentiment, and depth in its conversational style.
+