Update README.md
Browse files
README.md
CHANGED
@@ -3,6 +3,7 @@ license: gemma
|
|
3 |
datasets:
|
4 |
- WasamiKirua/gemma-samantha-neon
|
5 |
- WasamiKirua/gemma-samantha-deepseek
|
|
|
6 |
language:
|
7 |
- it
|
8 |
- en
|
@@ -15,4 +16,50 @@ tags:
|
|
15 |
- eq
|
16 |
- philosophy
|
17 |
- dpo
|
18 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
3 |
datasets:
|
4 |
- WasamiKirua/gemma-samantha-neon
|
5 |
- WasamiKirua/gemma-samantha-deepseek
|
6 |
+
- WasamiKirua/Human-Like-DPO-ita
|
7 |
language:
|
8 |
- it
|
9 |
- en
|
|
|
16 |
- eq
|
17 |
- philosophy
|
18 |
- dpo
|
19 |
+
---
|
20 |
+
|
21 |
+
<img src="https://i.postimg.cc/T2t1v042/temp-Image1-Mnnn-S.avif" alt="cover" border="0" width="862px">
|
22 |
+
|
23 |
+
## Model Overview
|
24 |
+
|
25 |
+
**Samanta-NewGenesis** is an advanced conversational AI model fine-tuned specifically for the Italian language. Built on a **2B parameter** foundation, this model is designed to engage in multi-turn dialogues with a strong emphasis on **emotional intelligence (EQ), philosophical reasoning, and psychological depth**.
|
26 |
+
|
27 |
+
This iteration improves upon its predecessor, **Samanta**, by integrating carefully curated datasets from song lyrics, movie scripts, and philosophical discussions. The goal is to **enhance the model’s ability to understand and respond with nuanced emotional and sentimental depth**.
|
28 |
+
|
29 |
+
## Training Process
|
30 |
+
|
31 |
+
Samanta-NewGenesis has undergone a multi-stage training process to refine its reasoning and interaction capabilities:
|
32 |
+
|
33 |
+
1. **Supervised Fine-Tuning (SFT):**
|
34 |
+
- The base model was fine-tuned using a crafted **Italian multi-turn dataset** focused on **philosophical, psychological, and emotional intelligence topics**.
|
35 |
+
|
36 |
+
2. **Direct Preference Optimization (DPO):**
|
37 |
+
- To enhance **human-like response generation**, **reduce refusals**, and improve alignment, DPO was applied post-SFT, refining the model’s conversational dynamics.
|
38 |
+
|
39 |
+
3. **Expanded Knowledge Base:**
|
40 |
+
- The model was further trained on selected **song lyrics** and **movie scripts**, reinforcing its ability to process and generate **emotionally rich** and **contextually appropriate** responses.
|
41 |
+
|
42 |
+
4. **NSFW Awareness (Optional):**
|
43 |
+
- While primarily designed for EQ and companionship, the model has also been **exposed to NSFW content**. If explicitly instructed, it can generate responses within that domain. However, its core training remains focused on fostering **empathetic and meaningful interactions**, rather than being a **"waifu AI."**
|
44 |
+
|
45 |
+
## Model Limitations
|
46 |
+
|
47 |
+
- **Size Constraints:** At **2B parameters**, the model may exhibit **some limitations in Italian text generation accuracy and coherence**, particularly in highly technical or intricate linguistic structures.
|
48 |
+
- **Bias and Safety:** While trained with safety measures in place, responses may still reflect biases present in the dataset.
|
49 |
+
- **Knowledge Cutoff:** The model’s knowledge is limited to the data it was trained on and may not reflect the most current events or emerging topics.
|
50 |
+
|
51 |
+
## Intended Use
|
52 |
+
|
53 |
+
Samanta-NewGenesis is best suited for:
|
54 |
+
- Conversational AI applications requiring **deep emotional engagement and companionship**.
|
55 |
+
- **Philosophical and psychological discussions**, where nuanced reasoning is needed.
|
56 |
+
- Assisting in creative writing, dialogue generation, and **empathetic AI interactions**.
|
57 |
+
|
58 |
+
## Ethical Considerations
|
59 |
+
|
60 |
+
This model is **not designed for factual accuracy** or for use in **high-stakes decision-making**. Users should be mindful of its limitations and employ human oversight when utilizing responses in critical contexts.
|
61 |
+
|
62 |
+
---
|
63 |
+
|
64 |
+
**Samanta-NewGenesis** represents a leap forward in AI-driven emotional intelligence and companionship. While not flawless, it aims to provide rich, meaningful interactions in the Italian language, blending reasoning, sentiment, and depth into its conversational style.
|
65 |
+
|