---
license: apache-2.0
tags:
- inbora studio
- drchamyoung
- Neural Network
- DLL
- Deep ONNX
- Behaviour Agents
---
# Xieral Code Gen 3B

Xieral Code Gen 3B is a decoder-only language model with 2.7 billion parameters. Developed from `Xieral-Code-Gen-3b`, it is designed specifically for code generation and software engineering tasks.

## Model Overview

- **Architecture**: Decoder-only language model
- **Parameters**: 2.7 billion
- **Training Data**: Combination of publicly available and synthetic datasets
- **Optimization**: Direct Preference Optimization (DPO)
- **Fine-tuning**: General code/software engineering conversations; SQL query generation and discussion
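
For readers unfamiliar with Direct Preference Optimization, the per-pair objective can be sketched as below. This is a minimal illustration of the standard DPO loss, not the training code used for this model; the function name, the argument names, and the default `beta` value are assumptions made for the example.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard per-pair DPO loss: -log(sigmoid(beta * margin)).

    Each argument is the summed log-probability of a full response under
    either the policy being trained or the frozen reference model.
    `beta` (hypothetical default) scales how strongly the policy is pushed
    away from the reference.
    """
    # How much more the policy prefers the chosen response than the
    # reference does, minus the same quantity for the rejected response.
    margin = ((policy_chosen_logp - ref_chosen_logp)
              - (policy_rejected_logp - ref_rejected_logp))
    z = beta * margin
    # Numerically stable softplus(-z), which equals -log(sigmoid(z)).
    return max(-z, 0.0) + math.log1p(math.exp(-abs(z)))
```

The loss equals `log(2)` when the policy and reference agree on both responses, and it falls as the policy assigns relatively more probability to the chosen response.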

## Performance

Xieral Code Gen 3B has demonstrated competitive performance compared to other models of similar size:

- **MultiPL-E Metrics**: Evaluated across various programming languages using the BigCode Evaluation Harness.
- **MT Bench**: Shows strong results on code-related tasks.
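
MultiPL-E results are commonly reported as pass@k. This card does not say which metric was used, so the sketch below assumes pass@k and shows the usual unbiased estimator for a single problem; the function name is illustrative.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one problem.

    n: number of samples generated for the problem
    c: number of those samples that pass the unit tests
    k: sample budget being evaluated (1 <= k <= n)
    """
    if not (0 <= c <= n) or not (1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    # If fewer than k samples fail, every size-k subset contains a pass.
    if n - c < k:
        return 1.0
    # 1 minus the probability that all k drawn samples are failures.
    return 1.0 - comb(n - c, k) / comb(n, k)
```

A benchmark score is then the mean of `pass_at_k` over all problems in a given language.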

## Usage

This model is well-suited for:

- General code/software engineering conversations
- SQL query generation and discussion
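
As a rough sketch of the SQL use case, the snippet below builds a prompt and generates a completion with Hugging Face Transformers. It assumes the checkpoint is published in a Transformers-compatible format, which this card does not state. The prompt layout, the helper names, and the `model_id` argument are hypothetical; substitute the real repository id and any documented prompt format.

```python
def build_sql_prompt(schema: str, question: str) -> str:
    """Assemble a plain-text prompt (hypothetical layout, not an official template)."""
    return (
        "Given the following SQL schema:\n"
        f"{schema.strip()}\n\n"
        f"Write a SQL query that answers: {question.strip()}\n"
    )


def generate(prompt: str, model_id: str, max_new_tokens: int = 128) -> str:
    """Generate text for `prompt`; `model_id` must be supplied by the caller."""
    # Imported lazily so the prompt helper works without these packages.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    device = "cuda" if torch.cuda.is_available() else "cpu"
    # Half precision on GPU to reduce memory use; full precision on CPU.
    dtype = torch.float16 if device == "cuda" else torch.float32

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=dtype).to(device)

    inputs = tokenizer(prompt, return_tensors="pt").to(device)
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # The decoded string contains the prompt followed by the completion.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

A call might look like `generate(build_sql_prompt(schema, question), model_id="<repository id>")`.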

### Requirements

To run Xieral Code Gen 3B locally, you will need:

- **VRAM**: 8 GB or more of GPU memory
- **Dependencies**: The libraries listed in `requirements.txt` (see Installation)
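
To see where a VRAM figure of this order comes from, the weights alone can be estimated from the parameter count. The sketch below is back-of-the-envelope arithmetic; the card does not say which precision the 8 GB guidance assumes, so the data types listed are assumptions.

```python
PARAMS = 2.7e9  # parameter count stated in this card

# Bytes needed to store one parameter at common precisions (assumed options).
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "int8": 1}


def weight_memory_gib(num_params: float, dtype: str) -> float:
    """Memory for the weights alone, in GiB (excludes activations and KV cache)."""
    return num_params * BYTES_PER_PARAM[dtype] / 2**30


if __name__ == "__main__":
    for dtype in BYTES_PER_PARAM:
        print(f"{dtype}: {weight_memory_gib(PARAMS, dtype):.2f} GiB")
```

At 16-bit precision the weights take roughly 5 GiB, which leaves headroom within 8 GB for activations and the key-value cache. Full 32-bit weights would exceed 8 GB on their own.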

## Installation

To install the required dependencies, use:

```bash
pip install -r requirements.txt