CHZY-1 commited on
Commit
5a184fc
1 Parent(s): f204fbe

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +19 -11
README.md CHANGED
@@ -6,31 +6,39 @@ tags:
6
  - trl
7
  - sft
8
  - generated_from_trainer
 
 
 
 
9
  model-index:
10
  - name: sqlcoder-7b-2_FineTuned_PEFT_QLORA_adapter_alpha_r_32
11
  results: []
 
 
12
  ---
13
 
14
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
15
- should probably proofread and complete it, then remove this comment. -->
16
 
17
  # sqlcoder-7b-2_FineTuned_PEFT_QLORA_adapter_alpha_r_32
18
 
19
- This model is a fine-tuned version of [defog/sqlcoder-7b-2](https://huggingface.co/defog/sqlcoder-7b-2) on an unknown dataset.
20
-
21
- ## Model description
22
-
23
- More information needed
24
 
25
  ## Intended uses & limitations
26
 
27
- More information needed
28
 
29
- ## Training and evaluation data
30
 
31
- More information needed
 
 
 
 
 
32
 
33
- ## Training procedure
 
 
 
34
 
35
  ### Training hyperparameters
36
 
 
6
  - trl
7
  - sft
8
  - generated_from_trainer
9
+ - QLora
10
+ - peft
11
+ - SQL
12
+ - causal-lm
13
  model-index:
14
  - name: sqlcoder-7b-2_FineTuned_PEFT_QLORA_adapter_alpha_r_32
15
  results: []
16
+ language:
17
+ - en
18
  ---
19
 
 
 
20
 
21
  # sqlcoder-7b-2_FineTuned_PEFT_QLORA_adapter_alpha_r_32
22
 
23
+ This model is a fine-tuned version of [defog/sqlcoder-7b-2](https://huggingface.co/defog/sqlcoder-7b-2) on 260 MS SQL examples (Task, Schema and Answer triplets) related to financial/banking domain.
 
 
 
 
24
 
25
  ## Intended uses & limitations
26
 
27
+ MS SQL Server - SQL Query Generation
28
 
29
+ ## Training
30
 
31
+ This model was trained using the QLoRA method with the following configurations:
32
+ - r = 64,
33
+ - lora_alpha = 32
34
+ - lora_dropout = 0.05
35
+ - bias='none'
36
+ - task_type='CAUSAL_LM'
37
 
38
+ Quantization parameters:
39
+ - load_in_4bit=True
40
+ - bnb_4bit_quant_type="nf4"
41
+ - bnb_4bit_compute_dtype=torch.bfloat16
42
 
43
  ### Training hyperparameters
44