Commit 43e4cc8 (verified) by MonteXiaofeng · 1 parent: ace5095

Upload README.md

Files changed (1)
  1. README.md +7 -3
README.md CHANGED
@@ -10,13 +10,17 @@ Aquila is a large language model independently developed by BAAI. Building upon

  The training process of the model is described as follows. For more information, please refer to our technical report. https://github.com/FlagAI-Open/industry-application/blob/main/Aquila_med_tech-report.pdf

- ![pipeline](http://flagchat.ks3-cn-beijing.ksyuncs.com/shixiaofeng/modelhub/AquilaMed-RL/img/pipeline.png?KSSAccessKeyId=AKLTkqVnZwpfTBiiu7O6iQHnA&Expires=2318244577&Signature=Obq391bkIAioVpi6rnely%2BSjkyE%3D)
+ ![pipeline](./img/pipline_2.jpg)

  ## Evaluation

- Using GPT-4 for evaluation, the win rates of our model compared to the reference answers in the annotated validation dataset are as follows.
+ The subjective and objective scores are as follows

- ![pipeline](http://flagchat.ks3-cn-beijing.ksyuncs.com/shixiaofeng/modelhub/AquilaMed-RL/img/eval-result.jpeg?KSSAccessKeyId=AKLTkqVnZwpfTBiiu7O6iQHnA&Expires=2318244691&Signature=dGJgHapUpmYA4NxKwernLBBFrCU%3D)
+ subjective: Using GPT-4 for evaluation, the win rates of our model compared to the reference answers in the annotated validation dataset are as follows.
+
+ Objective:use MMLU / C-EVAL / CMB-exam to evaluate the model
+
+ ![pipeline](./img/eval-result-med.png)

  ## usage