Kundyzka commited on
Commit
b21d8dd
·
verified ·
1 Parent(s): 8ef9d16

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +38 -12
README.md CHANGED
@@ -1,6 +1,31 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  # Description
2
 
3
- This model was developed by **Kundyz Maksutova**, PhD Candidate, as part of research on improving question-answering systems in the Kazakh language. It is a fine-tuned version of `Kyrmasch/t5-kazakh-qa` on the `Kundyzka/informatics_kaz` dataset. The model is specifically optimized for question-answering tasks in Kazakh, targeting the domain of computer science and related fields. It demonstrates reliable performance on both validation and test datasets, as evaluated by F1 and Exact Match metrics.
4
 
5
  ### Key Features:
6
  - **Developer**: Kundyz Maksutova, PhD Candidate
@@ -9,22 +34,23 @@ This model was developed by **Kundyz Maksutova**, PhD Candidate, as part of rese
9
  - **Language**: Kazakh (`kk`)
10
  - **Task**: Question Answering
11
  - **Performance**:
12
- - Validation:
13
  - F1 Score: 31.405
14
  - Exact Match: 14.675
15
- - Test:
16
  - F1 Score: 56.819
17
  - Exact Match: 35.454
18
 
 
 
 
19
  ### Intended Use:
20
- This model is designed for answering questions in the Kazakh language, with a focus on computer science topics. It is well-suited for use in educational platforms, research projects, and applications that require natural language understanding and response generation in Kazakh.
 
 
 
21
 
22
  ### Limitations and Ethical Considerations:
23
- - The model's performance may vary for domains outside of computer science.
24
- - It may reflect biases present in the dataset.
25
- - Users are encouraged to validate the model’s suitability for their specific use case before deploying it in production.
26
-
27
- ### Tags:
28
- - `computerscience`
29
-
30
- This model exemplifies the application of multilingual transformer models in low-resource languages like Kazakh. It is a contribution to advancing natural language processing capabilities for underrepresented languages. For further details, customization, or fine-tuning, refer to the model repository.
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - Kundyzka/informatics_kaz
5
+ language:
6
+ - kk
7
+ metrics:
8
+ - name: F1
9
+ type: F1 Score (Validation)
10
+ value: 31.405
11
+ - name: Exact Match (Validation)
12
+ type: Exact Match
13
+ value: 14.675
14
+ - name: F1 (Test)
15
+ type: F1 Score
16
+ value: 56.819
17
+ - name: Exact Match (Test)
18
+ type: Exact Match
19
+ value: 35.454
20
+ base_model:
21
+ - Kyrmasch/t5-kazakh-qa
22
+ new_version: Kundyzka/t5-kazakh-qa-informatics-kaz
23
+ pipeline_tag: question-answering
24
+ ---
25
+
26
  # Description
27
 
28
+ This model was developed by **Kundyz Maksutova**, PhD Candidate, as part of research on improving question-answering systems in the Kazakh language. It is a fine-tuned version of `Kyrmasch/t5-kazakh-qa` on the `Kundyzka/informatics_kaz` dataset. The model is specifically optimized for question-answering tasks in Kazakh, focusing on the domain of computer science and related fields.
29
 
30
  ### Key Features:
31
  - **Developer**: Kundyz Maksutova, PhD Candidate
 
34
  - **Language**: Kazakh (`kk`)
35
  - **Task**: Question Answering
36
  - **Performance**:
37
+ - **Validation**:
38
  - F1 Score: 31.405
39
  - Exact Match: 14.675
40
+ - **Test**:
41
  - F1 Score: 56.819
42
  - Exact Match: 35.454
43
 
44
+ ### Dataset:
45
+ The `Kundyzka/informatics_kaz` dataset is curated to provide a diverse set of questions and answers in Kazakh, primarily targeting topics in computer science. This dataset ensures the model handles domain-specific terminology effectively.
46
+
47
  ### Intended Use:
48
+ This model is designed for answering questions in the Kazakh language, with applications in:
49
+ - **Educational Platforms**: Supporting students in learning computer science.
50
+ - **Research Projects**: Facilitating studies in Kazakh natural language processing.
51
+ - **Applications**: Powering intelligent systems like chatbots or question-answering assistants.
52
 
53
  ### Limitations and Ethical Considerations:
54
+ - **Domain-Specific Bias**: Performance may drop on topics outside computer science.
55
+ - **Dataset Bias**: Potential biases from the dataset can influence model outputs.
56
+ - **Language Support**: The model is optimized for Kazakh and does not support other languages.