abdoelsayed committed · Commit 7ca60a7 · verified · 1 Parent(s): a807e66

Update README.md

Files changed (1)
  1. README.md +54 -1
README.md CHANGED
@@ -8,4 +8,57 @@ language:
  metrics:
  - accuracy
  library_name: transformers
- ---
+ ---
+
+ # AraDPR: Arabic Dense Passage Retrieval Model
+
+ AraDPR is a state-of-the-art dense passage retrieval model designed specifically for Arabic. It encodes questions and passages into dense vectors so that question-answering systems can retrieve relevant passages efficiently and accurately.
+
+ ## Model Details
+
+ ### Model Description
+
+ - **Developed by:**
+ - **Model type:** Dense Passage Retrieval (DPR)
+ - **Language(s) (NLP):** Arabic
+ - **License:** MIT
+ - **Finetuned from:** AraBERT
+
+ ### Model Sources
+
+ - **Repository:** https://github.com/DataScienceUIBK/ArabicaQA
+ - **Paper:** will be available soon
+ - **Demo:** will be available soon
+
+ ## Uses
+
+ ### Direct Use
+
+ AraDPR is designed for Arabic question-answering systems, where it retrieves the most relevant passages from a large corpus efficiently.
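
As a rough illustration of the retrieval step described above, the sketch below ranks passages by the inner product between a question vector and precomputed passage vectors. It is not taken from the AraDPR repository: the function names `inner_product` and `retrieve_top_k` are hypothetical, the vectors are hand-written toy values rather than real encoder outputs, and inner-product scoring is assumed because it is the usual choice for DPR-style models.

```python
import heapq


def inner_product(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def retrieve_top_k(question_vec, passage_vecs, k=2):
    """Return (passage_id, score) pairs for the k highest-scoring passages."""
    scored = (
        (pid, inner_product(question_vec, vec))
        for pid, vec in passage_vecs.items()
    )
    return heapq.nlargest(k, scored, key=lambda pair: pair[1])


# Toy vectors standing in for encoder outputs (assumption: not real AraDPR embeddings).
passage_vecs = {
    "p1": [0.9, 0.1, 0.0],
    "p2": [0.1, 0.8, 0.1],
    "p3": [0.7, 0.2, 0.1],
}
question_vec = [1.0, 0.0, 0.0]

top = retrieve_top_k(question_vec, passage_vecs, k=2)
for pid, score in top:
    print(pid, round(score, 3))
```

For a large corpus, an exhaustive scan like this is usually replaced by an approximate nearest-neighbour index, but the scoring rule stays the same.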
+
+ ### Downstream Use
+
+ Beyond question answering, AraDPR can be integrated into other NLP applications that require passage retrieval, such as document summarization and information extraction.
+
+ ### Out-of-Scope Use
+
+ AraDPR is not intended for languages other than Arabic or for tasks that do not involve passage retrieval.
+
+ ## Bias, Risks, and Limitations
+
+ While AraDPR represents a significant advancement in Arabic NLP, users should be aware of its limitations, particularly in handling dialectal Arabic and highly domain-specific texts. Further research and development are encouraged to address these challenges.
+
+ ## How to Get Started with the Model
+
+ To get started with AraDPR, please see our GitHub page: https://github.com/DataScienceUIBK/ArabicaQA
+
+ ## Training Details
+
+ AraDPR was trained on a diverse corpus drawn from Arabic Wikipedia, covering a wide range of topics to ensure broad language coverage.
+
+ ## Results
+
+ AraDPR demonstrates superior performance over traditional retrieval methods, significantly improving the efficiency and accuracy of question answering in Arabic.
+
+ ## Technical Specifications
+
+ ### Model Architecture and Objective
+
+ AraDPR uses a dual-encoder architecture with separate encoders for questions and passages. The model is trained so that semantically related questions and passages lie close together in the vector space.
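
To make the objective above more concrete, here is a minimal sketch of the in-batch negative log-likelihood loss commonly used to train DPR-style dual encoders. The model card does not state AraDPR's exact loss, so treating it as this objective is an assumption; the function name `in_batch_nll` and the toy vectors are hypothetical and stand in for real encoder outputs.

```python
import math


def dot(u, v):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def in_batch_nll(question_vecs, passage_vecs):
    """Mean negative log-likelihood over a batch.

    Passage i is treated as the positive for question i; every other
    passage in the batch acts as a negative.
    """
    losses = []
    for i, q in enumerate(question_vecs):
        scores = [dot(q, p) for p in passage_vecs]
        # Numerically stable log-sum-exp over all passages in the batch.
        m = max(scores)
        log_z = m + math.log(sum(math.exp(s - m) for s in scores))
        losses.append(log_z - scores[i])
    return sum(losses) / len(losses)


# Toy vectors (assumption: hand-written, not real AraDPR embeddings).
questions = [[2.0, 0.0], [0.0, 2.0]]
aligned_passages = [[2.0, 0.0], [0.0, 2.0]]      # each positive is close to its question
misaligned_passages = [[0.0, 2.0], [2.0, 0.0]]   # each positive is far from its question

loss_aligned = in_batch_nll(questions, aligned_passages)
loss_misaligned = in_batch_nll(questions, misaligned_passages)
print(loss_aligned < loss_misaligned)
```

The loss is small when each question scores highest against its own passage and large otherwise, which is what pushes related question and passage vectors together during training.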