NaverHustQA
/

LawVinaLlama

Text Generation

text-generation-inference

retrieval-augmented-generation

Inference Endpoints

Model card Files Files and versions Community

haisonle001 commited on 25 days ago

Commit

efd9ca5

·

verified ·

1 Parent(s): 69ad59f

Update README.md

Files changed (1) hide show

README.md +19 -18

README.md CHANGED Viewed

@@ -16,38 +16,39 @@ tags:
 ## Model Card: LawVinaLlama
-**Mô tả mô hình:**
-LawVinaLlama là à một mô hình ngôn ngữ lớn (LLM) chuyên về pháp luật Việt Nam, được tinh chỉnh từ kiến trúc Llama. Mô hình được đào tạo trên các tài liệu pháp lý thực tế nhằm nâng cao khả năng suy luận, truy xuất thông tin và tóm tắt nội dung pháp luật.
-**Các nguồn dữ liệu chính:**
-* 150.000 QPA được crawled và xử lý từ Thư Viện Pháp Luật
-* 40.000 QA dịch và tóm tắt từ luật quốc tế
-* 10.000 QA dịch và tóm tắt từ luật quốc tế
-* 50.000 Reasoning QA được generated từ GPT-4.0/ Gemini
-**Mục đích sử dụng:**
-Mô hình LawVinaLlama phù hợp cho các tác vụ sau:
-* Trả lời câu hỏi/ Trả lời câu hỏi dựa trên ngữ cảnh cho trước.
-* Tóm tắt
-**Giới hạn:**
-LawVinaLlama vẫn có thể gặp phải một số hạn chế:
-* Có thể tạo ra thông tin sai lệch hoặc không chính xác.
-* Hiệu suất có thể bị ảnh hưởng bởi chất lượng của đầu vào.
-**Cách sử dụng:**
-Load Model
 ```python
 from unsloth import FastLanguageModel
@@ -100,4 +101,4 @@ generated_ids = model.generate(
 a = tokenizer.batch_decode(generated_ids)[0]
 #  print(a.split('### Trả lời:')[1])
 print(a)
-```

 ## Model Card: LawVinaLlama
+**Model Description:**
+LawVinaLlama is a large language model (LLM) specialized in **Vietnamese law**, fine-tuned from the Llama architecture. The model has been trained on real legal documents to improve its ability to **reason, retrieve legal information, and summarize legal content**.
+**Main Data Sources:**
+- **150,000 Q&A** crawled and processed from *Thư Viện Pháp Luật* (Vietnamese Legal Library)
+- **40,000 Q&A** translated and summarized from international law
+- **10,000 Q&A** translated and summarized from international law (duplicate, possibly an error)
+- **50,000 Reasoning Q&A** generated by GPT-4.0/Gemini
+**Intended Use Cases:**
+LawVinaLlama is suitable for the following tasks:
+- **Answering legal questions** / **Providing legal answers based on a given context**
+- **Summarizing legal content**
+**Limitations:**
+LawVinaLlama may still encounter some limitations:
+- It may generate **misleading or inaccurate** information.
+- Its **performance depends on the quality of the input data**.
+**How to Use:**
+Load model
 ```python
 from unsloth import FastLanguageModel
 a = tokenizer.batch_decode(generated_ids)[0]
 #  print(a.split('### Trả lời:')[1])
 print(a)
+```