# Update README.md

This gemma2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth).
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
## Model Overview

This model is fine-tuned to assist with drafting patent specifications based on a general description of an invention.
The base model is unsloth/gemma-2-2b-it, and I used unsloth to merge the fine-tuned adapter.

## Dataset

The dataset used for fine-tuning includes a combination of research paper summary datasets from AI-Hub and patent claims data directly retrieved from KIPRIS (Korea Intellectual Property Rights Information Service).

## Model Training

The model was trained using LoRA (Low-Rank Adaptation). The following code was used for training:
```
model = FastLanguageModel.get_peft_model(
    model,
    # ... LoRA arguments elided ...
)

trainer = SFTTrainer(
    # ... training arguments elided ...
)
```
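LoRA trains only a pair of small low-rank matrices on top of each frozen weight, which is why the adapter can later be merged back into the base model. A toy pure-Python sketch of the update `W' = W + (alpha/r) * B @ A` (illustrative dimensions only, not unsloth's implementation):

```python
# Toy LoRA merge: W' = W + (alpha / r) * (B @ A), with B of shape (d, r) and
# A of shape (r, k). Only A and B are trained; the base weight W stays frozen.

def matmul(X, Y):
    """Plain-Python matrix product."""
    return [[sum(X[i][t] * Y[t][j] for t in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

d, k, r, alpha = 4, 4, 1, 2.0
W = [[1.0 if i == j else 0.0 for j in range(k)] for i in range(d)]  # frozen base weight
B = [[0.5] for _ in range(d)]       # (d, r), trained
A = [[0.1, 0.2, 0.3, 0.4]]          # (r, k), trained

delta = matmul(B, A)                # (d, k) low-rank update
W_merged = [[W[i][j] + (alpha / r) * delta[i][j] for j in range(k)] for i in range(d)]

# LoRA trains d*r + r*k = 8 parameters here, versus d*k = 16 for a full update;
# at transformer scale (d, k in the thousands, small r) the saving is dramatic.
trainable, full = d * r + r * k, d * k
```

`get_peft_model` sets up adapters of exactly this shape, typically on the attention and MLP projection matrices, and merging the adapter collapses `B @ A` back into the base weights.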
## How to Use the Model

1. Install unsloth:
```
%%capture
!pip install unsloth
# ...
if torch.cuda.get_device_capability()[0] >= 8:
    !pip install --no-deps packaging ninja einops "flash-attn>=2.6.3"
```
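The install cell gates the flash-attn install on GPU compute capability (`torch.cuda.get_device_capability()[0] >= 8`, i.e. Ampere or newer). The same check, pulled out into a hypothetical helper:

```python
def should_install_flash_attn(capability):
    """flash-attn 2.x requires an Ampere-class GPU or newer (compute capability major >= 8)."""
    major, _minor = capability
    return major >= 8

# torch.cuda.get_device_capability() returns e.g. (8, 0) on an A100 and (7, 5)
# on a T4, so flash-attn is only installed on the A100.
```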
2. Load the fine-tuned model and use it for inference:
```
from unsloth import FastLanguageModel
import torch

model, tokenizer = FastLanguageModel.from_pretrained(
    # ... model name and loading arguments elided ...
    token = token
)
```
3. Write a prompt and generate text:
```
input = """
상술한 과제를 해결하기 위하여, 본 고안은 내부에 보관할 물건을 넣을 수 있는 기본 내장 공간과 이를 둘러싼
...
"""
# The prompt reads: "To solve the problem described above, the present invention
# provides a basic interior compartment for storing items, surrounded by ..."

# ... tokenization and streamer setup elided ...
_ = model.generate(**inputs, streamer = text_streamer, max_new_tokens = 1000)
```
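The cells between loading the model and calling `model.generate` turn the raw description into a model prompt. A hedged sketch of one instruction-style way to do that — the actual template used for fine-tuning is not shown in this README, so the wording and field names below are assumptions:

```python
# Hypothetical prompt template; the real one used during fine-tuning is elided,
# so treat the exact sections and wording as illustrative only.
TEMPLATE = (
    "### Instruction:\n{instruction}\n\n"
    "### Input:\n{description}\n\n"
    "### Response:\n"
)

def build_prompt(description,
                 instruction="Draft a patent specification for the invention described below."):
    """Fill the template with an invention description."""
    return TEMPLATE.format(instruction=instruction, description=description.strip())

prompt = build_prompt("A bag with a basic interior compartment for storing items.")
```

The assembled `prompt` would then be tokenized and passed to `model.generate`.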
## Model Results

The model was tested using the "Means to Solve the Problem" section from actual patent specifications. When compared with real patent documents, the model generated content that was relatively similar in structure and meaning.
```
[발명의 명칭]
가방
...
```
([발명의 명칭] = "Title of the Invention"; 가방 = "Bag".)
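"Relatively similar in structure and meaning" is a qualitative judgment; a crude way to attach a number to surface overlap is token-level Jaccard similarity (our choice of metric for illustration, not one used by the author):

```python
def jaccard(generated, reference):
    """Token-level Jaccard similarity between two texts, in [0.0, 1.0]."""
    a, b = set(generated.split()), set(reference.split())
    return len(a & b) / len(a | b) if a | b else 1.0

# Example with two English paraphrases of a claim-like sentence:
score = jaccard(
    "a bag with an inner compartment and an outer cover",
    "a bag having an inner compartment and an outer shell",
)
```

For Korean patent text, a morpheme-level tokenizer or an embedding-based similarity would be more faithful; whitespace splitting is only a rough proxy.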