shreyasmeher committed: Update README.md
- 4-bit Quantization: Enabled
- Max Sequence Length: 1024

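
As a minimal sketch, a 4-bit setup like the one above is typically expressed through a bitsandbytes quantization config when loading the base model. The exact values below are illustrative assumptions (common QLoRA-style defaults), not the recorded hyperparameters of this training run:

```python
# Illustrative only: a typical 4-bit quantization config via bitsandbytes.
# The actual settings used for this model may differ.
import torch
from transformers import BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # matches "4-bit Quantization: Enabled"
    bnb_4bit_quant_type="nf4",              # NormalFloat4, a common QLoRA default
    bnb_4bit_compute_dtype=torch.bfloat16,  # compute in bf16 while weights stay 4-bit
    bnb_4bit_use_double_quant=True,         # also quantize the quantization constants
)
```

Such a config would be passed as `quantization_config` to `from_pretrained` when loading the base model.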
## Model Architecture

The model uses a combination of efficient fine-tuning techniques and optimizations for handling conflict event classification:

<p align="center">
  <img src=".github/images/model-arch.png" alt="Model Training Architecture" width="800"/>
</p>
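
The "efficient fine-tuning techniques" are not spelled out in this section; a common pairing with a 4-bit base model is a low-rank adapter (LoRA, as in QLoRA). The idea can be sketched in plain Python with toy dimensions — this is an assumption-laden illustration of the technique, not this model's actual implementation:

```python
# Toy LoRA forward pass: y = W x + (alpha / r) * B (A x).
# W is the frozen base weight; only the small matrices A and B are trained.

def matvec(M, x):
    """Multiply matrix M (list of rows) by vector x."""
    return [sum(m_ij * x_j for m_ij, x_j in zip(row, x)) for row in M]

def lora_forward(W, A, B, x, alpha=16, r=2):
    base = matvec(W, x)              # frozen base projection
    delta = matvec(B, matvec(A, x))  # low-rank update B @ (A @ x)
    scale = alpha / r
    return [b + scale * d for b, d in zip(base, delta)]

# 2x2 base weight, rank-1 adapter with B initialised to zero,
# so before training the adapter is a no-op.
W = [[1.0, 0.0], [0.0, 1.0]]
A = [[0.5, 0.5]]          # r x d_in  (1 x 2)
B = [[0.0], [0.0]]        # d_out x r (2 x 1)
x = [2.0, 3.0]

print(lora_forward(W, A, B, x))  # B == 0, so identical to W @ x: [2.0, 3.0]
```

Because `B` starts at zero, fine-tuning begins exactly at the base model and only gradually learns a low-rank correction.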

### Data Processing Pipeline

The preprocessing pipeline transforms raw GTD data into a format suitable for fine-tuning:

<p align="center">
  <img src=".github/images/preprocessing.png" alt="Data Preprocessing Pipeline" width="800"/>
</p>
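
A minimal sketch of such a transformation, assuming GTD-style field names from the public codebook (`summary`, `attacktype1_txt`, `targtype1_txt`) and an illustrative prompt template — not the exact template used for this model:

```python
# Turn a raw GTD-style record into an instruction/response pair.
# Field names follow the public GTD codebook; the prompt wording is
# a hypothetical example, not the model's actual template.

def gtd_to_example(record, max_chars=1024):
    summary = (record.get("summary") or "").strip()[:max_chars]  # crude length cap
    prompt = (
        "Classify the following conflict event description.\n"
        f"Event: {summary}"
    )
    response = (
        f"Attack type: {record['attacktype1_txt']}; "
        f"Target type: {record['targtype1_txt']}"
    )
    return {"instruction": prompt, "output": response}

example = gtd_to_example({
    "summary": "An explosive device detonated near a government building.",
    "attacktype1_txt": "Bombing/Explosion",
    "targtype1_txt": "Government (General)",
})
print(example["output"])
```

In a real pipeline the character cap would be replaced by tokenizer-level truncation to the 1024-token maximum sequence length listed above.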

### Memory Optimizations
- Used 4-bit quantization
- Gradient accumulation steps: 8
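
Gradient accumulation trades compute for memory: gradients from 8 small micro-batches are averaged before a single optimizer step, so the effective batch size is 8x the per-device batch size while only one micro-batch resides in memory at a time. A toy scalar sketch of the idea (not the actual training loop):

```python
# Gradient accumulation over 8 micro-batches for a scalar model y = w * x
# with mean-squared-error loss. One optimizer update per 8 micro-batches.

ACCUM_STEPS = 8

def grad(w, batch):
    """d/dw of MSE loss for y = w * x on one micro-batch of (x, y) pairs."""
    return sum(2 * (w * x - y) * x for x, y in batch) / len(batch)

def train_step(w, micro_batches, lr=0.01):
    accumulated = 0.0
    for batch in micro_batches:               # forward/backward per micro-batch
        accumulated += grad(w, batch) / ACCUM_STEPS  # average across micro-batches
    return w - lr * accumulated               # single optimizer update

data = [[(1.0, 2.0)]] * ACCUM_STEPS           # 8 identical one-sample micro-batches
w = train_step(0.0, data)
print(w)  # 0.04: same update as one full batch of all 8 samples
```

The update is mathematically equivalent to training on the concatenated full batch, which is why accumulation is a safe way to fit large effective batch sizes on limited GPU memory.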