Update README.md
Browse files
README.md
CHANGED
@@ -24,6 +24,13 @@ Caption the speech: {}
|
|
24 |
|
25 |
### Response:
|
26 |
Caption of the speech: {}"""
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
27 |
- **Developed by:** mesut
|
28 |
- **License:** apache-2.0
|
29 |
- **Finetuned from model :** unsloth/meta-llama-3.1-8b-bnb-4bit
|
|
|
24 |
|
25 |
### Response:
|
26 |
Caption of the speech: {}"""
|
27 |
+
|
28 |
+
|
29 |
+
The data set is curated using
|
30 |
+
|
31 |
+
Judd, Nicholas, Dan Drinkard, Jeremy Carbaugh, and Lindsay Young. congressional-record: A parser for the Congressional Record. Chicago, IL: 2017. https://github.com/unitedstates/congressional-record
|
32 |
+
|
33 |
+
Text is preprocessed by removing President names, Vice President names, party names, and some cliche phrases such as "I reserve the balance of my time","I yield the floor" etc. The data set is balanced based on parties as well.
|
34 |
- **Developed by:** mesut
|
35 |
- **License:** apache-2.0
|
36 |
- **Finetuned from model :** unsloth/meta-llama-3.1-8b-bnb-4bit
|