mesut commited on
Commit
f4137a5
·
verified ·
1 Parent(s): e6ffb3f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -0
README.md CHANGED
@@ -24,6 +24,13 @@ Caption the speech: {}
24
 
25
  ### Response:
26
  Caption of the speech: {}"""
 
 
 
 
 
 
 
27
  - **Developed by:** mesut
28
  - **License:** apache-2.0
29
  - **Finetuned from model :** unsloth/meta-llama-3.1-8b-bnb-4bit
 
24
 
25
  ### Response:
26
  Caption of the speech: {}"""
27
+
28
+
29
+ The data set is curated using
30
+
31
+ Judd, Nicholas, Dan Drinkard, Jeremy Carbaugh, and Lindsay Young. congressional-record: A parser for the Congressional Record. Chicago, IL: 2017. https://github.com/unitedstates/congressional-record
32
+
33
+ Text is preprocessed by removing President names, Vice President names, party names, and some cliche phrases such as "I reserve the balance of my time","I yield the floor" etc. The data set is balanced based on parties as well.
34
  - **Developed by:** mesut
35
  - **License:** apache-2.0
36
  - **Finetuned from model :** unsloth/meta-llama-3.1-8b-bnb-4bit