Update README.md
README.md
CHANGED
@@ -9,15 +9,15 @@ pinned: false
 <div style="border: 2px solid #cce7ff; background-color: #d6ecff; padding: 20px; border-radius: 10px; margin-bottom: 20px;">

-# MarineLives: Unlocking Early Modern History

-**MarineLives** is a volunteer-led initiative for transcribing and enriching English High Court of Admiralty records from the 16th and 17th centuries. These records serve as a rich source for exploring social, material, and economic history.

 </div>

 <div style="border: 2px solid #cce7ff; background-color: #f0f8ff; padding: 20px; border-radius: 10px; margin-bottom: 20px;">

-# 1.0 Research Focus

 ## **1.1 Fine-tuning Small LLMs**
@@ -44,7 +44,7 @@ Exploring the potential of small LLMs for cleaning Raw HTR outputs from machine-
 <div style="border: 2px solid #ffc299; background-color: #fff4e5; padding: 20px; border-radius: 10px; margin-bottom: 20px;">

-

 ### Components:
 - **Retriever**: BM25 or Sentence-BERT
@@ -59,7 +59,7 @@ Exploring the potential of small LLMs for cleaning Raw HTR outputs from machine-
 <div style="border: 2px solid #b3e6b3; background-color: #e5ffe5; padding: 20px; border-radius: 10px; margin-bottom: 20px;">

-# 2.0 Datasets

 ## **2.1 Published Datasets**
 1. [MarineLives/English-Expansions](https://huggingface.co/datasets/MarineLives/English-Expansions)
@@ -77,7 +77,9 @@ Exploring the potential of small LLMs for cleaning Raw HTR outputs from machine-
 <div style="border: 2px solid #cce7ff; background-color: #d6ecff; padding: 20px; border-radius: 10px; margin-bottom: 20px;">

-# Explore MarineLives
 Join us in unlocking Early Modern history by exploring our [Hugging Face organization](https://huggingface.co/MarineLives) and datasets!

 </div>
@@ -9,15 +9,15 @@ pinned: false
 <div style="border: 2px solid #cce7ff; background-color: #d6ecff; padding: 20px; border-radius: 10px; margin-bottom: 20px;">

+# **MarineLives: Unlocking Early Modern History**

+## **MarineLives** is a volunteer-led initiative for transcribing and enriching English High Court of Admiralty records from the 16th and 17th centuries. These records serve as a rich source for exploring social, material, and economic history.

 </div>

 <div style="border: 2px solid #cce7ff; background-color: #f0f8ff; padding: 20px; border-radius: 10px; margin-bottom: 20px;">

+# **1.0 Research Focus**

 ## **1.1 Fine-tuning Small LLMs**
@@ -44,7 +44,7 @@ Exploring the potential of small LLMs for cleaning Raw HTR outputs from machine-
 <div style="border: 2px solid #ffc299; background-color: #fff4e5; padding: 20px; border-radius: 10px; margin-bottom: 20px;">

+# **1.2 Integration with RAG Pipeline**

 ### Components:
 - **Retriever**: BM25 or Sentence-BERT
@@ -59,7 +59,7 @@ Exploring the potential of small LLMs for cleaning Raw HTR outputs from machine-
 <div style="border: 2px solid #b3e6b3; background-color: #e5ffe5; padding: 20px; border-radius: 10px; margin-bottom: 20px;">

+# **2.0 Datasets**

 ## **2.1 Published Datasets**
 1. [MarineLives/English-Expansions](https://huggingface.co/datasets/MarineLives/English-Expansions)
@@ -77,7 +77,9 @@ Exploring the potential of small LLMs for cleaning Raw HTR outputs from machine-
 <div style="border: 2px solid #cce7ff; background-color: #d6ecff; padding: 20px; border-radius: 10px; margin-bottom: 20px;">

+# **Explore MarineLives**
 Join us in unlocking Early Modern history by exploring our [Hugging Face organization](https://huggingface.co/MarineLives) and datasets!
+You can follow us on BlueSky at [@marinelives.bsky.social](https://bsky.app/profile/marinelives.bsky.social)
+You can explore our content on our [MarineLives wiki](http://www.marinelives.org/wiki/MarineLives) and on our [MarineLives Transkribus site]

 </div>