cyberagent
/

type-r

@@ -45,20 +45,19 @@ resources/
 </pre>
-## 📘 ⚠️Manual download required⚠️
 - `resources/weight/ocr/solo.pth`
-  - Please download this weight from the official [Deeosolo](https://github.com/ViTAE-Transformer/DeepSolo) implementation.
-[[link](https://onedrive.live.com/?redeem=aHR0cHM6Ly8xZHJ2Lm1zL3UvcyFBaW1CZ1lWN0pqVGxnY2Q5d2k0MzJ1aXRNZ1RNLXc%5FZT1manVKYm0&cid=E534267B85818129&id=E534267B85818129%2125597&parId=E534267B85818129%2125575&o=OneUp)] uses ViTAEv2-S as its backbone and is trained on `Synth150K+Total-Text+MLT17+IC13+IC15+TextOCR`.
 - `resources/weight/ocr/masktextspotterv3.pth`
-  - Please download this weight from the official [MaskTextSpotterV3](https://github.com/MhLiao/MaskTextSpotterV3) implementation.
-[[link](https://drive.google.com/file/d/1XQsikiNY7ILgZvmvOeUf9oPDG4fTp0zs/view)]
 - `resources/data/arial_unicode_ms.ttf`
   - Since the Arial font cannot be redistributed, please obtain it through your operating system or another legal source. As an alternative, you may use an open font such as Liberation Sans (resources/data/LiberationSans-Regular.ttf). However, please note that we have observed a drop of 1–2 points in OCR accuracy on the Mario-Eval benchmark when using AnyText with Liberation Sans under our best configuration.
 ## 📘 Dataset details
 - `weight/`
-  - Pretrained weights used for various modules in the Type-R pipeline:
   - **ocr/**: Models for OCR detection/recognition.
   - **text_eraser/**: Inpainting or erasure modules for removing text.
   - **text_editor/**: Models for rendering text into images.
@@ -67,7 +66,7 @@ resources/
 - `data/marioevalbench/`
-  - The dataset containing prompts and reference images for evaluating Type-R:
   - **hfds/**: includes prompts, augmented prompts, and images of the Mario-Eval Benchmark

 </pre>
+## 📘 ⚠️Manual download required data⚠️
 - `resources/weight/ocr/solo.pth`
+  - Please download this weight from the official [Deeosolo](https://github.com/ViTAE-Transformer/DeepSolo) implementation.[[link](https://onedrive.live.com/?redeem=aHR0cHM6Ly8xZHJ2Lm1zL3UvcyFBaW1CZ1lWN0pqVGxnY2Q5d2k0MzJ1aXRNZ1RNLXc%5FZT1manVKYm0&cid=E534267B85818129&id=E534267B85818129%2125597&parId=E534267B85818129%2125575&o=OneUp)]
+  - This weight has ViTAEv2-S as its backbone and is trained on `Synth150K+Total-Text+MLT17+IC13+IC15+TextOCR`.
 - `resources/weight/ocr/masktextspotterv3.pth`
+  - Please download this weight from the official [MaskTextSpotterV3](https://github.com/MhLiao/MaskTextSpotterV3) implementation. [[link](https://drive.google.com/file/d/1XQsikiNY7ILgZvmvOeUf9oPDG4fTp0zs/view)]
 - `resources/data/arial_unicode_ms.ttf`
   - Since the Arial font cannot be redistributed, please obtain it through your operating system or another legal source. As an alternative, you may use an open font such as Liberation Sans (resources/data/LiberationSans-Regular.ttf). However, please note that we have observed a drop of 1–2 points in OCR accuracy on the Mario-Eval benchmark when using AnyText with Liberation Sans under our best configuration.
 ## 📘 Dataset details
 - `weight/`
+  - This dicrectory contains pretrained weights used for various modules in the Type-R pipeline
   - **ocr/**: Models for OCR detection/recognition.
   - **text_eraser/**: Inpainting or erasure modules for removing text.
   - **text_editor/**: Models for rendering text into images.
 - `data/marioevalbench/`
+  - The dataset containing prompts and reference images for evaluating Type-R
   - **hfds/**: includes prompts, augmented prompts, and images of the Mario-Eval Benchmark