openthaigpt
/

thai-trocr

vision-encoder-decoder

image-text-to-text

Inference Endpoints

Model card Files Files and versions Community

kobkrit commited on Oct 18, 2024

Commit

1c1be54

·

verified ·

1 Parent(s): ca95332

Update README.md

Files changed (1) hide show

README.md +7 -6

README.md CHANGED Viewed

@@ -52,7 +52,8 @@ print(generated_text)
 ## Model Performance Comparison
-The table below summarizes the performance metrics of various models across different document types, based on the adjusted mean score:
 | Document Type         | ThaiTrOCR | EasyOCR | Tesseract |
 |:----------------------|---------:|--------:|---------:|
@@ -63,12 +64,12 @@ The table below summarizes the performance metrics of various models across diff
 | Scene Text            | **0.134182** | 0.390583 | 2.408704 |
 | **Adjusted Mean**     | **0.123600** | 0.298474 | 1.269101 |
-**Notes**
-- The CER metric indicates that lower scores reflect better performance.
-- Tesseract supports only one language at a time; this benchmark uses only Thai.
-- Benchmarking was performed on a Google Colab CPU task.
-- The evaluation dataset is sourced from the openthaigpt/thai-ocr-evaluation.
 ## Sponsors

 ## Model Performance Comparison
+This section details the performance comparison between the open-source ThaiTrOCR model and other widely-used OCR systems, namely EasyOCR and Tesseract. The table below highlights their respective performance across various document types based on the average Character Error Rate (CER).
 | Document Type         | ThaiTrOCR | EasyOCR | Tesseract |
 |:----------------------|---------:|--------:|---------:|
 | Scene Text            | **0.134182** | 0.390583 | 2.408704 |
 | **Adjusted Mean**     | **0.123600** | 0.298474 | 1.269101 |
+# Key Insights
+* Character Error Rate (CER): This metric evaluates the percentage of characters that were incorrectly predicted by the model. A lower CER indicates better performance. As shown in the table, ThaiTrOCR consistently outperforms EasyOCR and Tesseract across all document types, with a significantly lower average CER, making it the most accurate model in the comparison.
+* Model Performance: The ThaiTrOCR model is particularly effective with PDF documents (both Thai-only and bilingual English-Thai texts), and shows substantial improvement over competing models in reading scene text and handwritten content.
+* Tesseract Limitation: It’s important to note that Tesseract only supports single-language input at a time in this comparison. For the purposes of this benchmark, it was tested using only the Thai language setting, which might have contributed to its higher CER values.
+* The evaluation dataset is sourced from the [openthaigpt/thai-ocr-evaluation](https://huggingface.co/datasets/openthaigpt/thai-ocr-evaluation).
 ## Sponsors