DebasishDhal99 committed on
Commit 5c6e992 · 1 Parent(s): d35de09

Readme edit

Files changed (1)
  1. README.md +21 -5
README.md CHANGED
@@ -9,13 +9,29 @@ app_file: app.py
9
  pinned: false
10
  short_description: Convert text/image/audio/video from src language to English
11
  ---
12
 
13
- The space consists of 3/4 parts: -
14
 
15
- - Text translator - Input (Text), Output (Translated text in English)
16
- - Image translator - Input (Image with any text), Output (English Translated text version of the text in the image)
17
- - Audio translator - Input (Audio in any language), Output (English Translated text version of the audio)
18
- - Video translator - Input (Video), Output (English Translated text version of the audio) [Not yet implemented]
19
  ********************************************************
20
 
21
  Demo
 
9
  pinned: false
10
  short_description: Convert text/image/audio/video from src language to English
11
  ---
12
+ ****************************
13
+ <p align="center">
14
+ Liked the setup? Leave a like at the top left; it takes only 2 seconds.
15
+ </p>
16
 
17
+ ****************************
18
+ Replication
19
+ - Requirements
20
+   - Free API key from https://detectlanguage.com/ for automatic language detection from text.
21
+   - GPU for `Whisper` model inference. It is slower on CPU.
22
+ - Notes
23
+   - The `pytesseract` library (for image-to-text) is easier to install on Linux machines.
24
+   - If you have a GPU, you can use more sophisticated image-to-text models.
25
+   - The image-to-text setup works best with non-decorative, normal-sized fonts.
26
+ *******
27
+
28
+ The Space consists of four parts, three of which are implemented:
29
+
30
+ - Text translator - Input (Input Text, Target language), Output (Translated text in target language, Source language name)
31
+ - Image translator - Input (Image with any text, Source language, Target language), Output (Image text in source language, Image text translated to target language)
32
+ - Audio translator - Input (Audio in any language, Model size, Target language), Output (Transcribed original text, Transcribed text translated to target language, Original language name)
33
+ - Video translator - Input (Video, Model size, Target language), Output (Translated text version of the audio) [Not yet implemented]
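Reviewer note: the audio path described in the list above (audio in, transcript and detected language out, GPU preferred) might look roughly like the sketch below. It is only an illustration, not the Space's actual `app.py`: it assumes the `openai-whisper` package and an optional PyTorch install, the function names `pick_device` and `transcribe_audio` are hypothetical, and the translation step to the target language is left out.

```python
from typing import Optional


def pick_device() -> str:
    """Return "cuda" when PyTorch reports a usable GPU, otherwise "cpu"."""
    try:
        import torch  # optional dependency, imported lazily
    except ImportError:
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


def transcribe_audio(path: str, model_size: str = "base",
                     device: Optional[str] = None) -> dict:
    """Transcribe an audio file with Whisper.

    Assumption: the `openai-whisper` package is installed; the Space itself
    may load Whisper differently.
    """
    import whisper  # assumption: `pip install openai-whisper`

    model = whisper.load_model(model_size, device=device or pick_device())
    result = model.transcribe(path)
    # Whisper's result dict carries the transcript and the detected language code.
    return {"text": result["text"].strip(), "language": result["language"]}
```

Falling back to CPU keeps the sketch usable on machines without a GPU, at the cost of the slower inference the README warns about; a smaller `model_size` is the usual way to offset that.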
34
 
35
  ********************************************************
36
 
37
  Demo