Spaces:
Runtime error
Runtime error
File size: 1,125 Bytes
0cc1374 25d6d4e 0cc1374 c73baf9 0cc1374 d6367a6 6ccf728 b5d228a 25d6d4e 0cc1374 73fbbec 0cc1374 e981580 0d92853 f77e00f 07b53f0 013f4fa 5363551 0d92853 b9b23be 295a905 ed98a83 3bd5c6e d6367a6 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 |
boto3>=1.28.43 Brotli>=1.1.0 click>=8.1.7 PyMuPDF>=1.24.9,<1.24.14 loguru>=0.6.0 numpy>=1.21.6,<2.0.0 fast-langdetect>=0.2.3 scikit-learn>=1.0.2 transformers>=4.37.2 # Updated for LayoutLMv3 pdfminer.six==20231228 unimernet==0.2.3 doclayout_yolo==0.0.2b1 matplotlib ultralytics>=8.3.48 paddleocr==2.7.3 paddlepaddle-gpu @ https://paddle-whl.bj.bcebos.com/stable/cu118/paddlepaddle-gpu/paddlepaddle_gpu-3.0.0b1-cp310-cp310-linux_x86_64.whl struct-eqtable==0.3.2 detectron2 @ https://wheels-1251341229.cos.ap-shanghai.myqcloud.com/assets/whl/detectron2/detectron2-0.6-cp310-cp310-linux_x86_64.whl magic-pdf>=1.0.1 torch>=2.2.2,<=2.3.1 torchvision>=0.17.2,<=0.18.1 rapid-table>=1.0.3,<2.0.0 rapidocr-paddle rapidocr-onnxruntime gradio-pdf>=0.0.21 openai telebot requests PyPDF2>=3.0.0 # Updated for better PDF parsing Pillow>=10.0.0 # Required for image processing pytesseract>=0.3.10 # Optional for OCR capabilities python-Levenshtein>=0.21.1 # For text similarity comparison pdf2image>=1.16.3 # For PDF to image conversion layoutlmv3 @ git+https://github.com/microsoft/unilm.git#subdirectory=layoutlmv3 # For LayoutLMv3 |