File size: 1,787 Bytes
f836e5c
 
 
 
 
 
 
 
 
 
 
d2a5e02
f836e5c
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
d2a5e02
f836e5c
 
 
 
 
 
d2a5e02
f836e5c
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
### Install Git LFS
Before you begin, make sure Git Large File Storage (Git LFS) is installed on your system. Install it using the following command:

```bash
git lfs install
```

### Download the Model from Hugging Face
To download the `PDF-Extract-Kit` model from Hugging Face, use the following command:

```bash
git lfs clone https://huggingface.co/opendatalab/PDF-Extract-Kit
```

Ensure that Git LFS is enabled during the clone to properly download all large files.



### Download the Model from ModelScope

#### SDK Download

```bash
# First, install the ModelScope library using pip:
pip install modelscope
```

```python
# Use the following Python code to download the model using the ModelScope SDK:
from modelscope import snapshot_download
model_dir = snapshot_download('opendatalab/PDF-Extract-Kit')
```

#### Git Download
Alternatively, you can use Git to clone the model repository from ModelScope:

```bash
git clone https://www.modelscope.cn/opendatalab/PDF-Extract-Kit.git
```


Put [model files]() here:

```
./
β”œβ”€β”€ Layout
β”‚   β”œβ”€β”€ config.json
β”‚   └── model_final.pth
β”œβ”€β”€ MFD
β”‚   └── weights.pt
β”œβ”€β”€ MFR
β”‚   └── UniMERNet
β”‚       β”œβ”€β”€ config.json
β”‚       β”œβ”€β”€ preprocessor_config.json
β”‚       β”œβ”€β”€ pytorch_model.bin
β”‚       β”œβ”€β”€ README.md
β”‚       β”œβ”€β”€ tokenizer_config.json
β”‚       └── tokenizer.json
β”œβ”€β”€ TabRec
β”‚   └── StructEqTable
β”‚       β”œβ”€β”€ config.json
β”‚       β”œβ”€β”€generation_config.json
β”‚       β”œβ”€β”€model.safetensors
β”‚       β”œβ”€β”€preprocessor_config.json
β”‚       β”œβ”€β”€special_tokens_map.json
β”‚       β”œβ”€β”€spiece.model
β”‚       β”œβ”€β”€tokenizer_config.json
β”‚       └──tokenizer.json
└── README.md
```