# Load images from PDF files (using PDF2Image)
!!! danger "Warning"
    Unlike the other image loaders in `document_loader.image_loader`, this loader does not extract the images embedded in the PDF.
    Instead, it renders each selected page of the PDF as a snapshot image.
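
To illustrate what a page snapshot means in practice, here is a minimal sketch that calls `pdf2image` directly. It is not the implementation of `PDF2ImageLoader`; the helper name `snapshot_pages` and its parameters are hypothetical, and it assumes `pdf2image` and its Poppler dependency are installed.

```python
from pathlib import Path


def snapshot_pages(pdf_path, output_dir, first_page=1, last_page=None, dpi=200):
    """Render selected PDF pages to PNG files and return the saved paths.

    Hypothetical helper for illustration only; not part of medrag_multi_modal.
    """
    # Validate the 1-based page range before doing any work.
    if first_page < 1:
        raise ValueError("first_page must be >= 1")
    if last_page is not None and last_page < first_page:
        raise ValueError("last_page must be >= first_page")

    # Imported here so the range check above works even without pdf2image.
    from pdf2image import convert_from_path

    output_dir = Path(output_dir)
    output_dir.mkdir(parents=True, exist_ok=True)

    # Each returned item is a PIL image of a whole rendered page,
    # not an image object embedded in the PDF.
    pages = convert_from_path(
        str(pdf_path), dpi=dpi, first_page=first_page, last_page=last_page
    )

    saved = []
    for offset, page in enumerate(pages):
        target = output_dir / f"page_{first_page + offset}.png"
        page.save(target, "PNG")
        saved.append(target)
    return saved
```

A call such as `snapshot_pages("report.pdf", "snapshots", first_page=1, last_page=3)` would write one PNG per rendered page, with `report.pdf` standing in for any local PDF file.
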
??? note "Note"
    **Underlying Library:** `pdf2image`

    Render PDF pages as images using `pdf2image`.

    Use it in our library with:
    ```python
    from medrag_multi_modal.document_loader.image_loader import PDF2ImageLoader
    ```

    For details and available `**kwargs`, please refer to the sources below.

    **Sources:**

    - [GitHub](https://github.com/Belval/pdf2image)
    - [PyPI](https://pypi.org/project/pdf2image/)
::: medrag_multi_modal.document_loader.image_loader.pdf2image_img_loader