Models specialized in extracting structured information (JSON) from text, PDFs, scans, spreadsheets, etc.
AI & ML interests
Interactive NLP development
Organization Card
We are a startup building the NuExtract Platform.
We also develop open-source Information Extraction foundation models that we share here. They are often SOTA in their category and always released under the MIT license; use them without restrictions.
Spaces (6)
NuMarkdown 8b Thinking • Running on L40S • 33 • Reasoning model specialized for OCR/Markdown generation.
NuExtract 2.0 • Sleeping • 11 • Space for numind/NuExtract-2.0-4B
NuExtract 1.5 • Runtime error • 77 • Playground for NuExtract-v1.5
NuNER_Zero • Running on T4 • 35 • Identify and highlight key entities in text
NuExtract • Paused • 71
Models (30)
numind/NuMarkdown-8B-Thinking • Image-to-Text • 8B • Updated • 8.01k • 192
numind/NuExtract-2.0-8B-GPTQ • Image-Text-to-Text • 3B • Updated • 273 • 4
numind/NuExtract-2.0-8B • Image-Text-to-Text • 8B • Updated • 3.66k • 31
numind/NuExtract-2.0-4B • Image-Text-to-Text • 4B • Updated • 2.13k • 19
numind/NuExtract-2.0-2B • Image-Text-to-Text • 2B • Updated • 4.88k • 29
numind/NuExtract-1.5 • Text Generation • 4B • Updated • 81k • 236
numind/NuExtract-2.0-4B-GPTQ • Image-Text-to-Text • 1B • Updated • 122 • 2
numind/NuExtract-2-1B-experimental • Text Generation • 0.9B • Updated • 8 • 1
numind/NuExtract-2-2B-experimental • Text Generation • 2B • Updated • 12 • 8
numind/NuExtract-2-4B-experimental • Text Generation • 4B • Updated • 11 • 3