NuExtract3 released: open-weight 4B VLM for Markdown, OCR and structured extraction (self-hostable)

r/LocalLLaMA
Computer Vision NLP Open Source AI AI Tools

Disclaimer: I work for Numind, the company behind this open-weight model TLDR: Image/text to Markdown:-) We just released a 4B model based on Qwen3.5-4B, under Apache-2.0 license. The goal is to make information extraction from complex documents practical with an open model: PDFs, screenshots, forms, tables, receipts, invoices, multi-page documents, and other visually structured inputs. If you ever used NuMarkdown, this is its successor! Try it, we have a huggingface space that is completely free (you don't even have to sign-up): If you ever used NuMarkdown, NuExtract3 is the successor.