Skip to main content

Mathpix OCR API

Mathpix OCR recognizes printed and handwritten STEM document content — including math, text, tables, and chemistry diagrams — from images, stroke data, or PDF files.

The primary output format is Mathpix Markdown (MMD), a superset of Markdown that adds support for math equations, tables, chemistry diagrams, and other STEM content. The API also provides document conversion endpoints to transform MMD into other formats like DOCX, LaTeX, HTML, PDF, and PPTX.

Get started

  1. Get your API keys at console.mathpix.com
  2. Process a PDFstep-by-step guide
  3. Process an imagestep-by-step guide

Endpoints

EndpointDescription
POST v3/pdfProcess PDFs and documents (async)
POST v3/textProcess a single image
POST v3/converterConvert Mathpix Markdown to DOCX, LaTeX, HTML, etc.
POST v3/batchProcess multiple images in one request
POST v3/strokesProcess handwriting stroke coordinates

Support

Questions or problems? Email support@mathpix.com.