Skip to main content

Mathpix OCR API

Mathpix OCR recognizes printed and handwritten STEM document content, including math, text, tables, and chemistry diagrams, from images, stroke data, or PDF files.

The primary output format is Mathpix Markdown (MMD), a superset of Markdown with support for math equations, tables, chemistry diagrams, and other STEM content. The API also provides document conversion endpoints to transform MMD into other formats like DOCX, LaTeX, HTML, PDF, and PPTX.

Guides

  1. Authentication
  2. Process a PDF
  3. Process an Image
  4. Batch Image Processing
  5. Convert Markdown
  6. Process Stroke Data

Reference

EndpointDescription
Process DocumentsSubmit documents for OCR (async)
Process ImagesProcess a single image
Process Equations (deprecated)Process a single equation image
Process StrokesProcess handwriting stroke coordinates
Batch ProcessingProcess multiple images in one request
Convert MarkdownConvert Mathpix Markdown to DOCX, LaTeX, HTML, etc.

Concepts

Support

Questions or problems? Email support@mathpix.com.