Surya-OCR / README.md
artificialguybr's picture
Update README.md
23374dd verified

A newer version of the Gradio SDK is available: 6.15.2

Upgrade
metadata
title: Surya OCR Studio
emoji: 📄
colorFrom: green
colorTo: blue
sdk: gradio
sdk_version: 6.11.0
app_file: app.py
pinned: false

Surya OCR Studio

Complete document OCR toolkit supporting 90+ languages with ZeroGPU.

Features

Feature Description
OCR Text recognition in 90+ languages
Text Detection Line-level text detection
Layout Analysis Identify tables, figures, headers, captions, etc.
Table Recognition Extract table structure to Markdown
LaTeX OCR Convert equation images to LaTeX

Usage

OCR

  1. Upload an image (JPG, PNG, etc.)
  2. Specify language codes (e.g., "en" or "en, pt, es")
  3. Click "Run OCR"

Table Recognition

  1. Upload an image of a table
  2. Click "Recognize Table"
  3. Get Markdown output

LaTeX OCR

  1. Upload a cropped equation image
  2. Click "Extract LaTeX"
  3. Copy the LaTeX code

Supported Languages

English (en), Portuguese (pt), Spanish (es), French (fr), German (de), Italian (it), Dutch (nl), Russian (ru), Chinese (zh), Japanese (ja), Korean (ko), Arabic (ar), Hindi (hi), and 80+ more.

Model

License

  • Code: GPL-3.0
  • Model weights: Modified AI Pubs Open Rail-M license (free for research, personal use, and startups under $2M funding/revenue)

Space by

@artificialguybr