metadata
title: Surya OCR Studio
emoji: 📄
colorFrom: green
colorTo: blue
sdk: gradio
sdk_version: 6.11.0
app_file: app.py
pinned: false
Surya OCR Studio
A complete document OCR toolkit supporting 90+ languages, running on ZeroGPU.
Features
| Feature | Description |
|---|---|
| OCR | Text recognition in 90+ languages |
| Text Detection | Line-level text detection |
| Layout Analysis | Identify tables, figures, headers, captions, etc. |
| Table Recognition | Extract table structure to Markdown |
| LaTeX OCR | Convert equation images to LaTeX |
Usage
OCR
- Upload an image (JPG, PNG, etc.)
- Specify language codes (e.g., "en" or "en, pt, es")
- Click "Run OCR"
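The language field accepts either a single code or a comma-separated list. As a rough illustration of how such input might be normalized before it reaches the OCR model, here is a minimal sketch. The function name `parse_language_codes` is hypothetical and is not taken from this Space's `app.py`; the lowercasing and de-duplication are assumptions about reasonable behavior, not documented behavior.

```python
def parse_language_codes(raw: str) -> list[str]:
    """Split a string such as "en, pt, es" into a clean list of codes.

    Hypothetical helper: trims whitespace, lowercases, skips empty
    entries, and drops duplicates while keeping the original order.
    """
    codes: list[str] = []
    for part in raw.split(","):
        code = part.strip().lower()
        if code and code not in codes:
            codes.append(code)
    return codes


if __name__ == "__main__":
    print(parse_language_codes("en"))
    print(parse_language_codes("en, pt, es"))
```

Keeping the order of the codes as typed means the first language listed stays first, which may matter if the app treats it as the primary language.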
Table Recognition
- Upload an image of a table
- Click "Recognize Table"
- Get Markdown output
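To make the "Markdown output" step concrete, the sketch below shows one way a recognized table could be rendered as Markdown once it has been reduced to rows of cell text. It does not use Surya's API: the input shape (a list of rows, each a list of strings, with the first row treated as the header) and the function name `rows_to_markdown` are assumptions made for illustration only.

```python
def rows_to_markdown(rows: list[list[str]]) -> str:
    """Render rows of cell text as a Markdown table.

    Hypothetical helper: the first row is used as the header, short rows
    are padded with empty cells, and literal pipes are escaped.
    """
    if not rows:
        return ""
    width = max(len(row) for row in rows)

    def format_row(cells: list[str]) -> str:
        padded = list(cells) + [""] * (width - len(cells))
        escaped = [cell.strip().replace("|", "\\|") for cell in padded]
        return "| " + " | ".join(escaped) + " |"

    header = format_row(rows[0])
    separator = "|" + "|".join(["---"] * width) + "|"
    body = [format_row(row) for row in rows[1:]]
    return "\n".join([header, separator, *body])


if __name__ == "__main__":
    print(rows_to_markdown([["Feature", "Description"], ["OCR", "Text recognition"]]))
```

Padding short rows keeps every line at the same column count, which Markdown renderers generally need in order to display the table correctly.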
LaTeX OCR
- Upload a cropped equation image
- Click "Extract LaTeX"
- Copy the LaTeX code
Supported Languages
English (en), Portuguese (pt), Spanish (es), French (fr), German (de), Italian (it), Dutch (nl), Russian (ru), Chinese (zh), Japanese (ja), Korean (ko), Arabic (ar), Hindi (hi), and 80+ more.
Model
- Surya OCR by datalab-to
License
- Code: GPL-3.0
- Model weights: Modified AI Pubs Open Rail-M license (free for research, personal use, and startups under $2M funding/revenue)