--- title: Surya OCR Studio emoji: 📄 colorFrom: green colorTo: blue sdk: gradio sdk_version: 6.11.0 app_file: app.py pinned: false --- # Surya OCR Studio Complete document OCR toolkit supporting 90+ languages with ZeroGPU. ## Features | Feature | Description | |---------|-------------| | **OCR** | Text recognition in 90+ languages | | **Text Detection** | Line-level text detection | | **Layout Analysis** | Identify tables, figures, headers, captions, etc. | | **Table Recognition** | Extract table structure to Markdown | | **LaTeX OCR** | Convert equation images to LaTeX | ## Usage ### OCR 1. Upload an image (JPG, PNG, etc.) 2. Specify language codes (e.g., "en" or "en, pt, es") 3. Click "Run OCR" ### Table Recognition 1. Upload an image of a table 2. Click "Recognize Table" 3. Get Markdown output ### LaTeX OCR 1. Upload a cropped equation image 2. Click "Extract LaTeX" 3. Copy the LaTeX code ## Supported Languages English (en), Portuguese (pt), Spanish (es), French (fr), German (de), Italian (it), Dutch (nl), Russian (ru), Chinese (zh), Japanese (ja), Korean (ko), Arabic (ar), Hindi (hi), and 80+ more. ## Model - [Surya OCR](https://github.com/datalab-to/surya) by datalab-to ## License - Code: GPL-3.0 - Model weights: Modified AI Pubs Open Rail-M license (free for research, personal use, and startups under $2M funding/revenue) ## Space by [@artificialguybr](https://twitter.com/artificialguybr)