Question 1

Which languages can the OCR engine recognise?

Accepted Answer

English, Spanish, French, German, Italian, Dutch, Portuguese, Polish, Swedish, Turkish, Russian, Arabic, Hindi, Thai, Vietnamese, Chinese (Simplified and Traditional), Japanese, Korean, and Indonesian — 20 in all. Pick the closest match for your image. Mixed-language images work best when you choose the script that covers most of the text.

Question 2

What kind of accuracy should I expect?

Accepted Answer

On clean print at 300 dpi or higher (screenshots, scanned PDFs), accuracy is typically 95% or better. Handwriting, low contrast, motion blur, or photos taken at an angle drop accuracy significantly. The confidence score next to the result tells you how sure Tesseract is.

Question 3

Why does the language pack take a few seconds to load the first time?

Accepted Answer

Each Tesseract language model is around 10–20 MB and downloads once when you first pick that language. After the first use it stays cached in your browser, so subsequent recognitions on the same language are near-instant.

Question 4

Can I paste a screenshot from the clipboard instead of uploading a file?

Accepted Answer

Yes. Take a screenshot (Win+Shift+S on Windows, Cmd+Ctrl+Shift+4 on macOS), then click Paste Image. The tool reads the image directly from the clipboard without saving a file to disk.

Question 5

Will my images be sent anywhere?

Accepted Answer

No. Tesseract runs entirely inside the page via WebAssembly. The image bytes never leave your device. You can confirm this by disabling the network in DevTools and watching the recognition still complete.

OCR Image to Text

What is OCR Image to Text?

How to use

When to use

Result

FAQ

Related Tools

PDF Bookmark Editor

PDF Flatten

Rich Text Editor

Markdown to PDF

PDF Crop

PDF Page Reorderer

OCR Image to Text