Drop your scanned PDF here or click to choose
PDF only · up to 25 MB · up to 50 pages
Press Ctrl+S to download Word, Ctrl+Shift+S for text, Ctrl+F to search, Esc to cancel.
Extract text from any scanned or image-based PDF and download it as a Word document or plain text file. Runs entirely in your browser: your documents never leave your device.
Last updated: April 2026
OCR (Optical Character Recognition) is a technology that converts images of text (like scanned documents or photos of printed pages) into editable, searchable text. This tool takes a scanned or image-based PDF, runs OCR on every page, and gives you back editable Word (.docx) and plain text (.txt) files.
Everything runs in your browser using WebAssembly (WASM). There is no server, no upload, and no sign-up. The Tesseract OCR engine is loaded once on first use (~2 MB) and then cached for instant re-use, so the tool even works offline after that first load.
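The load-once-then-cache behaviour described above can be sketched as a cache-first lookup. This is a minimal illustration, not the tool's actual code: `AssetStore`, `MemoryStore`, `loadCached`, and the `fetchBytes` callback are hypothetical names, and a real page would more likely back the store with persistent browser storage so that entries survive a reload.

```typescript
// Hypothetical storage interface; a real page would probably back this with
// persistent browser storage so entries survive a reload.
interface AssetStore {
  get(key: string): Promise<Uint8Array | undefined>;
  set(key: string, bytes: Uint8Array): Promise<void>;
}

// In-memory store, used here only so the sketch is self-contained.
class MemoryStore implements AssetStore {
  private readonly entries = new Map<string, Uint8Array>();

  async get(key: string): Promise<Uint8Array | undefined> {
    return this.entries.get(key);
  }

  async set(key: string, bytes: Uint8Array): Promise<void> {
    this.entries.set(key, bytes);
  }
}

// Cache-first load: the network is used only when the asset is not stored yet.
async function loadCached(
  key: string,
  store: AssetStore,
  fetchBytes: (key: string) => Promise<Uint8Array>,
): Promise<Uint8Array> {
  const cached = await store.get(key);
  if (cached !== undefined) {
    return cached; // served locally, so this path also works offline
  }
  const bytes = await fetchBytes(key);
  await store.set(key, bytes);
  return bytes;
}
```

Calling `loadCached` twice with the same key invokes `fetchBytes` only once; the second call is answered from the store.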
Drag your PDF onto the drop zone or click to choose a file. Up to 25 MB and 50 pages per document.
Pick one or more languages so Tesseract loads the right traineddata. Defaults to your browser language; supports 10+ languages.
Each page is rendered to a canvas and passed to Tesseract locally. You see progress, a live thumbnail, and the text as it streams in.
Review and inline-edit the extracted text, then download as Word, plain text, or both as a ZIP. Or send it straight into our translation service.
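The first two steps above can be sketched as small checks that run before any OCR work starts. This is only an illustration under stated assumptions: the function names, the locale-to-language table, the `eng` fallback, and the reading of "25 MB" as 25 × 1024 × 1024 bytes are hypothetical and not taken from the tool. The three-letter codes are the names Tesseract uses for its traineddata files, and joining them with `+` is how Tesseract accepts several languages at once.

```typescript
const MAX_BYTES = 25 * 1024 * 1024; // assumption: "25 MB" is read as 25 MiB
const MAX_PAGES = 50;

// Returns a list of problems; an empty list means the file is acceptable.
function checkLimits(mimeType: string, sizeBytes: number, pageCount: number): string[] {
  const problems: string[] = [];
  if (mimeType !== "application/pdf") problems.push("PDF files only");
  if (sizeBytes > MAX_BYTES) problems.push("file is larger than 25 MB");
  if (pageCount > MAX_PAGES) problems.push("document has more than 50 pages");
  return problems;
}

// Hypothetical mapping from browser language subtags to Tesseract codes,
// limited to English plus the languages the page names.
// Assumption: "zh" maps to Simplified Chinese (chi_sim).
const TESSERACT_CODES: Record<string, string> = {
  en: "eng", es: "spa", fr: "fra", de: "deu", zh: "chi_sim",
  ja: "jpn", ko: "kor", ar: "ara", ru: "rus",
};

// Turns locales such as ["fr-CA", "en-US"] into "fra+eng".
function toTesseractLangs(locales: readonly string[]): string {
  const codes: string[] = [];
  for (const locale of locales) {
    const primary = locale.toLowerCase().split("-")[0] ?? "";
    const code = TESSERACT_CODES[primary];
    if (code !== undefined && codes.indexOf(code) === -1) codes.push(code);
  }
  return codes.length > 0 ? codes.join("+") : "eng"; // assumed fallback
}
```

In a browser, `toTesseractLangs(navigator.languages)` would give a default based on the user's language preferences, which the language picker could then override.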
Run OCR in any of the languages below, including Spanish, French, German, Chinese, Japanese, Korean, Arabic, and Russian. Language data is fetched the first time you use a language and cached for future runs.
Yes. This OCR tool runs 100% in your browser using WebAssembly. Your PDF is never uploaded to any server, we never see it, and no account is required.
If you then translate the extracted text using our document translation service, that upload is encrypted in transit and the original file is deleted within 7 days. OCR and translation are separate: OCR happens entirely on your device.
| Method | Privacy | Cost | Accuracy |
|---|---|---|---|
| Browser OCR (this tool) | 100% private, runs locally | Free | Good on clean scans (≥ 300 DPI) |
| Cloud OCR (Adobe, Google) | Files uploaded to third-party servers | Subscription or pay-per-page | Excellent, but depends on vendor |
| Desktop software | Private, runs on your machine | $50–$500 one-time or subscription | Excellent with handwriting add-ons |
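To put the 300 DPI figure from the table in concrete terms, the arithmetic below converts a PDF page size to pixel dimensions. PDF page sizes are measured in points, 72 to the inch, so a page rendered at a given DPI is scaled by DPI / 72. The function name is hypothetical, and nothing here is taken from the tool, whose actual render scale is not stated on this page. Rendering a low-resolution scan at a larger size does not add detail; the DPI that matters most is the one the document was scanned at.

```typescript
const POINTS_PER_INCH = 72; // default PDF user-space unit

// Pixel dimensions of a page rendered at the given DPI.
function pixelSizeAtDpi(widthPt: number, heightPt: number, dpi: number) {
  return {
    scale: dpi / POINTS_PER_INCH,
    width: Math.round((widthPt * dpi) / POINTS_PER_INCH),
    height: Math.round((heightPt * dpi) / POINTS_PER_INCH),
  };
}

// US Letter is 612 x 792 points (8.5 x 11 inches).
const letterAt300 = pixelSizeAtDpi(612, 792, 300);
console.log(letterAt300.width, letterAt300.height); // 2550 3300
```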
Already have editable text? Use our free Word Counter to count words and pages →
We translate OCR'd documents into 30+ languages using DeepL neural machine translation. Click below and we'll pre-load your extracted text into the translation flow.
Translate a Document