Home/Formats/PDF → Text
Converter · PDF → Text (OCR)

PDF to
text.

Free · OCR · 20 MB · batch of 5 Works on real text PDFs and scanned image-PDFs alike.

Drop a PDF, get a .txt.

Use the converter on the home page and pick OCR (Extract Text).

Open the converter →

What you get

A plain UTF-8 .txt file containing the text from the PDF, page-marked. The text is extracted via Google Cloud Vision OCR, so it works whether the PDF is real text or a scanned image of text — both come out the same way on the other side.

Why OCR a PDF?

How it works

  1. Open the home page, drop your PDF in the box.
  2. Pick OCR (Extract Text) from the dropdown.
  3. Hit Convert. Each page is rendered to an image, OCR'd, and assembled into a single text file.
  4. Click Download. Each page is delimited with --- Page N --- for easy splitting.

What works well

What doesn't

Tips

FAQ

Will it work on a scanned PDF? Yes — that's exactly what OCR is for.

Does it preserve formatting? No. Output is plain text, page-delimited. For formatted output, OCR to text first, then format manually.

What languages? English by default. Most Latin-script European languages work. CJK and right-to-left scripts are on request.

How private is this? The PDF and the resulting text are auto-deleted after one hour. See Security.

Related