Skip to main content

PDF Text Extractor

Extract selectable text from a PDF and copy or download it as plain text.

Your files stay in your browser

How it works

  1. 1

    Upload your PDF

    Choose a PDF with a text layer (not a pure scan).

  2. 2

    Read extracted text

    View all text or page-by-page; empty pages are flagged.

  3. 3

    Copy or download

    Copy to clipboard or download as .txt.

Common use cases

  • Report

    A 10-page PDF

  • Invoice

    A PDF invoice

About This Tool

Pull the text layer out of any PDF so you can copy, search, edit, or feed it into other tools. Upload the PDF and the extractor reads the embedded text for every page, then presents it in a single view or page-by-page. Copy to clipboard or download as .txt.

Ideal for PDFs with a real text layer — reports, invoices, e-books. Scanned image-only pages are flagged (no OCR is performed).

**How It Works**

Most PDFs store a text layer alongside the visual rendering. pdf.js reads it item-by-item, reconstructing line breaks from layout metadata. The result is plain text you can paste anywhere. Extraction happens in your browser; the PDF is never uploaded. For scanned PDFs, OCR would be required (out of scope).

More examples

Examples

Report

Input

A 10-page PDF

Output

Plain text of all pages

Invoice

Input

A PDF invoice

Output

Text for accounting software
Frequently Asked Questions
Why no text from my PDF?
Scanned images have no text layer — only pixels. This reads text layers and does not do OCR; such pages are flagged.
Specific pages only?
Use the "Per page" view to see each page's text, then copy what you need.
Are formatting and images preserved?
No — output is plain text. Use PDF to Image for visual content.
Is my PDF uploaded?
No — pdf.js extracts locally.