PDF Text Extractor
Download a remote PDF and extract text through the live scrape PDF endpoint
POST
/v1/scrape/pdf
Description
Download a remote PDF and extract text through the live scrape PDF endpoint
Parameters
| Name | Type | Required | Description |
|---|---|---|---|
| x-api-key | string | optional |
How to Use
1
1. POST the PDF URL to `/v1/scrape/pdf`. 2. Optionally provide a page range such as `1-3` or `1,3,5`. 3. Use the `text` field for combined output or `pages` for per-page processing.
About This Tool
Use the scrape PDF endpoint when the source content is published as a remote PDF rather than HTML. The API downloads the file, extracts text, and returns both aggregate and per-page results.
Why Use This Tool
- Report ingestion — Pull readable content from public PDFs
- AI workflows — Feed extracted document text into summarization or retrieval pipelines
- Compliance and research — Search across PDF-based publications
- Document indexing — Normalize PDF text for downstream systems
Frequently Asked Questions
Does this run OCR on scanned PDFs?
No — this endpoint focuses on embedded PDF text extraction.
Can I limit extraction to specific pages?
Yes. Use the `pages` field to narrow the range.
Is this separate from the unified scrape endpoint?
Yes. PDF extraction remains a dedicated route because the input type and processing flow are different.
Start using PDF Text Extractor now
Get your free API key and make your first request in under a minute.