Article Extractor
Extract article-style content and metadata via the unified scrape endpoint
POST
/v1/scrape
Description
Extract article-style content and metadata via the unified scrape endpoint
Parameters
| Name | Type | Required | Description |
|---|---|---|---|
| x-api-key | string | optional |
How to Use
1
1. Send a POST request to `/v1/scrape` with a `url`. 2. Set `output` to `clean` for readable article-style text. 3. Add `extract: {"article": true}` to include metadata in the response. 4. Read the main text from `content` and the metadata from `article`.
About This Tool
Article Extractor is now exposed through the unified scrape endpoint. Request `output: "clean"` and enable `extract.article` to get readable article content plus extracted metadata such as title, author, date, language, and excerpt.
This is the best fit when you care about the main body of an article rather than raw HTML.
Why Use This Tool
- Content aggregation — Pull readable articles for internal feeds or dashboards
- Knowledge ingestion — Prepare articles for AI search, retrieval, and summarization
- Editorial tooling — Inspect metadata and content quality across publications
- Research workflows — Normalize public articles into structured JSON
Frequently Asked Questions
Is this still a dedicated endpoint?
No. The capability is now part of the unified scrape response model.
Can I still get Markdown instead?
Yes. If you want more structure, use `output: "markdown"`. If you want a more article-like body, use `clean`.
Does it work for paywalled pages?
Only if the content is publicly accessible to the request context you provide.
Start using Article Extractor now
Get your free API key and make your first request in under a minute.