Markdown Extractor
Return clean Markdown from any public page via the unified scrape endpoint
POST
/v1/scrape
Description
Return clean Markdown from any public page via the unified scrape endpoint
Parameters
| Name | Type | Required | Description |
|---|---|---|---|
| x-api-key | string | optional |
How to Use
1
1. Send a JSON body with `url` and `output: "markdown"`. 2. Use `include_links`, `include_images`, and `include_tables` to control how much structure is preserved. 3. Read the Markdown from the `content` field in the response.
About This Tool
Markdown Extractor is now powered by the unified `POST /v1/scrape` endpoint. Set `output` to `markdown` and the API returns the main page content in a format that is ideal for documentation ingestion, RAG pipelines, and AI prompts.
This mode keeps the useful structure — headings, lists, links, and basic formatting — while stripping away most HTML noise.
Why Use This Tool
- LLM ingestion — Turn live webpages into prompt-ready Markdown
- Docs migration — Convert public docs pages into reusable Markdown assets
- Knowledge bases — Store normalized source content for internal search or retrieval
- Content monitoring — Compare Markdown snapshots over time
Frequently Asked Questions
Is this a separate endpoint?
No. This capability is part of the unified scrape API and is controlled through the `output` field.
Should I use Markdown or clean text?
Use Markdown when you want structure preserved. Use `clean` or `text` when you want flatter content.
Does it work well for AI use cases?
Yes — Markdown is often the best default for RAG and summarization workflows because it keeps semantic structure without raw HTML clutter.
Start using Markdown Extractor now
Get your free API key and make your first request in under a minute.