🕸️

Markdown Extractor

Return clean Markdown from any public page via the unified scrape endpoint

POST /v1/scrape

Description

Return clean Markdown from any public page via the unified scrape endpoint

Parameters

Name	Type	Required	Description
x-api-key	string	optional

How to Use

1

1. Send a JSON body with `url` and `output: "markdown"`. 2. Use `include_links`, `include_images`, and `include_tables` to control how much structure is preserved. 3. Read the Markdown from the `content` field in the response.

About This Tool

Markdown Extractor is now powered by the unified `POST /v1/scrape` endpoint. Set `output` to `markdown` and the API returns the main page content in a format that is ideal for documentation ingestion, RAG pipelines, and AI prompts.

This mode keeps the useful structure — headings, lists, links, and basic formatting — while stripping away most HTML noise.

Why Use This Tool

LLM ingestion — Turn live webpages into prompt-ready Markdown
Docs migration — Convert public docs pages into reusable Markdown assets
Knowledge bases — Store normalized source content for internal search or retrieval
Content monitoring — Compare Markdown snapshots over time

Frequently Asked Questions

Is this a separate endpoint?

No. This capability is part of the unified scrape API and is controlled through the `output` field.

Should I use Markdown or clean text?

Use Markdown when you want structure preserved. Use `clean` or `text` when you want flatter content.

Does it work well for AI use cases?

Yes — Markdown is often the best default for RAG and summarization workflows because it keeps semantic structure without raw HTML clutter.

Start using Markdown Extractor now

Get your free API key and make your first request in under a minute.

Get Free API Key View Docs