Changelog

Version history and release notes.

v1.2.0

PDF extraction and structured data parsing.

  • Extract text and metadata from remote PDF files
  • JSON-LD and Microdata structured data extraction
  • Landing site with individual tool pages for SEO
  • Blog section with CMS-managed content

v1.1.0

Metadata and site intelligence endpoints.

  • HTTP headers endpoint with security-header audit and score
  • Meta tags extraction (title, canonical, charset, language)
  • sitemap.xml and sitemap index parsing
  • robots.txt parsing into structured rules per user-agent
  • Open Graph and Twitter Card link preview endpoint

v1.0.0

Initial release.

  • Article text extraction with title, author, date, and reading time
  • Markdown extraction for LLM and RAG pipelines
  • Link extraction with internal/external classification
  • Image extraction with dimensions, alt text, and srcset
  • Page text extraction with optional CSS selector scoping
  • Health check endpoint and Scalar API documentation