Scrape Markdown
Scrapes the given URL into LLM-usable Markdown.
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
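As a minimal sketch of the header format described above, the following builds an unsent request with the Bearer header attached. The endpoint URL and the function name are hypothetical; this reference does not state the real base URL.

```python
import urllib.request

# Hypothetical endpoint: the real base URL is not given in this reference.
ENDPOINT = "https://api.example.com/scrape/markdown"


def build_authorized_request(token: str, endpoint: str = ENDPOINT) -> urllib.request.Request:
    """Return an unsent request carrying the Bearer authentication header."""
    if not token or token != token.strip():
        raise ValueError("token must be non-empty and free of surrounding whitespace")
    request = urllib.request.Request(endpoint)
    # Header value has the form "Bearer <token>".
    request.add_header("Authorization", f"Bearer {token}")
    return request
```

Nothing is sent here; the request object only shows where the token goes.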
Query Parameters
Full URL to scrape into LLM-usable Markdown (must include the http:// or https:// protocol)
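The protocol requirement can be checked on the client before a request is made. The sketch below is one way to do that with the standard library; the query parameter name `url` is an assumption, since the parameter names are not shown in this reference.

```python
from urllib.parse import urlencode, urlsplit


def validate_target_url(target: str) -> str:
    """Return the URL unchanged if it has an http(s) scheme and a host."""
    parts = urlsplit(target)
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError("target URL must start with http:// or https://")
    return target


def target_query(target: str) -> str:
    """Percent-encode the target URL as a query string.

    The parameter name "url" is hypothetical.
    """
    return urlencode({"url": validate_target_url(target)})
```

Encoding matters because the target URL may itself contain `?`, `&`, or `=`, which would otherwise be read as part of the outer query string.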
Preserve hyperlinks in Markdown output
Include image references in Markdown output
Shorten base64-encoded image data in the Markdown output
Extract only the main content of the page, excluding headers, footers, sidebars, and navigation
PDF parsing controls. Use start/end to limit text extraction and OCR to an inclusive 1-based page range.
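To make "inclusive 1-based page range" concrete, the sketch below applies the same convention to a Python list of page texts. It only illustrates the indexing; the service performs the selection itself, and treating an omitted start or end as "first page" or "last page" is an assumption.

```python
from typing import List, Optional


def select_pages(pages: List[str], start: Optional[int] = None,
                 end: Optional[int] = None) -> List[str]:
    """Return pages start..end, where both bounds are 1-based and inclusive."""
    if start is not None and start < 1:
        raise ValueError("start is 1-based and must be >= 1")
    first = 1 if start is None else start
    if end is not None and end < first:
        raise ValueError("end must be >= start")
    last = len(pages) if end is None else end
    # Page 1 lives at index 0, and the inclusive end needs no "+ 1"
    # because Python slices exclude their upper bound.
    return pages[first - 1:last]
```

With five pages, `start=2` and `end=4` select the second, third, and fourth pages.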
When true, the contents of iframes are rendered to Markdown.
CSS selectors. When provided, only matching HTML subtrees (and their descendants) are kept before conversion to Markdown. When omitted, the entire document is kept. Examples: "article.main", "#content", "[role=main]".
Limits for the include selectors: 50, 2048
CSS selectors to remove before conversion to Markdown. Applied after includeSelectors. Exclusion takes precedence: an element matching both is removed. Examples: "nav", "footer", ".ad-banner", "[aria-hidden=true]".
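The order of the two selector steps can be sketched on a toy document tree. This is an illustration under simplifying assumptions, not the service's implementation: selectors are reduced to plain tag names, and `Node`, `filter_tree`, and `flatten` are hypothetical names introduced here.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set


@dataclass
class Node:
    tag: str
    children: List["Node"] = field(default_factory=list)


def included_roots(node: Node, include: Set[str]) -> List[Node]:
    """Step 1: keep only matching subtrees, or the whole tree if no includes."""
    if not include or node.tag in include:
        return [node]
    roots: List[Node] = []
    for child in node.children:
        roots.extend(included_roots(child, include))
    return roots


def prune(node: Node, exclude: Set[str]) -> Optional[Node]:
    """Step 2: drop excluded elements together with their descendants."""
    if node.tag in exclude:
        return None
    kept = [p for c in node.children if (p := prune(c, exclude)) is not None]
    return Node(node.tag, kept)


def filter_tree(root: Node, include: Set[str], exclude: Set[str]) -> List[Node]:
    """Apply includes first, then excludes, so exclusion wins on a double match."""
    return [p for r in included_roots(root, include)
            if (p := prune(r, exclude)) is not None]


def flatten(nodes: List[Node]) -> List[str]:
    """List the surviving tags in document order."""
    tags: List[str] = []
    for node in nodes:
        tags.append(node.tag)
        tags.extend(flatten(node.children))
    return tags
```

Because pruning runs on the already-included subtrees, an element that matches both lists is removed, which is the precedence rule stated above.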
Limits for the exclusion selectors: 50, 2048
Return a cached result if a prior scrape for the same parameters exists and is younger than this many milliseconds. Defaults to 1 day (86400000 ms) when omitted. Max is 30 days (2592000000 ms). Set to 0 to always scrape fresh.
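The cache rule above amounts to a single comparison. The sketch below restates it with the documented default and maximum; the function and constant names are hypothetical, and the real decision is made by the service.

```python
from typing import Optional

DEFAULT_MAX_AGE_MS = 86_400_000      # 1 day, used when the value is omitted
MAX_MAX_AGE_MS = 2_592_000_000       # 30 days, the documented maximum


def can_serve_from_cache(cached_age_ms: int, max_age_ms: Optional[int] = None) -> bool:
    """Return True if a cached scrape of this age may be returned."""
    if max_age_ms is None:
        max_age_ms = DEFAULT_MAX_AGE_MS
    if not 0 <= max_age_ms <= MAX_MAX_AGE_MS:
        raise ValueError("max age must be between 0 and 2592000000 ms")
    # "Younger than" is a strict comparison, so 0 never matches
    # and always forces a fresh scrape.
    return cached_age_ms < max_age_ms
```

For example, a result cached one hour ago (3600000 ms) is reused under the default but not when the value is set to 0.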
Allowed range for the cache age: 0 <= x <= 2592000000
Optional browser wait time in milliseconds after initial page load before converting the page to Markdown. Min: 0. Max: 30000 (30 seconds).
Allowed range for the wait time: 0 <= x <= 30000
Optional outbound HTTP headers forwarded only to the target URL, sent as deep-object query params such as headers[X-Custom]=value. When provided, caching is bypassed: the result is neither read from nor written to cache.
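A deep-object query parameter puts each header name in square brackets after the parameter name. The sketch below shows one way to produce that form with the standard library. The `headers[...]` shape comes from the description above; the `url` parameter name and the helper names are assumptions.

```python
from typing import Dict, List, Tuple
from urllib.parse import urlencode


def encode_headers(headers: Dict[str, str]) -> List[Tuple[str, str]]:
    """Turn {"X-Custom": "value"} into [("headers[X-Custom]", "value")]."""
    return [(f"headers[{name}]", value) for name, value in headers.items()]


def build_query(target: str, headers: Dict[str, str]) -> str:
    """Build a query string; the "url" parameter name is hypothetical."""
    pairs = [("url", target)] + encode_headers(headers)
    # urlencode percent-encodes the brackets (%5B, %5D); servers decode
    # them back to headers[X-Custom] when parsing the query string.
    return urlencode(pairs)


def cache_is_used(headers: Dict[str, str]) -> bool:
    """Per the description above, supplying any header bypasses the cache."""
    return not headers
```

Since forwarded headers disable caching in both directions, they are best sent only when the target page actually needs them.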
Optional timeout in milliseconds for the request. If the request takes longer than this value, it will be aborted with a 408 status code. Maximum allowed value is 300000ms (5 minutes).
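A client can check the timeout against the documented bounds (1000 to 300000 ms) before sending it, and treat a 408 status as the timeout signal. The helper names below are hypothetical, and this is only a sketch of that client-side handling.

```python
MIN_TIMEOUT_MS = 1_000      # documented lower bound
MAX_TIMEOUT_MS = 300_000    # documented upper bound (5 minutes)


def validate_timeout_ms(timeout_ms: int) -> int:
    """Return the timeout unchanged if it lies within the documented range."""
    if not MIN_TIMEOUT_MS <= timeout_ms <= MAX_TIMEOUT_MS:
        raise ValueError("timeout must be between 1000 and 300000 ms")
    return timeout_ms


def timed_out(status_code: int) -> bool:
    """A 408 status means the request exceeded its timeout and was aborted."""
    return status_code == 408
```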
Allowed range for the timeout: 1000 <= x <= 300000
Response
Successful response
Indicates success. Value: true
Page content converted to GitHub Flavored Markdown
The URL that was scraped
Metadata extracted from the scraped page HTML.
Metadata about the API key used for the request. Included in every response whenever a valid API key is provided, even when the response status is not 200.