Features/Scrape API
API

Every format your
pipeline needs.

Seven endpoint types. One base URL. The same crawling infrastructure that powers Mightstill's alerts — available as a direct API.

Base URLhttps://api.mightstill.com/v1
GET/scrape/markdownTEXT

Markdown

Clean, structured markdown extracted from any URL. Strips navigation, ads, and boilerplate. Ideal for LLM ingestion and diff analysis.

LLM contextDiff analysisContent extractionRAG pipelines

Example

# Pricing — Acme AI

## Growth
**$49/month** — up to 15 seats
- Unlimited projects
- Priority support
GET/scrape/linksJSON

Links

All hyperlinks extracted from a page with anchor text and destination URLs. Useful for mapping site structure and detecting new pages.

Site mappingNew page detectionInternal link analysisRedirect tracking

Example

{
  "links": [
    { "text": "Pricing", "href": "/pricing", "internal": true },
    { "text": "Sign up", "href": "/signup", "internal": false }
  ],
  "count": 24
}
GET/scrape/brandingVISUAL

Branding

Extracts the full design system — logo, color palette, typography, tagline, and meta signals. Track positioning and visual identity changes over time.

Brand monitoringDesign system extractionPositioning trackingCompetitive profiling

Example

{
  "logo": "https://acme.ai/logo.svg",
  "tagline": "The AI platform for enterprise",
  "colors": { "primary": "#0F172A", "accent": "#6366F1" },
  "fonts": ["Inter", "Cal Sans"]
}
GET/scrape/snapshotPNG

Snapshot

Full-page high-resolution screenshot. Returns binary PNG or JSON wrapper with image URL. Supports Desktop, Laptop, Tablet, and Mobile breakpoints.

Visual change detectionDesign archivingAI vision inputDirectory screenshots

Example

GET /scrape/snapshot?viewport=1440x3200

Content-Type: image/png
X-Page-Width: 1440
X-Page-Height: 3200
X-Captured-At: 2026-04-25T14:32:00Z
POST/scrape/jsonJSON

Structured JSON

Pass a schema, get back typed, validated JSON. No selectors needed — the LLM infers the structure from the page content.

Pricing extractionFeature listsTeam pagesAny structured data

Example

// Request schema
{ "plans": "array", "has_free_tier": "boolean" }

// Response
{ "plans": [{ "name": "Growth", "price": 49 }],
  "has_free_tier": true }
GET/scrape/rawHTML

Raw HTML

The full rendered HTML after JavaScript execution. Includes dynamically loaded content. Use when you need the complete DOM.

Custom parsingDOM inspectionJS-rendered contentFull page archive

Example

<!DOCTYPE html>
<html lang="en">
<head><title>Pricing — Acme AI</title></head>
<body>
  <section class="pricing-hero">
    <h1>Pricing</h1>
  </section>
</body>
</html>
POST/scrape/crawlJSON

Crawl Site

Recursively crawl an entire domain up to a configurable depth. Returns a sitemap of discovered URLs with page types inferred for each.

Full site mappingPage discoverySitemap generationCompetitor audits

Example

// Request
{ "domain": "acme.ai", "depth": 2, "limit": 50 }

// Response
{ "pages": [
  { "url": "/pricing", "type": "pricing" },
  { "url": "/features", "type": "features" }
], "count": 5 }

Ready to start scraping?

Full API access on all plans. 14-day free trial.