Sponsored by Deepsite.site

Extrapify

Created By
christ0ppera month ago
Structured web context infrastructure for AI agents. Extract schema-guided structured JSON from websites using Claude-powered parsing and MCP-native workflows.
Overview

Extrapify MCP

Extract structured JSON from any public webpage inside Claude Desktop, Cursor, or any MCP-compatible client.

Define a schema. Point it at a URL. Get back validated, typed JSON.

Get an API key → extrapify.com


What this is

A thin, stateless MCP server that bridges MCP clients to the hosted Extrapify API.

Extraction does not happen inside this package. The MCP server forwards requests to the Extrapify API, which handles fetching, Browserless rendering for JS-heavy pages, Claude-powered extraction, schema validation, quota accounting, and observability on the backend.

  • MCP protocol server over stdio
  • One tool: extract_structured_data
  • Production-ready bridge, not a scraping framework
  • No extraction logic, no state, no side effects

Install

npm install

Copy .env.example to .env and fill in your credentials:

EXTRAPIFY_API_BASE_URL=https://extrapify.com
EXTRAPIFY_API_KEY=sk_live_your_key_here

Start the server:

npm run mcp:start

Claude Desktop setup

Add this block to your Claude Desktop config (claude_desktop_config.json):

{
  "mcpServers": {
    "extrapify": {
      "command": "node",
      "args": ["/absolute/path/to/extrapify-mcp/mcp/server.mjs"],
      "env": {
        "EXTRAPIFY_API_BASE_URL": "https://extrapify.com",
        "EXTRAPIFY_API_KEY": "sk_live_your_key_here"
      }
    }
  }
}

Restart Claude Desktop. The extract_structured_data tool will appear automatically.


Cursor setup

Cursor supports stdio MCP servers. Point it at node and the local mcp/server.mjs entrypoint with the same two environment variables.

See docs/mcp-install-examples.md for copy-paste configs.


Tool: extract_structured_data

Retrieve structured JSON from any public webpage using a schema you define.

Input:

{
  "url": "https://example.com/article",
  "mode": "auto",
  "schema": {
    "title": "string",
    "author": "string",
    "published_at": "date",
    "tags": "string[]"
  }
}

Output:

{
  "extracted": {
    "title": "How Claude Agents Are Changing Developer Workflows",
    "author": "Jane Smith",
    "published_at": "2026-04-15",
    "tags": ["AI", "agents", "developer tools"]
  },
  "type": "single",
  "count": 1,
  "confidence": 0.96,
  "tokens_used": 1820
}

Supported schema types: string, number, integer, float, boolean, date, datetime, url, and any of these as arrays using [] suffix (e.g. string[]).

Supported mode values:

  • auto — let Extrapify decide based on page structure
  • single — extract the primary item only
  • list — extract all matching items as an array

Schema templates

Starter schemas for common use cases (product pages, job listings, articles, company data) are in docs/schema-templates.md.


Example workflows

Agent patterns and demo workflows are in docs/demo-workflows.md.


Other compatible clients

Any MCP client that supports stdio transport works with this package. Typically you only need:

  • command: node
  • args: absolute path to mcp/server.mjs
  • env: EXTRAPIFY_API_BASE_URL and EXTRAPIFY_API_KEY

Troubleshooting

SymptomFix
Server exits immediatelyVerify EXTRAPIFY_API_BASE_URL is a valid absolute URL
Tool calls return 401 or 403Check your API key at extrapify.com/dashboard
Client cannot discover toolsConfirm it is launching node against mcp/server.mjs over stdio
Requests time outVerify the Extrapify API is reachable from your machine
JS-heavy pages return empty contentExtrapify handles Browserless fallback automatically — no action needed

Repository layout

mcp/
  server.mjs               ← MCP stdio server entrypoint
  tool-registry.mjs        ← tool definitions
  extrapify-client.mjs     ← minimal Extrapify API client
  tools/
    extract-structured-data.mjs
  configs/
    claude-desktop.local.example.json
    claude-desktop.production.example.json
docs/
  mcp-install-examples.md
  schema-templates.md
  demo-workflows.md
  mcp-marketplace-copy.md

Server Config

{
  "mcpServers": {
    "extrapify": {
      "command": "node",
      "args": [
        "/absolute/path/to/extrapify-mcp/mcp/server.mjs"
      ],
      "env": {
        "EXTRAPIFY_API_BASE_URL": "https://extrapify.com",
        "EXTRAPIFY_API_KEY": "sk_live_your_key_here"
      }
    }
  }
}
Recommend Servers
TraeBuild with Free GPT-4.1 & Claude 3.7. Fully MCP-Ready.
MiniMax MCPOfficial MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video generation APIs.
BlenderBlenderMCP connects Blender to Claude AI through the Model Context Protocol (MCP), allowing Claude to directly interact with and control Blender. This integration enables prompt assisted 3D modeling, scene creation, and manipulation.
Y GuiA web-based graphical interface for AI chat interactions with support for multiple AI models and MCP (Model Context Protocol) servers.
Howtocook Mcp基于Anduin2017 / HowToCook (程序员在家做饭指南)的mcp server,帮你推荐菜谱、规划膳食,解决“今天吃什么“的世纪难题; Based on Anduin2017/HowToCook (Programmer's Guide to Cooking at Home), MCP Server helps you recommend recipes, plan meals, and solve the century old problem of "what to eat today"
Visual Studio Code - Open Source ("Code - OSS")Visual Studio Code
Jina AI MCP ToolsA Model Context Protocol (MCP) server that integrates with Jina AI Search Foundation APIs.
EdgeOne Pages MCPAn MCP service designed for deploying HTML content to EdgeOne Pages and obtaining an accessible public URL.
Playwright McpPlaywright MCP server
WindsurfThe new purpose-built IDE to harness magic
Serper MCP ServerA Serper MCP Server
AiimagemultistyleA Model Context Protocol (MCP) server for image generation and manipulation using fal.ai's Stable Diffusion model.
CursorThe AI Code Editor
DeepChatYour AI Partner on Desktop
Amap Maps高德地图官方 MCP Server
Zhipu Web SearchZhipu Web Search MCP Server is a search engine specifically designed for large models. It integrates four search engines, allowing users to flexibly compare and switch between them. Building upon the web crawling and ranking capabilities of traditional search engines, it enhances intent recognition capabilities, returning results more suitable for large model processing (such as webpage titles, URLs, summaries, site names, site icons, etc.). This helps AI applications achieve "dynamic knowledge acquisition" and "precise scenario adaptation" capabilities.
Baidu Map百度地图核心API现已全面兼容MCP协议,是国内首家兼容MCP协议的地图服务商。
ChatWiseThe second fastest AI chatbot™
MCP AdvisorMCP Advisor & Installation - Use the right MCP server for your needs
Tavily Mcp
RedisA Model Context Protocol server that provides access to Redis databases. This server enables LLMs to interact with Redis key-value stores through a set of standardized tools.