data-mcp

@NexusX-MCP

Provides a variety of tools for scraping webpages, extracting structured data, and crawling. It also provides MCP servers that give easy access to general-purpose browser agents such as OpenAI's CUA, Anthropic's Claude Computer Use, and Browser Use.

web-scraping
data-extraction
browser-automation
web-crawling

Data MCP Server

This is the Model Context Protocol (MCP) server for Data. It provides tools to scrape webpages, extract structured data from them, and crawl linked pages. It also provides easy access to general-purpose browser agents such as OpenAI's CUA, Anthropic's Claude Computer Use, and Browser Use.

More information about Data can be found here. The Data API supports a superset of the features available in the MCP server.

More information about the Model Context Protocol can be found here.

Installation

Manual Installation

To install the server, run:

npx data-mcp <YOUR-DATA-API-KEY>

Running on Cursor

Add to ~/.cursor/mcp.json like this:

{
  "mcpServers": {
    "data": {
      "command": "npx",
      "args": ["-y", "data-mcp"],
      "env": {
        "DATA_API_KEY": "YOUR-API-KEY"
      }
    }
  }
}

Running on Windsurf

Add to your ~/.codeium/windsurf/mcp_config.json like this:

{
  "mcpServers": {
    "data": {
      "command": "npx",
      "args": ["-y", "data-mcp"],
      "env": {
        "DATA_API_KEY": "YOUR-API-KEY"
      }
    }
  }
}
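
The Cursor and Windsurf entries above share one shape, so the config can also be generated rather than hand-edited. The following is a minimal sketch under that assumption; `buildDataMcpConfig` and the two interfaces are hypothetical names introduced for illustration and are not part of data-mcp.

```typescript
// Shape of one entry under "mcpServers", mirroring the JSON shown above.
interface McpServerEntry {
  command: string;
  args: string[];
  env: Record<string, string>;
}

interface McpConfig {
  mcpServers: Record<string, McpServerEntry>;
}

// Hypothetical helper (not part of data-mcp): builds the config object
// for a given API key and rejects an empty key early.
function buildDataMcpConfig(apiKey: string): McpConfig {
  if (apiKey.trim() === "") {
    throw new Error("DATA_API_KEY must not be empty");
  }
  return {
    mcpServers: {
      data: {
        command: "npx",
        args: ["-y", "data-mcp"],
        env: { DATA_API_KEY: apiKey },
      },
    },
  };
}

// Serialize with two-space indentation, ready to paste into the client's config file.
const configJson: string = JSON.stringify(buildDataMcpConfig("YOUR-API-KEY"), null, 2);
console.log(configJson);
```

If the target file already lists other servers, merge the `data` entry into the existing `mcpServers` object instead of overwriting the file.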

Development

For development purposes, you can run the server directly from the source code.

  1. Clone the repository:

    git clone [email protected]:NexusX-MCP/data-mcp-server.git data-mcp
    cd data-mcp
    
  2. Install dependencies:

    npm install # or yarn install
    npm run build
    
  3. Run the server:

    node dist/server.js
    

Claude Desktop app

This is an example config for the Data MCP server for the Claude Desktop client.

{
  "mcpServers": {
    "data": {
      "command": "npx",
      "args": ["--yes", "data-mcp"],
      "env": {
        "DATA_API_KEY": "your-api-key"
      }
    }
  }
}

Tools

  • scrape_webpage - Extract formatted content (markdown, screenshot, etc.) from any webpage
  • crawl_webpages - Navigate through multiple linked pages and extract LLM-friendly formatted content
  • extract_structured_data - Convert messy HTML into structured JSON
  • search_with_bing - Query the web and get results with Bing search
  • browser_use_agent - Fast, lightweight browser automation with the Browser Use agent
  • openai_computer_use_agent - General-purpose automation using OpenAI's CUA model
  • claude_computer_use_agent - Complex browser tasks using Claude computer use
  • create_profile - Create a new persistent Data profile
  • delete_profile - Delete an existing persistent Data profile
  • list_profiles - List existing persistent Data profiles
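
MCP clients invoke these tools with a JSON-RPC 2.0 `tools/call` request. The sketch below shows roughly what such a request might look like for `scrape_webpage`. The helper `toolCallRequest` is a hypothetical name, and the `url` argument is an assumption: the real input fields are defined by the schema the server returns from `tools/list`.

```typescript
// JSON-RPC 2.0 request envelope used by MCP.
interface JsonRpcRequest<P> {
  jsonrpc: "2.0";
  id: number;
  method: string;
  params: P;
}

// Parameters of an MCP "tools/call" request.
interface ToolCallParams {
  name: string;
  arguments: Record<string, unknown>;
}

// Simple counter so every request gets a distinct id.
let nextRequestId = 1;

// Hypothetical helper: wraps a tool name and its arguments in a tools/call request.
function toolCallRequest(
  name: string,
  args: Record<string, unknown>
): JsonRpcRequest<ToolCallParams> {
  return {
    jsonrpc: "2.0",
    id: nextRequestId++,
    method: "tools/call",
    params: { name, arguments: args },
  };
}

// "url" is an assumed argument name; check the tool's input schema for the real fields.
const scrapeRequest = toolCallRequest("scrape_webpage", { url: "https://example.com" });

// Over the stdio transport, each message is written as a single line of JSON.
console.log(JSON.stringify(scrapeRequest));
```

In practice an MCP client library or host application (Cursor, Windsurf, Claude Desktop) builds and sends these messages for you; the sketch only illustrates the wire shape.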

Installing via Smithery

To install Data MCP Server for Claude Desktop automatically via Smithery:

npx -y @smithery/cli install @data-mcp --client claude

Resources

The server exposes documentation about Data through the MCP resources methods. Any client that supports resource discovery can access it.
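
Resource discovery in MCP goes through the `resources/list` and `resources/read` methods. The following sketch shows the two requests a client might send. The helper names are hypothetical, and the URI is a placeholder: the actual URIs come from the server's `resources/list` response and are not documented here.

```typescript
// Minimal JSON-RPC 2.0 request shape for the two resource methods.
interface ResourceRequest {
  jsonrpc: "2.0";
  id: number;
  method: "resources/list" | "resources/read";
  params?: { uri: string };
}

// Hypothetical helper: asks the server which resources it exposes.
function listResourcesRequest(id: number): ResourceRequest {
  return { jsonrpc: "2.0", id, method: "resources/list" };
}

// Hypothetical helper: fetches one resource by the URI returned from resources/list.
function readResourceRequest(id: number, uri: string): ResourceRequest {
  return { jsonrpc: "2.0", id, method: "resources/read", params: { uri } };
}

const listRequest = listResourcesRequest(1);

// Placeholder URI for illustration only; use a URI from the list response instead.
const readRequest = readResourceRequest(2, "placeholder://docs");

console.log(JSON.stringify(listRequest));
console.log(JSON.stringify(readRequest));
```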

License

This project is licensed under the MIT License.

# stdio

npx data-mcp
Transport: stdio
Language: TypeScript
Created: 4/24/2025
Updated: 4/24/2025