SmoothOperatorAgentTools
Smooth Operator Agent Tools 支持强大的 Windows 自动化、浏览器控制以及 AI 驱动的计算机交互。无论您是在构建能够操作计算机的 AI agent、自动化业务流程,还是创建测试脚本,这些工具都能通过多种集成选项提供全面的解决方案。
Smooth Operator Agent Tools MCP Server
Smooth Operator Agent Tools provide robust capabilities for Windows automation, browser control, and AI-driven computer interaction. They are designed to empower AI agents to understand and utilize computer functionalities.
What it does:
This MCP server allows AI agents (like Claude) to interact with your Windows environment, control web browsers (specifically Chrome), perform UI automation, and execute keyboard/mouse actions. It also includes AI Vision capabilities for mouse control based on UI element descriptions.
How to use:
- Download the Windows App: Obtain the
Smooth Operator Agent ToolsWindows application. - Installation: Follow the setup instructions provided by the app.
- MCP Client Configuration: Configure your MCP client with the following
mcpServersentry to connect via stdio:{ "mcpServers": { "SmoothOperatorAgentTools": { "command": "C:\\Users\\[USERNAME]\\AppData\\Roaming\\SmoothOperator\\AgentToolsServer\\smooth-operator-server.exe", "args": [ "/silent", "/close-with-parent-process" ] } } } - HTTP Server (Optional): The tools also expose functionality via an HTTP server at
http://localhost:54321, which can be used as a streamable endpoint. - Authentication: All API requests require an API key as a Bearer token in the
Authorizationheader. This key is shared with Screengrasp and can be obtained fromhttps://screengrasp.com/api.html.
Key Features:
- Screenshot and Analysis: Capture screenshots and analyze system state, UI elements, and application details.
- Mouse Control: Precise coordinate-based and AI Vision-powered mouse operations.
- Keyboard Input: Typing text, hotkeys, and key combinations.
- Chrome Browser Control: Navigation, DOM manipulation, and JavaScript execution.
- Windows Automation: Advanced UI Automation and code execution.
- API Documentation: Endpoints for accessing API documentation directly.
Implementation Details:
The server is provided as a Windows executable (.exe). Client libraries are available for Python, TypeScript, and C#/.NET, indicating a multi-language support for integration.
Recommend MCP Servers 💡
nodemcu-mcp
An MCP service for managing NodeMCU IoT devices with REST API, WebSocket, and AI integration.
@executeautomation/playwright-mcp-server
Playwright Model Context Protocol Server - Tool to automate Browsers and APIs in Claude Desktop, Cline, Cursor IDE and More 🔌
wayland-mcp
Provides screenshot, image analysis, mouse, and keyboard control tools for modern Linux desktops running Wayland.
@mobilenext/mobile-mcp
Model Context Protocol Server for Mobile Automation and Scraping (iOS, Android, Emulators, Simulators and Real Devices)
@automatalabs/mcp-server-playwright
null
@alexarevalo.ai/mcp-server-ticktick
A Model Context Protocol (MCP) server designed to integrate with the TickTick task management platform, enabling intelligent context-aware task operations and automation.