SmoothOperatorAgentTools
Smooth Operator Agent Tools enable powerful Windows automation, browser control, and AI-driven computer interaction. Whether you're building AI agents that can use computers, automating business processes, or creating test scripts, these tools provide a comprehensive solution with multiple integration options.
Smooth Operator Agent Tools MCP Server
Smooth Operator Agent Tools provide robust capabilities for Windows automation, browser control, and AI-driven computer interaction. They are designed to empower AI agents to understand and utilize computer functionalities.
What it does:
This MCP server allows AI agents (like Claude) to interact with your Windows environment, control web browsers (specifically Chrome), perform UI automation, and execute keyboard/mouse actions. It also includes AI Vision capabilities for mouse control based on UI element descriptions.
How to use:
- Download the Windows App: Obtain the
Smooth Operator Agent ToolsWindows application. - Installation: Follow the setup instructions provided by the app.
- MCP Client Configuration: Configure your MCP client with the following
mcpServersentry to connect via stdio:{ "mcpServers": { "SmoothOperatorAgentTools": { "command": "C:\\Users\\[USERNAME]\\AppData\\Roaming\\SmoothOperator\\AgentToolsServer\\smooth-operator-server.exe", "args": [ "/silent", "/close-with-parent-process" ] } } } - HTTP Server (Optional): The tools also expose functionality via an HTTP server at
http://localhost:54321, which can be used as a streamable endpoint. - Authentication: All API requests require an API key as a Bearer token in the
Authorizationheader. This key is shared with Screengrasp and can be obtained fromhttps://screengrasp.com/api.html.
Key Features:
- Screenshot and Analysis: Capture screenshots and analyze system state, UI elements, and application details.
- Mouse Control: Precise coordinate-based and AI Vision-powered mouse operations.
- Keyboard Input: Typing text, hotkeys, and key combinations.
- Chrome Browser Control: Navigation, DOM manipulation, and JavaScript execution.
- Windows Automation: Advanced UI Automation and code execution.
- API Documentation: Endpoints for accessing API documentation directly.
Implementation Details:
The server is provided as a Windows executable (.exe). Client libraries are available for Python, TypeScript, and C#/.NET, indicating a multi-language support for integration.
Recommend MCP Servers 💡
cronlytic-mcp-server
MCP server to let LLMs (AI Agents) communicate and interact with Cronlytic to perform CRUD operations for serverless cron jobs
wayland-mcp
Provides screenshot, image analysis, mouse, and keyboard control tools for modern Linux desktops running Wayland.
UnityMCPIntegration
Enable AI Agents to Control Unity through MCP integration
srmorete/adb-mcp
An MCP server for interacting with Android devices through ADB
browser-use-mcp-server
Browse the web, directly from Cursor etc.
mcp-prefect
An MCP server implementation for Prefect, enabling AI assistants to interact with Prefect via natural language