SmoothOperatorAgentTools
Smooth Operator Agent Tools enable powerful Windows automation, browser control, and AI-driven computer interaction. Whether you're building AI agents that can use computers, automating business processes, or creating test scripts, these tools provide a comprehensive solution with multiple integration options.
Smooth Operator Agent Tools MCP Server
Smooth Operator Agent Tools provide robust capabilities for Windows automation, browser control, and AI-driven computer interaction. They are designed to empower AI agents to understand and utilize computer functionalities.
What it does:
This MCP server allows AI agents (like Claude) to interact with your Windows environment, control web browsers (specifically Chrome), perform UI automation, and execute keyboard/mouse actions. It also includes AI Vision capabilities for mouse control based on UI element descriptions.
How to use:
- Download the Windows App: Obtain the
Smooth Operator Agent ToolsWindows application. - Installation: Follow the setup instructions provided by the app.
- MCP Client Configuration: Configure your MCP client with the following
mcpServersentry to connect via stdio:{ "mcpServers": { "SmoothOperatorAgentTools": { "command": "C:\\Users\\[USERNAME]\\AppData\\Roaming\\SmoothOperator\\AgentToolsServer\\smooth-operator-server.exe", "args": [ "/silent", "/close-with-parent-process" ] } } } - HTTP Server (Optional): The tools also expose functionality via an HTTP server at
http://localhost:54321, which can be used as a streamable endpoint. - Authentication: All API requests require an API key as a Bearer token in the
Authorizationheader. This key is shared with Screengrasp and can be obtained fromhttps://screengrasp.com/api.html.
Key Features:
- Screenshot and Analysis: Capture screenshots and analyze system state, UI elements, and application details.
- Mouse Control: Precise coordinate-based and AI Vision-powered mouse operations.
- Keyboard Input: Typing text, hotkeys, and key combinations.
- Chrome Browser Control: Navigation, DOM manipulation, and JavaScript execution.
- Windows Automation: Advanced UI Automation and code execution.
- API Documentation: Endpoints for accessing API documentation directly.
Implementation Details:
The server is provided as a Windows executable (.exe). Client libraries are available for Python, TypeScript, and C#/.NET, indicating a multi-language support for integration.
Recommend MCP Servers 💡
fetcher-mcp
MCP server for fetching web page content using Playwright headless browser.
@makehq/mcp-server
Make MCP Server
gbox
Cli and MCP for gbox. Enable AI agents to operate Android/Browser/Desktop like human.
mcp-server-apache-airflow
A Model Context Protocol (MCP) server implementation for Apache Airflow, enabling seamless integration with MCP clients. This project provides a standardized way to interact with Apache Airflow through the Model Context Protocol.
mcp-prefect
An MCP server implementation for Prefect, enabling AI assistants to interact with Prefect via natural language
talk-with-figma-claude
Enables Claude Desktop App to control Figma through MCP (Model Context Protocol) via stdio