ScreenPilot
MCP server to let LLM take full control on your device by providing screen automation toolkit for controlling and interacting with graphical user interfaces. Good for automation, education and having fun.
Main Features
- 📷 Screen capture and analysis
- 🖱️ Mouse control (clicking, positioning)
- ⌨️ Keyboard input (typing, key presses, hotkeys)
watch demo
https://github.com/user-attachments/assets/c18380c0-b3dd-4b7c-925d-28ef205ca11f
Installation
- Install python 3.12
- Clone the repository:
git clone https://github.com/Mtehabsim/ScreenPilot.git - create virtiual environment
python -m venv venv
- activate the env
venv\Scripts\activate
- Install the required packages:
pip install -r requirements.txt - Open Claude AI desktop
- file -> settings -> developer -> edit config
- open config file and paste this
{
"mcpServers": {
"device-controll": {
"command": "pathToEnv\\venv\\Scripts\\python.exe",
"args": [
"pathToProject\\ScreenPilot\\main.py"
]
}
}
}
-
Replace "pathToEnv\venv\Scripts\python.exe" → with the full path to your python.exe "pathToProject\ScreenPilot\main.py" → with the full path to your main.py file
-
Save the config file.
-
Open Claude AI Desktop.
-
Go to File → Exit
-
You can now open Claude AI Desktop and enjoy ScreenPilot.
Available Tools
- Screen Capture: Take screenshots and get screen information
- Mouse Control: Move the mouse and perform clicks
- Keyboard Actions: Type text, press keys, and use hotkey combinations
- Scrolling: Scroll in different directions and to specific positions
- Element Detection: Check if elements exist on screen and wait for them to appear
- Action Sequences: Perform multiple actions in sequence
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
Recommend MCP Servers 💡
homeassistant-mcp
The server uses the MCP protocol to share access to a local Home Assistant instance with an LLM application. A powerful bridge between your Home Assistant instance and Language Learning Models (LLMs), enabling natural language control and monitoring of your smart home devices through the Model Context Protocol (MCP). This server provides a comprehensive API for managing your entire Home Assistant ecosystem, from device control to system administration.
facebook-mcp-server
Facebook MCP server for automating posts, comment moderation, insights, and sentiment filtering.
Skyvern
Skyvern's MCP server connects AI applications to the browser, enabling form filling, file downloading, and web research.
android-adb-controller
An MCP server that allows AI agents to control Android devices using ADB commands, providing functionalities like listing devices and executing screen taps.
systemsage
A powerful cross-platform system management and monitoring tool that provides comprehensive system insights and management capabilities through Model Context Protocol (MCP)
GH05TCREW/winsecMCP
A Python-based MCP server for automating Windows security configuration tasks including firewall, RDP, UAC, and account policy management
