🚀 Wayland MCP Server
Note: This package was created because existing screenshot solutions didn't work reliably on my Wayland setup. Wayland MCP provides screenshot, analysis, mouse and keyboard control tools for modern Linux desktops.
✨ Features
-
📸 Screenshot & Analysis
- Custom VLM integration for image analysis
- Image comparison capabilities
-
🖱️ Input Simulation
- Mouse control (move, click, drag, scroll)
- Keyboard input (typing, key presses)
- Action chaining for complex sequences
⚠️ Security Warning
WARNING: Enabling input control gives the MCP server full access to your mouse and keyboard. Only use with trusted MCP servers and models.
🚀 Quick Start
🔧 Input Control Setup
./setup.sh
Configures permissions for evemu-event to control input devices.
⚙️ MCP Server Configuration
Add to your MCP server config (.roo/mcp.json):
{
"mcpServers": {
"wayland-screenshot": {
"command": "uvx",
"args": ["wayland-mcp"],
"env": {
"OPENROUTER_API_KEY": "your-api-key",
"VLM_MODEL": "qwen/qwen2.5-vl-72b-instruct:free",
"XDG_RUNTIME_DIR": "/run/user/1000",
"WAYLAND_MCP_PORT": "4999",
"DISPLAY": ":0",
"WAYLAND_DISPLAY": "wayland-0",
"XDG_SESSION_TYPE": "wayland"
}
}
}
}
🛠️ Development
git clone https://github.com/someaka/wayland-mcp
cd wayland-mcp
pip install -e .
📜 License
GPL 3
Recommend MCP Servers 💡
airtop-mcp
An MCP server that provides tools for interacting with Airtop's browser automation service
@codesys/mcp-toolkit
An MCP server enabling interaction between MCP clients and CODESYS automation platform
@playwright/trace-mcp
An MCP server that provides browser automation capabilities using Playwright, with trace viewer and video recording functionality.
UnityMCPIntegration
Enable AI Agents to Control Unity through MCP integration
thinqconnect-mcp
This is the official MCP (Model Context Protocol) server for LG ThinQ devices. This server provides integrated control capabilities including status monitoring, device control, and profile information for various LG ThinQ devices, built on the LG ThinQ API and Python Open SDK.
@mobilenext/mobile-mcp
Model Context Protocol Server for Mobile Automation and Scraping (iOS, Android, Emulators, Simulators and Real Devices)