mario_andreschak_mcp_windows_desktop_automation

mario_andreschak_mcp_windows_desktop_automation

by mario-andreschak
A TypeScript-based MCP server for automating Windows desktop tasks using AutoIt.

Windows Desktop Automation MCP Server

A Model Context Protocol (MCP) server for Windows desktop automation using AutoIt.

Overview

This project provides a TypeScript MCP server that wraps the node-autoit-koffi package, allowing LLM applications to automate Windows desktop tasks through the MCP protocol.

The server exposes:
- Tools: All AutoIt functions as MCP tools
- Resources: File access and screenshot capabilities
- Prompts: Templates for common automation tasks

Features

  • Full wrapping of all AutoIt functions as MCP tools
  • Support for both stdio and WebSocket transports
  • File access resources for reading files and directories
  • Screenshot resources for capturing the screen or specific windows
  • Prompt templates for common automation tasks
  • Strict TypeScript typing throughout

Installation

# Clone the repository
git clone https://github.com/yourusername/mcp-windows-desktop-automation.git
cd mcp-windows-desktop-automation

# Install dependencies
npm install

# Build the project
npm run build

Usage

Starting the Server

# Start with stdio transport (default)
npm start

# Start with WebSocket transport
npm start -- --transport=websocket --port=3000

# Enable verbose logging
npm start -- --verbose

Command Line Options

  • --transport=stdio|websocket: Specify the transport protocol (default: stdio)
  • --port=<number>: Specify the port for WebSocket transport (default: 3000)
  • --verbose: Enable verbose logging

Tools

The server provides tools for:
- Mouse operations: Move, click, drag, etc.
- Keyboard operations: Send keystrokes, clipboard operations, etc.
- Window management: Find, activate, close, resize windows, etc.
- Control manipulation: Interact with UI controls, buttons, text fields, etc.
- Process management: Start, stop, and monitor processes
- System operations: Shutdown, sleep, etc.

Resources

The server provides resources for:
- File access: Read files and list directories
- Screenshots: Capture the screen or specific windows

Prompts

The server provides prompt templates for:
- Window interaction: Find and interact with windows
- Form filling: Automate form filling tasks
- Automation tasks: Create scripts for repetitive tasks
- Monitoring: Wait for specific conditions

Development

# Run in development mode
npm run dev

# Lint the code
npm run lint

# Run tests
npm run test

License

MIT

About

A Model Context Protocol (MCP) server for Windows desktop automation using AutoIt.

Resources

Readme

License

MIT license

Activity

Stars

45 stars

Watchers

1 watching

Forks

4 forks

Report repository

Releases


No releases published

Packages 0


No packages published

Languages

Features & Capabilities

Categories
mcp_server model_context_protocol typescript javascript windows_automation autoit file_access screenshots api_integration

Implementation Details

Stats

0 Views
45 GitHub Stars

Repository Info

mario-andreschak Organization

Similar MCP Servers

continuedev_continue by continuedev
25049
21423
9300
SunMonTueWedThuFriSat
303112345678910111213141516171819202122232425262728293012345678910
:
PM
SunMonTueWedThuFriSat
303112345678910111213141516171819202122232425262728293012345678910
:
PM