The Puppeteer Browser Automation MCP Server is a powerful tool that enables browser automation capabilities using Puppeteer. It allows Large Language Models (LLMs) to interact with web pages, take screenshots, and execute JavaScript in a real browser environment. This server is part of the Model Context Protocol (MCP) and provides a seamless way to automate web interactions.
url
(string)name
(string, required): Name for the screenshot.selector
(string, optional): CSS selector for element to screenshot.width
(number, optional, default: 800): Screenshot width.height
(number, optional, default: 600): Screenshot height.selector
(string): CSS selector for element to click.selector
(string): CSS selector for element to hover.selector
(string): CSS selector for input field.value
(string): Value to fill.selector
(string): CSS selector for element to select.value
(string): Value to select.script
(string): JavaScript code to execute.console://logs
): Browser console output in text format.screenshot://<name>
): PNG images of captured screenshots.NOTE: The Docker implementation will use headless Chromium, whereas the NPX version will open a browser window.
{
"mcpServers": {
"puppeteer": {
"command": "docker",
"args": ["run", "-i", "--rm", "--init", "-e", "DOCKER_CONTAINER=true", "mcp/puppeteer"]
}
}
}
{
"mcpServers": {
"puppeteer": {
"command": "npx",
"args": ["-y", "@modelcontextprotocol/server-puppeteer"]
}
}
}
To build the Docker image:
docker build -t mcp/puppeteer -f src/puppeteer/Dockerfile .
This MCP server is licensed under the MIT License. You are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.