w_jeon_mcp_framework

by w-jeon

A powerful framework for building custom tools that interact with large language models, extending Cursor IDE capabilities.

Model Context Protocol Development Framework

A powerful development framework for creating custom tools that interact with large language models using the Model Context Protocol (MCP). This framework provides a comprehensive toolset to extend the functionality of the Cursor IDE, enabling advanced features like web content retrieval and file processing (PDF, Word, Excel).

Overview

The Model Context Protocol Development Framework is designed to simplify the creation of tools that interact with large language models. It offers a modular and extensible architecture, making it easy to add new functionalities and integrate with existing systems.

Core Features

1. Comprehensive File Processing

The file tool automatically identifies file types and selects the appropriate processing method. Supported formats include PDF, Word, and Excel.

Usage: file /path/to/document
Supported Formats:
PDF files (.pdf)
Word documents (.doc, .docx)
Excel files (.xls, .xlsx, .xlsm)
Parameters: file_path - The local path to the file
Returns: Processed content based on the file type

2. PDF Document Processing

The pdf tool processes PDF documents with two modes:

Usage: pdf /path/to/document.pdf [mode]
Parameters:
file_path - The local path to the PDF file
mode - Processing mode (optional):
- quick - Quick preview mode, extracts text only
- full - Full parsing mode, extracts text and images (default)
Returns:
Quick preview mode: Text content
Full parsing mode: Text and images
Features:
High-quality text extraction and image processing using PyMuPDF
Automatic handling of large files
Image extraction and saving

3. Word Document Parsing

The word tool parses Word documents to extract text, tables, and images.

Usage: word /path/to/document.docx
Features: Extracts text, tables, and images from Word documents
Parameters: file_path - The local path to the Word document
Returns: Text content, tables, and images
Features: High-quality text and table extraction using python-docx

4. Excel File Processing

The excel tool parses Excel files to provide complete table data and structure information.

Usage: excel /path/to/spreadsheet.xlsx
Features: Parses all sheets in an Excel file
Parameters: file_path - The local path to the Excel file
Returns:
File metadata (filename, number of sheets)
Detailed information for each sheet:
- Number of rows and columns
- List of column names
- Complete table data
Features:
High-quality table data processing using pandas and openpyxl
Support for multiple sheets
Automatic data type conversion

5. Web Content Retrieval

The url tool retrieves content from any web page.

Usage: url https://example.com
Parameters: url - The URL of the website to retrieve content from
Returns: Text content of the web page
Features:
Comprehensive HTTP error handling
Timeout management
Automatic encoding handling

Technical Highlights

Smart File Type Recognition:
Automatically selects the appropriate processing tool based on file extension
Provides a unified file processing interface
Efficient Document Processing:
PDF processing: Supports quick preview and full parsing modes
Word processing: Accurately extracts text, tables, and images
Excel processing: Efficiently handles large table data
Memory Optimization:
Uses temporary files to manage large files
Automatically cleans up temporary resources
Processes large documents in chunks
Error Handling:
Comprehensive exception capture and handling
Detailed error feedback
Graceful failure handling mechanisms

Project Structure

The framework is modular and easy to maintain:

mcp_tool/
├── tools/
│   ├── __init__.py        # Defines tool base classes and registries
│   ├── loader.py          # Tool loader, automatically loads all tools
│   ├── file_tool.py       # Comprehensive file processing tool
│   ├── pdf_tool.py        # PDF parsing tool
│   ├── word_tool.py       # Word document parsing tool
│   ├── excel_tool.py      # Excel file processing tool
│   └── url_tool.py        # URL tool implementation
├── __init__.py
├── __main__.py
└── server.py              # MCP server implementation

Development Guide

How to Develop New Tools

Create a new Python file in the tools directory, e.g., your_tool.py
Import necessary dependencies and base classes
Create a tool class that inherits from BaseTool
Register the tool using the @ToolRegistry.register decorator
Implement the execute method for the tool

Tool Template Example

import mcp.types as types
from . import BaseTool, ToolRegistry

@ToolRegistry.register
class YourTool(BaseTool):
    """Your tool description"""
    name = "your_tool_name"  # Unique identifier for the tool
    description = "Your tool description"  # Description displayed to users
    input_schema = {
        "type": "object",
        "required": ["param1"],  # Required parameters
        "properties": {
            "param1": {
                "type": "string",
                "description": "Description of parameter 1",
            },
            "param2": {
                "type": "integer",
                "description": "Description of parameter 2 (optional)",
            }
        },
    }

    async def execute(self, arguments: dict) -> list[types.TextContent | types.ImageContent | types.EmbeddedResource]:
        """Execute tool logic"""
        # Parameter validation
        if "param1" not in arguments:
            return [types.TextContent(
                type="text",
                text="Error: Missing required argument 'param1'"
            )]

        # Get parameters
        param1 = arguments["param1"]
        param2 = arguments.get("param2", 0)  # Get optional parameter with default value

        # Execute tool logic
        result = f"Processed parameters: {param1}, {param2}"

        # Return result
        return [types.TextContent(
            type="text",
            text=result
        )]

Deployment Guide

Docker Deployment (Recommended)

Initial Setup:

# Clone the repository
git clone https://github.com/your-username/mcp-framework.git
cd mcp-framework

# Create environment file
cp .env.example .env

Using Docker Compose:

# Build and start
docker compose up --build -d

# View logs
docker compose logs -f

# Manage containers
docker compose ps
docker compose pause
docker compose unpause
docker compose down

Access the Service:
SSE endpoint: http://localhost:8000/sse
Cursor IDE Configuration:
Settings → Features → Add MCP Server
Type: "sse"
URL: http://localhost:8000/sse

Traditional Python Deployment

Install System Dependencies:

# Ubuntu/Debian
sudo apt-get update
sudo apt-get install -y poppler-utils tesseract-ocr tesseract-ocr-chi-sim

# macOS
brew install poppler tesseract tesseract-lang

# Windows
# 1. Download and install Tesseract: https://github.com/UB-Mannheim/tesseract/wiki
# 2. Add Tesseract to the system PATH

Install Python Dependencies:

# Create a virtual environment
python -m venv venv
source venv/bin/activate  # Linux/Mac
# or
.\venv\Scripts\activate  # Windows

# Install dependencies
pip install -r requirements.txt

Start the Service:

python -m mcp_tool

Configuration

Environment Variables

Configure in the .env file:

MCP_SERVER_PORT: Server port (default: 8000)
MCP_SERVER_HOST: Binding address (default: 0.0.0.0)
DEBUG: Debug mode (default: false)
MCP_USER_AGENT: Custom User-Agent

Dependencies

Main dependencies:

mcp: Model Context Protocol implementation
PyMuPDF: PDF document processing
python-docx: Word document processing
pandas and openpyxl: Excel file processing
httpx: Asynchronous HTTP client
anyio: Asynchronous I/O support
click: Command-line interface

Contribution Guidelines

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add some amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Features & Capabilities

Implementation Details

View on GitHub

Stats

0 Views

0 Favorites

1 GitHub Stars

Repository Info

w-jeon Organization

Similar Servers

continuedev_continue by continuedev

cherryhq_cherry_studio by CherryHQ

bytedance_ui_tars_desktop by bytedance

No result found

Advanced Search

Search Preferences