zahidgalea_windtools_mcp

zahidgalea_windtools_mcp

by ZahidGalea
An MCP server for semantic code search and document embedding using ChromaDB and sentence transformers.

WindTools MCP Server for Codebase Semantic Search

Overview

The WindTools MCP Server is designed to provide document embedding and retrieval capabilities for codebases using ChromaDB and sentence transformers. It enables semantic code search, code repository indexing, and persistent storage of code embeddings.

Features

  • Semantic Code Search: Utilizes sentence transformers for embedding code snippets and retrieval.
  • Code Repository Indexing: Automatically indexes code files from specified directories.
  • Persistent Storage: Saves code embeddings in ChromaDB for persistent retrieval.
  • Directory Exploration: Built-in tools for navigating and exploring codebases.
  • Background Initialization: Loads resources asynchronously to minimize startup time.
  • Environment Configuration: Configurable through environment variables.

Tools

list_dir

  • Purpose: List the contents of a directory.
  • Inputs:
  • directory_path (string): Path to list contents of, should be absolute path to a directory.
  • Returns: JSON string containing directory information including file types and sizes.

get_initialization_status

  • Purpose: Check the status of the background initialization process.
  • Returns: JSON string with initialization status of ChromaDB and embedding model.

index_repository

  • Purpose: Index code files from specified directories into ChromaDB.
  • Inputs:
  • target_directories (array of strings): List of absolute paths to directories to index.
  • force_reindex (boolean, optional): If true, reindex all files even if they already exist in the index.
  • Returns: JSON string containing indexing statistics and results.

codebase_search

  • Purpose: Find code snippets relevant to a search query.
  • Inputs:
  • query (string): Search query describing what you're looking for.
  • limit (integer, optional): Maximum number of results to return (default: 10).
  • min_relevance (float, optional): Minimum relevance score threshold (0.0 to 1.0).
  • Returns: JSON string containing search results with relevant code snippets.

Technical Architecture

The WindTools MCP Server is built on these key components:
- ChromaDB: Vector database for storing and retrieving code embeddings.
- Sentence Transformers: Deep learning models for creating embeddings from code.
- FastMCP: Framework for building MCP-compliant servers.
- Async Lifespan Management: Efficient resource initialization and cleanup.

Initialization Process

The server initializes ChromaDB and the embedding model in the background, allowing it to start accepting requests immediately while resource loading continues in the background. The get_initialization_status tool can be used to check if the initialization is complete.

Setup

Environment Variables

The server can be configured with the following environment variables:
- DATA_ROOT: Absolute directory where ChromaDB database and model cache will be stored (default: a 'data' directory inside the package).
- CHROMA_DB_FOLDER_NAME: Name of the folder where ChromaDB stores data (default: "default").
- SENTENCE_TRANSFORMER_PATH: Path to the sentence transformer model (default: "jinaai/jina-embeddings-v2-base-code").

Installation

Using pip

pip install windtools-mcp

From source

git clone https://github.com/ZahidGalea/windtools-mcp
cd windtools-mcp
pip install -e .

Usage with Claude Desktop

Add the following to your claude_desktop_config.json:

Direct Execution

Using Python 3.11 as ChromaDB has issues with newer Python versions.

{
  "mcpServers": {
    "windtools": {
      "command": "uvx",
      "args": [\
        "-p",\
        "3.11",\
        "-U",\
        "windtools-mcp"\
      ],
      "env": {
        "DATA_ROOT": "/Users/<user>/windtools_data",
        "CHROMA_DB_FOLDER_NAME": "chromadb",
        "SENTENCE_TRANSFORMER_PATH": "jinaai/jina-embeddings-v2-base-code"
      }
    }
  }
}

Development

Requirements

  • Python 3.11
  • Dependencies listed in pyproject.toml

Development Setup

For developing:

# Install development dependencies
uv sync --dev

If you want to use locally:

pip install -e .

Configuration for local development:

{
  "mcpServers": {
    "windtools": {
      "command": "uv",
      "args": [\
        "run",\
        "windtools-mcp"\
      ],
      "env": {
        "DATA_ROOT": "/Users/<user>/windtools_data",
        "CHROMA_DB_FOLDER_NAME": "chromadb",
        "SENTENCE_TRANSFORMER_PATH": "jinaai/jina-embeddings-v2-base-code"
      }
    }
  }
}

Inspector

npx @modelcontextprotocol/inspector uvx -p 3.11 windtools-mcp
npx @modelcontextprotocol/inspector uv run windtools-mcp

Running Tests

pytest tests/

Project Structure

src/
  windtools_mcp/
    __init__.py
    __main__.py
    server.py
tests/
  test_client.py
  test_unit.py
.github/
  workflows/
    publish.yml
    test.yml
.gitignore
.python-version
pyproject.toml
README.md
VERSION

Release Process

The project version is managed centrally in the VERSION file. The release process is automatic:
1. Update the version number in the VERSION file.
2. Commit and push to the main branch.
3. The GitHub Actions workflow will automatically:
- Detect the change in the VERSION file.
- Create a git tag with the format v{VERSION}.
- Generate a release on GitHub.
- Publish the package to PyPI.

License

This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License.

About

Your own codebase tools like code semantic search.

Topics

Resources

Stars

2 stars

Watchers

1 watching

Forks

0 forks

Report repository

Releases 14


Release v1.2.1 Latest\
\
Mar 17, 2025

+ 13 releases

Packages 0


No packages published

Languages

Features & Capabilities

Categories
mcp_server model_context_protocol python chromadb sentence_transformers code_search api_integration developer_tools

Implementation Details

Stats

0 Views
2 GitHub Stars

Repository Info

ZahidGalea Organization

Similar MCP Servers

continuedev_continue by continuedev
25049
21423
9300