This repository contains a collection of server implementations for Pixeltable, designed to handle multimodal data indexing and querying (audio, video, images, and documents). These services are orchestrated using Docker for local development.
Located in servers/audio-index/
, this server provides:
- Audio file indexing with transcription capabilities
- Semantic search over audio content
- Multi-index support for audio collections
- Accessible at /audio
endpoint
Located in servers/video-index/
, this server provides:
- Video file indexing with frame extraction
- Content-based video search
- Accessible at /video
endpoint
Located in servers/image-index/
, this server provides:
- Image indexing with object detection
- Similarity search for images
- Accessible at /image
endpoint
Located in servers/doc-index/
, this server provides:
- Document indexing with text extraction
- Retrieval-Augmented Generation (RAG) support
- Accessible at /doc
endpoint
Located in servers/base-sdk/
, this server provides:
- Core functionality for Pixeltable integration
- Foundation for building specialized servers
pip install pixeltable
git clone https://github.com/pixeltable/mcp-server-pixeltable.git
cd mcp-server-pixeltable/servers
docker-compose up --build # Run locally with docker-compose
docker-compose down # Take down resources
This project is licensed under the Apache 2.0 License.