The OpenAI Speech-to-Text Transcriber MCP Server is a powerful tool designed to provide audio transcription capabilities using OpenAI's Whisper API. This server is ideal for developers looking to integrate speech-to-text functionality into their applications.
shell
git clone https://github.com/Ichigo3766/audio-transcriber-mcp.git
cd audio-transcriber-mcp
shell
npm install
shell
npm run build
Set up your OpenAI API key in your environment variables.
Add the server configuration to your environment:
json
{
"mcpServers": {
"audio-transcriber": {
"command": "node",
"args": [
"/path/to/audio-transcriber-mcp/build/index.js"
],
"env": {
"OPENAI_API_KEY": "",
"OPENAI_BASE_URL": "", // Optional
"OPENAI_MODEL": "" // Optional
}
}
}
}
Replace /path/to/audio-transcriber-mcp
with the actual path where you cloned the repository.
transcribe_audio
- Transcribe audio files using OpenAI's APIfilepath
(required): Path to the audio file.save_to_file
(optional): Boolean to save transcription to a file.language
(optional): ISO-639-1 language code (e.g., "en", "es").This MCP server is licensed under the MIT License. You are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.
A MCP server that provides audio transcription capabilities using OpenAI's Whisper API.