Skip to main content
mcpSource-backedReview first Safety Privacy

FunASR MCP Server

MCP server example from FunASR that lets Claude transcribe local audio files with local speech recognition, automatic language handling, timestamps, and speaker labels when available.

by FunASR·added 2026-06-06·
Claude CodeClaude Desktop
HarnessClaude CodeClaude Desktop
Review first review before installing

Open the source and read safety notes before installing.

Safety notes

  • The MCP server exposes a `transcribe_audio` tool that reads the local file path supplied by the agent.
  • Configure clients so Claude can only request audio files from approved directories; do not expose arbitrary private folders or shared drives.
  • First use can download FunASR model weights and dependencies from upstream model hosts; review network policy, cache location, and disk usage before use in restricted environments.
  • Long recordings and GPU transcription can consume significant CPU, GPU, memory, and disk cache resources.
  • Require confirmation before transcribing meetings, calls, interviews, voice notes, customer audio, regulated recordings, or files containing other people.

Privacy notes

  • Audio recordings can contain voices, names, accents, speaker identity, background speech, locations, health details, financial details, customer data, credentials spoken aloud, or other sensitive personal information.
  • The upstream MCP example performs local inference and does not require an API key, but MCP clients, model providers, logs, terminal output, transcripts, screenshots, and shared chats can still retain audio paths and transcription text.
  • Generated transcripts, timestamps, and speaker labels may identify individuals or reveal confidential conversations.
  • Model downloads and package installation can contact PyPI, ModelScope, Hugging Face, or other dependency hosts depending on the environment and model configuration.

Prerequisites

  • Python environment with FunASR installed from PyPI or a reviewed source checkout.
  • Local checkout or copy of `examples/mcp_server/funasr_mcp.py` from the FunASR repository.
  • Audio files in an approved location and format such as WAV, MP3, FLAC, M4A, or OGG.
  • Optional GPU, Apple silicon, or CPU device selection through `FUNASR_DEVICE`.
  • Review of model-download behavior, storage location, compute requirements, and organization policy for processing speech recordings locally.

Schema details

Install type
cli
Troubleshooting
No
Source repository stats
Scope
Source repo
Collection metadata
Estimated setup
20 minutes
Difficulty
intermediate
Tool listing metadata
Disclosure
MIT-licensed FunASR repository with an MCP server example for local speech transcription. Verify model licenses, recording consent, and data-handling requirements before using it with real meeting, call, or customer audio.
Full copyable content
{
  "mcpServers": {
    "funasr": {
      "command": "python",
      "args": ["<path-to-FunASR>/examples/mcp_server/funasr_mcp.py"],
      "env": {
        "FUNASR_DEVICE": "cpu"
      }
    }
  }
}

About this resource

Content

FunASR MCP Server is the MCP server example included with the FunASR speech recognition toolkit. It exposes a stdio MCP tool named transcribe_audio so Claude can transcribe approved local audio files through FunASR's local ASR models.

Use it when Claude needs to turn a meeting recording, interview, voice memo, podcast clip, or other approved local audio file into text without sending the audio to a hosted transcription API by default.

Source Review

These sources were reviewed on 2026-06-06. Prefer the live repository, MCP example README, PyPI package page, main README, license, setup metadata, MCP server script, model-selection guide, deployment matrix, and migration guide for current setup and model behavior.

Features

  • Expose one MCP tool, transcribe_audio, over stdio.
  • Accept a local audio_path for WAV, MP3, FLAC, M4A, OGG, and similar audio files supported by the FunASR stack.
  • Return transcription text and, when available from the model output, segment timestamps and speaker labels.
  • Run local inference with no service API key required by the MCP example.
  • Select CPU, CUDA, or Apple mps execution through FUNASR_DEVICE.
  • Use FunASR's speech recognition stack for multilingual ASR, VAD, punctuation, and diarization workflows.

Installation

Install FunASR in a Python environment:

pip install funasr

Then configure your MCP client to launch the example server script from a reviewed FunASR checkout:

{
  "mcpServers": {
    "funasr": {
      "command": "python",
      "args": ["<path-to-FunASR>/examples/mcp_server/funasr_mcp.py"],
      "env": {
        "FUNASR_DEVICE": "cpu"
      }
    }
  }
}

Use cuda or mps only on machines where those accelerators are approved and available.

Use Cases

  • Transcribe a meeting recording saved in an approved local folder.
  • Convert a voice memo into text before summarizing it.
  • Extract timestamped segments from an interview or podcast clip.
  • Compare local ASR output against a hosted transcription result.
  • Draft meeting notes while keeping the raw audio on the local machine.
  • Prototype speech-to-text workflows before deploying a dedicated FunASR API server.

Safety and Privacy

FunASR MCP Server reads local audio paths supplied through the MCP client. Keep the configured script and working directory scoped to approved recordings, and require explicit approval before transcribing files from Downloads, shared drives, customer folders, or private meeting archives.

Treat audio files, file paths, transcripts, timestamps, speaker labels, terminal logs, and MCP conversation history as sensitive. Local inference avoids a hosted transcription API by default, but package installation, model downloads, logs, and connected AI clients can still expose metadata or transcript content.

Duplicate Check

No modelscope/FunASR, FunASR MCP, FunASR MCP Server, funasr_mcp.py, or matching source URL entry was found in content/mcp or README.md. Existing audio, media conversion, and local AI entries do not cover FunASR's MCP transcription example.

#speech-to-text#transcription#audio#local-ai#python

Source citations

Signals

Loading live community signals…

More like this, weekly

A short, calm digest of reviewed Claude resources. Unsubscribe any time.