Semble MCP Server
Local code-search MCP server for agents, using CPU-only hybrid retrieval to index repositories and return focused snippets for natural-language queries.
Open the source and read safety notes before installing.
Safety notes
- Semble indexes local paths or remote git repositories and returns code chunks through MCP tools.
- The interactive installer can modify agent configuration files, instruction files, and sub-agent files for supported clients.
- In MCP mode, local paths can be watched for file changes and re-indexed automatically.
- Remote repository search accepts git URLs and clones or indexes source on demand.
- Cached indexes and usage statistics can persist after the MCP session ends.
Privacy notes
- Source code, filenames, directory structure, snippets, documentation, configuration files, search queries, related-code requests, cache keys, and tool results may be visible to the MCP client and model provider.
- Using `--content docs`, `--content config`, or `--content all` can include README files, deployment notes, YAML/TOML files, credentials accidentally present in config, and internal runbooks.
- Persistent cache data may reveal repository names, paths, symbols, comments, and source-derived chunks.
- Review `.gitignore` and `.sembleignore` coverage before indexing workspaces that contain generated files, vendored code, secrets, customer data, or unrelated private repositories.
Prerequisites
- Python 3.10 or newer and uv available to the MCP client runtime.
- A local repository or remote git repository you are authorized to index.
- Approval before the interactive installer modifies MCP client configs, AGENTS.md, CLAUDE.md, or sub-agent files.
- Cache location and retention reviewed before indexing proprietary repositories.
- Optional `--content docs`, `--content config`, or `--content all` scope chosen deliberately for sensitive projects.
Schema details
- Install type
- cli
- Troubleshooting
- No
- Scope
- Source repo
- Estimated setup
- 10 minutes
- Difficulty
- intermediate
Full copyable content
{
"mcpServers": {
"semble": {
"command": "uvx",
"args": ["--from", "semble[mcp]", "semble"]
}
}
}About this resource
Content
Semble is a local-first code-search MCP server for coding agents. It indexes local paths or remote git repositories and exposes focused search results through MCP instead of making Claude grep broadly and read entire files.
The MCP server provides search for natural-language or code queries and
find_related for finding chunks similar to a known file and line. The project
uses CPU-only hybrid retrieval with static code embeddings, BM25, tree-sitter
chunking, and reranking. Indexes are cached and local repositories can be
watched for automatic re-indexing during an MCP session.
Source Review
- https://github.com/MinishLab/semble
- https://github.com/MinishLab/semble/blob/main/README.md
- https://github.com/MinishLab/semble/blob/main/docs/installation.md
- https://github.com/MinishLab/semble/blob/main/pyproject.toml
- https://github.com/MinishLab/semble/blob/main/src/semble/mcp.py
- https://github.com/MinishLab/semble/blob/main/src/semble/cache.py
- https://github.com/MinishLab/semble/blob/main/benchmarks/README.md
- https://github.com/MinishLab/semble/blob/main/LICENSE
- https://pypi.org/pypi/semble/json
These sources were reviewed on 2026-06-05. Prefer the live repository, README, installation guide, Python project metadata, MCP server implementation, cache implementation, benchmark notes, license, and PyPI metadata for current commands, package versions, supported clients, tool behavior, cache behavior, and benchmark claims.
Features
- Search local repositories by natural-language or code query.
- Search remote git repositories by passing an explicit repository URL.
- Find code chunks related to a file path and line number from a prior result.
- Cache indexes on disk for faster repeat queries.
- Watch local paths in MCP mode and rebuild indexes after file changes.
- Index code by default, with optional documentation, configuration, or all-file scopes.
- Use
.gitignoreand.sembleignorerules to control indexed files. - Install MCP configs, usage instructions, or a dedicated search sub-agent for supported clients.
- Run on CPU without requiring API keys, GPUs, or external vector services.
Installation
The recommended installer uses uv:
uv tool install semble
semble install
For manual MCP setup, configure the server to launch through uvx:
{
"mcpServers": {
"semble": {
"command": "uvx",
"args": ["--from", "semble[mcp]", "semble"]
}
}
}
To include documentation, configuration, or every supported file type, append
--content docs, --content config, or --content all to the server command.
Use Cases
- Ask Claude where a feature is implemented without loading every matched file.
- Search unfamiliar repositories by behavior, API name, or architecture concept.
- Find related implementations after inspecting one relevant search result.
- Let a coding agent query documentation and config only when that scope is approved.
- Add AGENTS.md or CLAUDE.md guidance so agents prefer Semble before broad grep/read.
- Install a dedicated
semble-searchsub-agent in harnesses that support sub-agents.
Safety and Privacy
Semble is intentionally close to source code. It can index local repositories, clone or index remote git repositories, persist cache data, and return source chunks to the MCP client. Scope each search to repositories you are authorized to expose to an AI assistant.
Use .gitignore and .sembleignore to exclude generated files, vendored code,
secrets, private fixtures, and unrelated projects. Treat caches and usage
statistics as source-derived artifacts, especially when using documentation,
configuration, or all-file indexing modes.
Duplicate Check
No MinishLab/semble entry, semble PyPI package entry, or matching source URL
was found in content/mcp.
Source citations
Signals
Loading live community signals…
A short, calm digest of reviewed Claude resources. Unsubscribe any time.