Tools that pair well with Claude
Editorial picks and disclosed partners. Free, open-source community resources live in the directory.
Self-hostable workflow automation with AI pieces and MCP access.
Open-source prompt management, evaluation, and observability for LLM apps.
Open-source observability for AI agent traces, replays, and costs.
Build and run agent platforms with agents, teams, workflows, memory, MCP, and AgentOS.
Terminal AI coding assistant for Git repositories.
Author, schedule, monitor, and operate workflow DAGs as code.
Web automation and scraping platform.
Authenticated tool-calling platform for agents.
YAML workflow engine for deterministic, repeatable AI coding agent development.
Open-source LLM observability and evaluation tooling.
Record, replay, stream, and embed terminal sessions as text-based casts.
Fast AST-based structural search, linting, and rewriting for codebases.
Local credential broker for AI agents.
Build, package, serve, containerize, and deploy AI model inference APIs.
Browser AI app builder for web projects.
Evaluation and logging platform for AI applications.
Open-source browser automation for AI agents.
Cloud browser infrastructure for AI agents.
Headless browser infrastructure for Puppeteer, Playwright, and AI agents.
Local CLI for coding-agent token usage and cost reports.
Build Python conversational AI apps with chat UI, steps, auth, and integrations.
Store embeddings and metadata for AI retrieval with local, self-hosted, or cloud Chroma.
Terminal coding agent for Claude-powered development.
Open-source autonomous coding agent for VS Code.
Framework for building AI agents on Cloudflare Workers.
AI code review platform for pull requests.
Tool integration platform for AI agents.
Open-source AI coding assistant with model control.
Sign, verify, and attest containers, binaries, SBOMs, and OCI artifacts.
Multi-agent workflow framework and platform.
Terminal-based AI coding agent from Charm with multi-model support, LSP and MCP context, and permission prompts before running tools.
Codebase-aware AI editor for agent-assisted development.
Orchestrate data assets, pipelines, jobs, schedules, sensors, and observability.
Open-source sandbox infrastructure for executing AI-generated code in isolated environments, with SDKs, snapshots, and an optional managed cloud.
Transform warehouse data with SQL models, tests, docs, lineage, and artifacts.
Python LLM evaluation tests, metrics, regression checks, and tracing.
AI software engineering agent for development tasks.
Program and optimize LM systems with signatures, modules, tools, and metrics.
Run embedded analytical SQL over local files, data frames, and DuckDB databases.
Version datasets, models, ML pipelines, experiments, metrics, and remotes with Git.
Open-source cloud sandboxes for executing AI-generated code, with Python and JavaScript SDKs and a Code Interpreter package.
Evaluate, test, and monitor ML models, LLM apps, data quality, and drift.
Search and retrieval API for AI agents.
Web crawling API for LLM-ready content.
Open-source LLM vulnerability scanner.
AI testing platform for LLM and ML quality.
AI developer assistant across GitHub and editors.
Open-source secret scanner for repositories and files.
Build, run, evaluate, and deploy code-first AI agents and workflows.
Open-source, extensible AI agent that can install, execute, edit, and test code with any LLM, available as a desktop app, CLI, and API.
Build and share Python machine-learning demos, AI apps, and chatbot interfaces.
AI code review assistant inside Graphite workflows.
Define, run, document, and automate data quality validations with GX Core.
Scan images, directories, SBOMs, PURLs, and CPEs for known vulnerabilities.
Open-source input/output guardrails and validators for LLM apps.
Build agents, RAG pipelines, search, retrieval, and tool-using LLM apps.
LLM observability and cost tracking platform.
Run raw PyTorch training and inference across distributed and mixed-precision setups.
Load, stream, inspect, and preprocess AI datasets for training and evaluation.
Run and train diffusion pipelines for image, video, and audio generation.
Load metrics, comparisons, and measurements for reproducible model and dataset evaluation.
Fine-tune large models with lightweight adapters, LoRA, and PEFT methods.
Use pretrained models for inference, generation, multimodal tasks, and training.
Open-source IDE for orchestrating AI coding agents, built on Claude Code, with parallel sessions, worktrees, and team context engineering.
Cloud-hosted Chrome sessions with scraping APIs and agent integrations for Claude, OpenAI, and BrowserUse.
Structured LLM outputs with Pydantic models, validation, and retries.
Scan Kubernetes clusters, manifests, charts, repositories, and images for security risk.
Open-source data labeling and human-in-the-loop AI evaluation.
AI security guardrails for LLM applications.
Store multimodal data and run vector, full-text, and SQL retrieval for AI applications.
Visual builder for agents, workflows, RAG apps, and MCP-enabled LLM tools.
Open-source LLM tracing and evaluation platform.
Stateful orchestration framework for LLM agents.
Observability and evaluation platform for LLM apps.
Open-source AI gateway for routing LLM calls across providers.
Run GGUF models locally with C/C++ inference and an OpenAI-compatible server.
Build agents, RAG pipelines, retrieval, indexing, and data-aware LLM apps.
AI app builder for prompt-driven web products.
Visual automation platform for integrations and AI workflows.
Build reactive Python notebooks that run as scripts, apps, and git-friendly data workflows.
TypeScript framework for agents and AI workflows.
Visual MCP server testing and debugging.
Self-host many MCP servers as HTTP endpoints via supergateway.
Open-source framework for multi-agent AI applications.
Microsoft framework for generative AI risk identification and red-team assessment.
Run scalable vector, sparse, and hybrid search for RAG and AI retrieval systems.
Minimal CLI coding agent with confirm, human, and automatic execution modes.
Trace, evaluate, monitor, and manage agents, LLM apps, prompts, and models.
Self-hostable workflow automation for AI operations.
Open-source platform that runs Claude agents in isolated containers across messaging channels, with per-agent memory and vaulted credentials.
Local model runner for open models and developer workflows.
Self-hosted coding agent framework built on LangGraph that is triggered via Slack, Linear, or GitHub and opens automatic PRs from isolated sandboxes.
OpenAI's open-source terminal coding agent that edits and runs code locally with sandbox and approval modes.
Build repeatable LLM and agent evals with OpenAI's open-source framework.
Generate clients, servers, docs, schemas, and config from OpenAPI specs.
Terminal-first AI coding agent for local development.
Run coding agents with a local GUI, CLI, SDK, sandboxes, and hosted options.
Developer workflow automation and API integration platform.
Use a fast Rust DataFrame query engine for local, lazy, and cloud-backed analytics.
Orchestrate resilient Python data pipelines with flows, tasks, schedules, and workers.
Open-source prompt testing and red-teaming framework.
AI security platform for models and applications.
Type-safe Python agent framework with tools, outputs, MCP, and evals.
Open-source evaluation framework for RAG and LLM application testing.
Scale Python and AI workloads with Ray Core and Ray AI libraries.
macOS launcher and extension platform with AI workflows.
Browser-based AI app builder inside Replit.
VS Code coding agent with planning and workflow modes.
Static analysis, SAST, secrets, dependency checks, and custom code rules.
Build embeddings, sparse encoders, rerankers, and semantic search pipelines.
MCP server discovery and deployment platform.
AI coding assistant for large codebase context.
Generate SDKs, CLIs, Terraform providers, and MCP servers from OpenAPI.
AI-assisted browser automation framework.
Turn Python scripts into interactive data apps, dashboards, and chat interfaces.
Open-source framework that extends Claude Code with slash commands, specialized agents, behavioral modes, and MCP integrations.
Generate SBOMs from images, directories, files, archives, and OCI layouts.
Build resilient long-running workflows with durable execution and worker-backed activities.
Background jobs and workflows for TypeScript apps.
Evaluate and trace agents, RAG systems, LLM apps, metrics, and regressions.
AI interface builder for React and web UI.
TypeScript toolkit for AI apps and streaming UI.
Serve open models with high-throughput inference and OpenAI-compatible APIs.
Tracking and evaluation toolkit for LLM applications.
Build AI retrieval systems with semantic search, hybrid search, RAG, and cloud-native deployment.
Agentic coding environment for multi-file AI development.
Enterprise automation platform for business workflows.
AI-assisted automation across business apps.
Fast collaborative code editor with AI assistance.