Twelve installable packages for MCP and A2A review, trace-to-regression workflows, LeRobot dataset gates, VLA robustness diagnostics, recovery metadata, and lab review packets.
Risk linting, contract tests, trace replay, OTEL bridges, trace cards.
Quality gates, recovery metrics, VLA robustness probes, embodiment cards.
Failure cases with reproducible commands. Outreach packets without endorsement.
Each is scoped to a command-line diagnostic, portable artifact, GitHub workflow, or review packet that a lab engineer can inspect without a sales call.
MCP risk linting, A2A contract tests, deterministic replay, OTEL bridges, trace cards, and prompt-rubric drift review.
pip install mcp-risk-linter
Risk taxonomy, CLI, and GitHub Action wrapper for MCP manifests, permissions, claims, and unsafe tool surfaces.
pip install a2a-contract-test
Offline contract tests for A2A agent cards, task lifecycle states, structured payloads, and errors.
pip install tool-call-replay
Deterministic replay harness that turns failed agent tool-call traces into local regression tests.
pip install agent-trace-card
Portable Markdown and JSON cards for one agent run: tools, retries, data touched, outcome, failure mode.
pip install otel-eval-bridge
Bridge OpenTelemetry and Phoenix GenAI spans into redacted eval regression cases.
pip install prompt-rubric-drift
No-model PR review notes for prompt and rubric changes: weights, criteria, boundaries, injection exposure.
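To make the trace-to-regression idea concrete, here is a minimal sketch of deterministic replay in plain Python. It does not use the tool-call-replay API: the trace format, the `ReplayTools` class, and both agent functions are hypothetical names invented for illustration. The recorded results are served back in order, so a failed run can be reproduced offline and a fix can be checked against the same recording.

```python
from typing import Any, Callable

# Hypothetical trace format (an assumption, not the package's schema):
# one dict per recorded tool call, in the order the agent made them.
RECORDED_TRACE = [
    {"tool": "search", "args": {"query": "invoice 42"}, "result": {"hits": []}},
    {"tool": "fetch", "args": {"doc_id": None}, "result": {"error": "missing doc_id"}},
]


class ReplayTools:
    """Serves recorded results in order and fails loudly on any divergence."""

    def __init__(self, trace: list[dict]) -> None:
        self._trace = list(trace)
        self.calls_served = 0

    def call(self, tool: str, **args: Any) -> Any:
        if self.calls_served >= len(self._trace):
            raise AssertionError(f"unexpected extra call: {tool}({args})")
        expected = self._trace[self.calls_served]
        if (tool, args) != (expected["tool"], expected["args"]):
            raise AssertionError(f"diverged at call {self.calls_served}: {tool}({args})")
        self.calls_served += 1
        return expected["result"]


def replay(agent: Callable[[ReplayTools], Any], trace: list[dict]) -> tuple[Any, int]:
    """Run the agent against recorded results; no network or model is involved."""
    tools = ReplayTools(trace)
    result = agent(tools)
    return result, tools.calls_served


def buggy_agent(tools: ReplayTools) -> Any:
    hits = tools.call("search", query="invoice 42")["hits"]
    doc_id = hits[0] if hits else None  # bug: fetches even when search is empty
    return tools.call("fetch", doc_id=doc_id)


def fixed_agent(tools: ReplayTools) -> Any:
    hits = tools.call("search", query="invoice 42")["hits"]
    if not hits:
        return {"status": "no_results"}  # stop before the bad fetch
    return tools.call("fetch", doc_id=hits[0])
```

Replaying `buggy_agent` reproduces the recorded failure on every run, while `fixed_agent` stops after the first call. A regression test can assert both facts against the same recording.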
LeRobot quality gates, recovery and intervention metrics, VLA robustness probes, and embodiment release cards.
pip install lerobot-quality-gates
Local quality gates for LeRobot-style datasets: metadata, episodes, sensors, action and state fields.
pip install robot-recovery-bench
Schema and metrics for human intervention and recovery segments, including repeated-failure clusters.
pip install vla-robustness-kit
Simulator-light VLA diagnostics for language, vision, metadata, task-phase, and embodiment perturbations.
pip install embodiment-card
Structured robot dataset and VLA release cards for sensors, action spaces, frames, control rate, limits.
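As a rough illustration of what a local quality gate on action and state fields might check, the sketch below validates one episode held in memory. The episode layout (`"action"` and `"state"` as lists of per-step vectors) and the `check_episode` function are assumptions made for this example; they are neither the LeRobot format nor the lerobot-quality-gates API.

```python
import math


def check_episode(episode: dict, action_dim: int, state_dim: int) -> list[str]:
    """Return human-readable problems; an empty list means the gate passes."""
    problems: list[str] = []
    actions = episode.get("action", [])
    states = episode.get("state", [])

    if not actions:
        problems.append("episode has no actions")
    if len(actions) != len(states):
        problems.append(
            f"length mismatch: {len(actions)} actions vs {len(states)} states"
        )

    # Check every step for the expected vector width and for NaN or inf values.
    for name, rows, dim in (("action", actions, action_dim), ("state", states, state_dim)):
        for t, row in enumerate(rows):
            if len(row) != dim:
                problems.append(f"{name}[{t}] has {len(row)} values, expected {dim}")
            elif any(not math.isfinite(v) for v in row):
                problems.append(f"{name}[{t}] contains a non-finite value")
    return problems


good = {"action": [[0.1, 0.2]], "state": [[0.0, 0.0, 1.0]]}
bad = {"action": [[0.1, float("nan")], [0.3]], "state": [[0.0, 0.0, 1.0]]}

good_report = check_episode(good, action_dim=2, state_dim=3)
bad_report = check_episode(bad, action_dim=2, state_dim=3)
```

Returning a list of messages rather than raising on the first problem lets a CI job print every issue for an episode in one pass.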
Synthetic failure cases with reproducible commands. Outreach packets that carry no endorsement claim.
pip install failure-gallery
Synthetic agent and robotics failure cases with reproducible commands and expected review labels.
Repo only · no PyPI rollout yet
Technical review packets, no-endorsement language, and a feedback-log schema for lab review asks.
Practical trust infrastructure. Not certification. Not endorsement. Read the source. Run it locally.