

Rifft provides developers with tools to monitor and diagnose issues in AI agent systems built on frameworks such as CrewAI and LangGraph. By automatically capturing spans for tool calls and state mutations, the platform reconstructs the sequence of events leading to breakdowns and highlights the initial point where incorrect data was introduced. The system organizes detected problems into a taxonomy of failure classes including unverified outputs, repeated tool invocations, schema mismatches during agent transitions, and excessive context accumulation. Users can then fork a run at any identified span, apply adjustments, and compare results side by side to validate improvements before deployment. Integration requires minimal code changes through a simple wrapper that works across supported stacks and open telemetry setups. This approach reduces the time spent reviewing extensive logs by focusing attention on causal chains rather than surface-level error messages.
Trace bad state back to its origin handoff, classify the failure mode with MAST, and view causal attribution instead of symptoms.
Fork any trace at a chosen span, apply patches or new agents, then replay end-to-end to validate fixes before shipping.
Receive NL explanations and alerts for classified failures while automatically instrumenting handoffs and tool calls across runs.
Pricing model: Freemium. Plan details are indicative — check the site for current prices.
Our take: Rifft is a solid coding & dev choice. It's valued for minimal code changes (one import, ~5 lines) and fast root-cause identification vs raw logs. The main trade-off is free tier limited to 50k spans/month. A good pick if you want capable AI without a high upfront cost.
MAST taxonomy detects four classes of failures including unverified output, tool loops, handoff mismatches, and context overflow.
Rifft is a solid coding & dev choice. It's valued for minimal code changes (one import, ~5 lines) and fast root-cause identification vs raw logs. The main trade-off is free tier limited to 50k spans/month. A good pick if you want capable AI without a high upfront cost.
Verified reviews from the community shape this tool's rating.
Loading reviews…
Similar coding & dev tools worth comparing.