Edge, on-prem, or cloud. Any hardware your agents can reach. See every CPU and GPU each turn ran on, what it cost, and which sub-agent caused the spend.
The three pillars OpenInfer ships against. ClawMetry adds the fourth so you can prove it.
"We make it possible for the 90% of agentic workloads that are routine and always on, to run on leaner and often under-utilized compute topologies."Behnam Bastani · CEO, OpenInfer
An inference OS that treats compute as a scheduling problem. Latency-critical turns run on GPU. The other 90 percent run on CPU and Graviton. Sessions migrate across processors without re-paying the prefill cost.
Learn more about OpenInferReal benchmarks on commodity hardware. +50% capacity on a single AWS g6e.16xlarge by recruiting otherwise-idle CPUs into the inference fabric.
Learn moreOpenInfer makes the substrate schedule smartly. ClawMetry shows you what actually ran. Every turn, every chip, every dollar, every sub-agent. Local-first and end-to-end encrypted, so the substrate stops being a black box.
Read the joint launch postpip install clawmetry.Two ten-minute integrations. No code changes. No new dashboard to learn.
"OpenInfer turned the inference substrate from a fixed cost into a scheduling problem. ClawMetry's job is to make sure that, when the substrate gets smart, the operator does not get blind."Vivek Chand · founder, ClawMetry
Both halves are free to try. Neither requires changes to your agent code.
macOS, Linux, Windows. The installer figures it out.
Contact: hello@clawmetry.com · contact@openinfer.io