
RunAnywhere
ActiveThe default way of running on-device AI at Scale
About
Edge AI is inevitable, but shipping it is painful: every device class behaves differently, runtimes vary, models are huge, and performance collapses under memory/power constraints. RunAnywhere turns that into an enterprise-ready workflow: one SDK to run models on-device, plus a control plane to manage models, enforce policies, and measure outcomes across thousands of devices.
Founders ยท 2
Former Intuit engineer building RunAnywhere, the infrastructure layer for deploying fast, private, multimodal AI on-device at scale. Deep background in mobile SDKs, platform tooling, and developer products, including systems used by 50M+ active users. Previously founded products across consumer discovery, context management, agentic documentation, and mobile testing, and now focused on making on-device AI production-ready across mobile, edge, and embedded devices.
Co-founder & CTO of RunAnywhere (W26). Built MetalRT: the first complete multi-modal inference engine for Apple Silicon. Custom Metal GPU kernels that pushed on-device voice AI from 900ms to ~110ms. Ex-Amazon EC2 Spot ($100M+ ARR), Ex-Microsoft Azure. Peer-reviewed researcher.
Related startups

Enabling Model-Hardware Co-Design at Scale

The first AI agent for optimizing ML model inference on edge hardware



