platoseed

Trainy

Active

Infrastructure for managing GPU clusters for training/serving.

Summer 2023 · Founded 2023 · 3 people · San Francisco, CA, USA; Remote
Pivot / rename signal

This company previously operated under the name "AB Labs". A rename frequently marks a pivot in positioning or product, which makes it useful raw material for variant ideas.

About

Goodbye Slurm, Hello Konduktor. Trainy Konduktor is a software platform for AI teams to schedule workloads with priority, control resource allocation, and improve GPU reliability. With Konduktor, teams submit jobs to a healthy pool of GPUs, assign job priority with a simple user interface, and never worry about hardware faults again.

Founders · 2

Roanak Baviskar · Founder

Studied CS & Mathematics at UC Santa Cruz. Led the Audio team at Hive AI, where we trained and deployed 500M-parameter-scale models to production.

Andrew Aikawa · Founder · Berkeley

Co-founder and CTO at Trainy, building a training platform to make deep learning go faster. Previously a lead Machine Learning Engineer for Hive AI's object detection products. I completed my Physics Ph.D. at UC Berkeley in '22, where my thesis focused on applying computer vision and deep learning to nanoscience. Physics & Computer Science B.A., UC Berkeley '17.

Launch

Launched on Y Combinator · Jun 2023

Dashboards to help ML engineers training large models isolate performance bottlenecks and boost training speed.

Trainy builds a dashboard that aggregates profiling data across many GPUs to help ML engineers training large models identify performance bottlenecks, isolate slow GPUs, and optimize training speed by balancing computation, communication, and memory operations.

From the original launch (Jun 2023); may be outdated.

B2B · Developer Tools · Machine Learning · SaaS · Infrastructure
