AI JOB
Site Reliability Engineer, Infrastructure - Analytics Platform
OpenAI
San Francisco
Posted:
Description
The Scaling team designs, builds, and operates critical infrastructure that enables research at OpenAI. Our mission is simple: accelerate the progress of research towards AGI. We do this by building core systems that researchers rely on - ranging from low-level infrastructure components to research-facing custom applications. These systems must scale with the increasing complexity and size of our workloads, while remaining reliable and easy to use. We're looking for an experienced Site Reliability Engineer to own production-critical infrastructure end to end...