DevOps engineers and site reliability engineers at AI infrastructure and high-growth technology companies earned a median salary of $218K with a p90 ceiling of $300K in 2026, based on 1,000 active roles tracked by Recruiting from Scratch. Demand is concentrated at companies running large-scale AI training and inference infrastructure — where operational reliability is directly tied to product quality.
| Company | Role Focus | Verified p90 |
|---|---|---|
| Datadog | Observability / SRE | $385K |
| Databricks | ML Platform / SRE | $355K |
| Cloudflare | Network / Platform | $320K |
| Stripe | Payments Infrastructure | $310K |
| HashiCorp | Infrastructure / IaC | $295K |
> Hiring DevOps/SRE Engineers? Recruiting from Scratch tracks ATS boards and H1B filings for 163+ companies in this space. We typically present qualified candidates within 5 business days. Work with us →
These three role categories increasingly overlap in job descriptions but are meaningfully different in practice:
Platform Engineering — builds internal developer tooling, CI/CD infrastructure, and developer experience. Compensation is closest to software engineering. Site Reliability Engineering (SRE) — focuses on reliability, latency, and error budgets. Google pioneered this; most large-scale internet companies have adopted it. Compensation tracks closely with platform engineering. DevOps Engineering — bridges development and operations, focuses on deployment pipelines, cloud infrastructure, and automation. Title usage varies widely; compensation slightly lower than SRE/platform at comparable levels.At AI companies specifically, the "platform engineering" and "ML infrastructure" distinction has emerged — engineers who build the systems that train and serve models. These roles command the highest compensation in the DevOps/SRE family.
The DevOps and SRE talent market at AI companies is tight at the senior level. The engineers who can manage Kubernetes clusters at the scale AI training requires — or who can build the tooling that makes 100-engineer ML teams productive — are a small population and highly sought after. Companies hiring SREs often underestimate the comp premium these engineers expect at AI-native companies vs. traditional tech companies.
DevOps/SRE engineers with AI infrastructure experience — specifically: experience with GPU cluster management, ML training pipeline reliability, or inference serving at scale — are in a strong market position in 2026. The $300K ceiling in our dataset reflects this premium. If you're a DevOps/SRE engineer at a traditional software company evaluating AI company opportunities, expect 20–30% comp upside if you have transferable infrastructure skills.
DevOps and SRE engineers at AI infrastructure and high-growth tech companies earned a median of $218K with a p90 ceiling of $300K, based on 1,000 active roles tracked by Recruiting from Scratch in 2026.
Platform engineering focuses on internal developer productivity — CI/CD, tooling, developer experience. SRE focuses on production reliability — error budgets, latency, on-call, incident response. At AI companies, a third category (ML infrastructure/platform) has emerged that spans both, focused on the reliability and productivity of ML workflows specifically.
ML infrastructure engineers — who build training pipeline tooling, inference serving, and ML platform capabilities — typically earn 15–25% more than traditional SRE roles at the same company. The premium reflects the combination of systems engineering depth and ML workflow knowledge.
Recruiting from Scratch places infrastructure engineers at AI companies and high-growth technology startups. Get in touch to discuss your DevOps or SRE search.
On-call pay structures vary. Larger AI companies (Series C+) often pay an annual on-call stipend of $10K–$25K or per-incident compensation. Earlier-stage companies typically don't formalize on-call pay separately — it's baked into base comp expectations. This is an underrated item to negotiate: ask specifically about on-call frequency and any associated compensation before accepting.
Yes, consistently. ML infrastructure engineers — responsible for training cluster reliability, model serving uptime, and pipeline monitoring — earn 15–25% more than traditional SRE roles at the same company in our dataset. The combination of distributed systems knowledge and ML workflow understanding is rarer than either alone.
Data from Recruiting from Scratch's market intelligence platform: 1,000 active DevOps, SRE, and platform engineering job postings (role_family classification + title patterns) across our ATS network. Updated June 2026.
RFS places DevOps / SRE Engineers exclusively at VC-backed companies and pre-IPO startups. If you're evaluating an offer or exploring what's out there, we can help. Connect with a recruiter →
Tell us about your open roles and we'll start sourcing within 48 hours.