Hiring
min read

How a Trust & Safety AI Startup Hired 12 ML Engineers in 6 Months

June 25, 2026

How a Trust & Safety AI Startup Hired 12 ML Engineers in 6 Months

Trust & safety is one of the most technically demanding and least celebrated domains in software engineering. The engineers who build content moderation infrastructure, harmful content detection systems, and AI-powered safety tooling are working on genuinely hard problems — adversarial content designed to evade detection, global language and cultural variation that breaks rule-based systems, and the real-world consequences of both false positives (legitimate content blocked) and false negatives (harmful content surfaced).

In early 2026, a well-funded trust & safety AI company came to us with an urgent need: 12 ML engineers in 6 months, ahead of a significant product expansion. This is their story — anonymized, but with the real numbers and real lessons.

The Challenge

The company's core product was an AI infrastructure layer that helped platforms detect and respond to harmful content at scale. They had raised a significant Series B and had large enterprise contracts contingent on delivery milestones that required expanded engineering capacity.

The problem: trust & safety ML is a niche. There are perhaps a few thousand engineers in the world with direct experience building classifiers, content understanding models, and adversarial detection systems for this domain — and a disproportionate number of them are at FAANG companies or large social platforms that pay extremely well and have strong retention incentives.

The 6-month timeline was real. Missing it had contractual consequences. The standard "post and pray" approach to hiring was going to fail.

Why This Search Was Harder Than It Looked

Domain specificity. A strong NLP engineer from a general company doesn't automatically understand the trust & safety domain. The failure modes are different (adversarial users actively trying to defeat your models), the evaluation is harder (what's the right false positive rate for a suicide prevention model vs. a spam filter?), and the ethical weight of the work creates selection in who wants to do it. Competition. Meta, Google, Twitter, Apple, and Microsoft all have large trust & safety teams that pay FAANG salaries and offer the scale of content moderation problems that ambitious engineers find compelling. A Series B startup competing for the same talent needs a strong counter-narrative. Volume at speed. 12 engineers in 6 months is 2 per month. At the typical offer acceptance rates for ML engineers (70%), that means 17 offers extended, which means roughly 40 final-round interviews, 80+ second-round interviews, and 240+ first-touch qualifications. That's a machine, not a one-off search.

The Sourcing Strategy

We focused on three overlapping pools:

Trust & safety adjacent alumni. Engineers who had worked on content moderation, fraud detection, or adversarial ML at platforms where this was a core function — not just a compliance checkbox. These engineers had developed the domain judgment that's hard to learn on the job. Academic NLP and safety AI researchers. The AI safety research community had grown significantly, and a subset of researchers were interested in applied work with real-world impact. A mission-driven trust & safety company was a compelling pitch for these candidates. Senior engineers from ML infrastructure roles at mid-size companies. Engineers who had solid ML production experience but wanted to work on a more focused, impactful problem. The contrast with broad platform ML work (where you might work on recommendation or ads) was a real recruiting advantage.

What Made the Pitch Work

The company had thought carefully about their recruiting narrative before we started:

Mission specificity. "Making the internet safer" is a generic pitch. "Building the infrastructure that lets a 50-person platform deploy the same content safety systems as a major social network" is specific, interesting, and solves a real problem. The specificity of the mission attracted engineers who found the problem genuinely compelling. Technical depth of the work. The company gave candidates a clear picture of the technical problems: multilingual content understanding at scale, adversarial robustness, and novel detection systems for emerging harmful content patterns. Engineers who had been doing generic ML work were energized by the specificity. The "right-sized" argument. At FAANG, a strong ML engineer works on one corner of a massive recommendation system. At a Series B, they own a significant piece of the core product. The ownership argument is real, and it landed with engineers who had experienced the impersonality of large-platform ML work.

The Numbers

MonthFirst touchesQualificationsSecond roundsFinal roundsOffersHires
118060241296
216054221186
Total34011446231712

Offer acceptance rate: 71%. Time-to-first-qualified-candidate: 9 days. Average time from first touch to offer: 38 days.

Key Lessons

High-volume ML hiring requires parallel pipelines, not sequential searches. Twelve searches running simultaneously means twelve active candidate funnels, twelve sets of interview panels, and twelve offers in various stages of negotiation. The coordination overhead is significant — invest in it. The domain expertise filter is a real constraint. We identified ~600 engineers globally who met the trust & safety domain criteria. Working from a known, finite pool changes the sourcing strategy: you need to reach most of the pool, not just find the easiest. Speed is a quality signal. Companies that process candidates quickly signal confidence and competence. A candidate who experiences a 3-day turnaround from first interview to second-round invite is more likely to stay engaged than one waiting two weeks. In a competitive market, process speed is a recruiting advantage.

Why Recruiting from Scratch

For high-volume, specialized ML searches with real deadlines, we build parallel sourcing pipelines with regular calibration and market intelligence reporting. We work as an extension of your recruiting function — you get the same level of context and communication you'd expect from an internal team. Start a specialized ML search →

Related: How to Hire an LLM / AI Engineer at a Startup · How to Hire 10 Engineers Per Month

Frequently Asked Questions

Q: Is trust & safety engineering a permanent demand area or a current trend? A: Structural demand, not a trend. The regulatory environment (EU Digital Services Act, US platform liability discussions), the growth of AI-generated content, and the adversarial sophistication of bad actors are creating permanent demand for engineers who can build adaptive safety systems. The domain will evolve but it won't shrink. Q: How do you evaluate an ML engineer's trust & safety domain knowledge? A: Ask them to describe a trust & safety system they've built or contributed to: what was the problem, how did they define success, what failure modes did they encounter, and how did they handle adversarial adaptation (attackers who change their behavior in response to your system)? Engineers with genuine experience will give specific, non-generic answers. Q: What's the compensation premium for trust & safety ML engineers? A: 10–20% above standard ML engineer rates at equivalent seniority, driven by domain rarity. Senior engineers with 5+ years of direct trust & safety experience command the higher end of the premium. Q: What's the biggest mistake in trust & safety ML hiring? A: Treating it as generic ML hiring and hoping domain knowledge develops on the job. The ethical weight of the work, the adversarial dynamics, and the precision required in evaluation metrics are domain-specific skills that develop over years. For core safety systems, under-hiring on domain experience is a quality and reputational risk.

For the latest engineering compensation benchmarks, levels.fyi and The Pragmatic Engineer are the most cited sources.

Ready to hire?

Tell us about your open roles and we'll start sourcing within 48 hours.

Learn more from our blog

Visit our blog