About the Role
A fast-growing, venture-backed AI startup is building a monitoring platform for AI agents. Engineering teams at leading AI companies use the platform to detect and diagnose silent failures in production systems.
The product surfaces issues when AI agents behave unexpectedly, enabling engineers to investigate conversations or traces, identify root causes, and resolve problems quickly.
Why It Matters
AI agents fail in ways that are fundamentally different from traditional software. Instead of throwing clear errors, they often fail silently, making it difficult to understand real-world performance.
Today, teams either comb through large volumes of logs or debug evaluations that don't reflect production behavior. While evaluations validate specific test cases, real-world agents operate across complex workflows, interacting with tools, running long-lived processes, and handling unpredictable inputs.
This platform addresses that gap by learning the unique failure patterns of each AI system. It can detect issues such as incomplete execution, context loss, or task breakdowns, while also identifying previously unknown failure modes.
Engineers can track issues across all production data, analyze trends over time, understand user impact, and drill into relevant signals.
To support this at scale, the system processes large volumes of events and trains lightweight, company-specific models that adapt to how each product behaves in production.
As part of the early team, you'll play a key role in shaping product direction, technical strategy, and company growth.
Backers
The company is supported by top-tier venture capital firms and prominent founders/operators from leading AI and developer-focused companies.
Your Focus
Secure the platform end-to-end across application and infrastructure layers
Design and implement scalable security controls for high-throughput systems
Lead threat modeling, security reviews, and incident response efforts
Partner directly with customer security teams
Ideal Candidate
Experience in security engineering within high-growth startup environments
Familiarity with compliance frameworks (e.g. SOC 2, HIPAA)
Contributions to the security community (e.g. research, tooling, open source)
Interest in AI products and systems (hands-on experience preferred)
Strong ownership mindset with a bias toward action
Clear communicator, able to translate complex concepts effectively
Based in (or willing to relocate to) San Francisco
About the Company
The company builds a platform designed to help teams monitor and improve AI-powered applications in production.
It provides visibility into how AI systems behave, highlighting both failures and successful outcomes, and enabling teams to quickly investigate and resolve issues through direct access to underlying events and traces.