Post Job Free
Sign in

Cloud Monitoring (Observability) Engineer

Company:
Tantus
Location:
Washington, DC
Posted:
May 20, 2025
Apply

Description:

Tantus Technologies, Inc. (Tantus) - recognized by the Washington Post as a Top Workplace - is seeking a skilled Observability Engineer with deep expertise in New Relic and other observability tools to help design, implement, and manage our monitoring and observability infrastructure. You will work closely with engineering, DevOps, and site reliability teams to ensure application performance and reliability through actionable telemetry and insights.

What You'll Do

Design and implement observability solutions using New Relic and other observability tools, including custom dashboards, alerts, and instrumentation.

Collaborate with development and DevSecOps teams to define monitoring requirements and SLIs/SLOs.

Optimize performance and troubleshoot issues using logs, metrics, and traces.

Integrate New Relic with CI/CD pipelines and other monitoring tools.

Lead efforts to mature the observability practice across the organization (standardization, training, documentation).

Work with federal stakeholders to set up and manage an Observability Center of Excellence.

Participate in on-call rotation and incident response to provide insights from observability data.

Stay up-to-date with New Relic feature releases and observability trends.

Required Knowledge and Skills

Degree in Computer Science, Mathematics, Engineering, or equivalent professional experience

5+ years of experience in observability, Site Reliability Engineer, DevOps, or infrastructure engineering roles.

A working knowledge of AI/ML-driven monitoring solutions

At least 10 years overall IT SLDC and Cloud experience

Strong understanding of distributed systems, microservices, and cloud-native architectures.

Solid knowledge of cloud platforms (AWS and Azure).

Experience working with containerized environments (e.g., Docker, Kubernetes).

Familiarity with Terraform, CloudFormation, or other infrastructure as code tools.

Abilities

Hands-on experience with New Relic (APM, Infrastructure, Browser, Logs, Synthetics, Dashboards).

Exposure to AIOps tools (e.g., New Relic AI, Moogsoft, BigPanda, or similar).

Proficiency in instrumenting code (e.g., custom events, traces) in at least one language (e.g., Java, Python, Node.js).

Nice to Haves

Certifications in New Relic or AI/ML frameworks, nice to have.

Experience with other observability tools (Datadog, Site 24x7 and WhatsUpGold).

Exposure to ITIL or ITSM practices.

Strong communication and documentation skills.

Salary Range

Salary range is $140,000-160,000/year. The salary range for this position reflects a variety of factors that influence compensation decisions, including skills, experience, training, certifications, and organizational needs.

Regular Full-Time

Apply