Post Job Free
Sign in

Sr. Network Observability Engineer SME

Company:
Netpace Inc
Location:
Pleasanton, CA, 94566
Posted:
May 15, 2025
Apply

Description:

Sr. Network Observability Engineer SME ( Azure/GCP/OCI, Grafana, MSFT/GCP/OCI tooling)

THIS JOB IS OPEN FOR FULLTIME/SALARIED CANDIDATES AS WELL

Key Responsibilities:

· Design and deploy scalable network observability frameworks for multi/hybrid-cloud environments (Azure, GCP, OCI) using Grafana, Prometheus, OpenTelemetry, and cloud-native tools.

· Implement custom dashboards, alerts, and log analytics for network performance metrics (latency, packet drops, BGP routing health, throughput) and security telemetry (firewall logs, flow logs, IDS/IPS).

· Integrate observability tools with cloud networking services:

o Azure: Monitor ExpressRoute/VNet Gateway metrics, NSG Flow Logs, Traffic Analytics.

o GCP: Stackdriver/Operations Suite for VPC flow logs, Firewall Insights, Network Intelligence Center.

o OCI: VCN Flow Logs, Network Visualizer, Service Connector Hub.

· Automate observability pipelines using Terraform, Python, or PowerShell to ingest, correlate, and visualize telemetry data.

· Troubleshoot network anomalies by analyzing packet captures (PCAP), NetFlow/sFlow, and distributed tracing data.

· Collaborate with SRE and DevOps teams to reduce MTTR via AI/ML-driven anomaly detection (e.g., Azure Sentinel, GCP Chronicle, OCI AI Anomaly Detection).

· Optimize costs by right-sizing monitoring tools and eliminating redundant telemetry data.

Required Skills & Experience:

· 8+ years in network observability, monitoring, or cloud operations, with expertise in Azure/GCP/OCI.

· Hands-on experience with:

o Grafana (dashboarding, Loki for logs, Mimir for metrics).

o Cloud-native tools: Azure Monitor, GCP Cloud Logging/Monitoring, OCI Observability & Management.

o Telemetry protocols: SNMP, gNMI, NetFlow/IPFIX, eBPF.

· Network diagnostics: Wireshark, tcpdump, traceroute, BGP route analytics.

· Automation/scripting: Python, Terraform, or equivalent IaC tools.

· Certifications (Preferred):

o Azure: AZ-120 (Monitoring), AZ-700 (Networking).

o GCP: Professional Cloud Network Engineer.

o OCI: Oracle Cloud Infrastructure Certified Architect.

o Grafana: Grafana Certified Associate (or higher).

Nice-to-Have:

· Experience with AIOps platforms (Dynatrace, New Relic, Splunk ITSI).

· Knowledge of Kubernetes networking observability (Calico, Cilium, Istio).

· Familiarity with compliance frameworks (ISO 27001, NIST CSF) for audit logging.

Apply