Sr. Network Observability Engineer SME (Azure/GCP/OCI, Grafana, MSFT/GCP/OCI tooling)
This job is open to full-time/salaried candidates as well.
Key Responsibilities:
· Design and deploy scalable network observability frameworks for multi-cloud and hybrid-cloud environments (Azure, GCP, OCI) using Grafana, Prometheus, OpenTelemetry, and cloud-native tools.
· Implement custom dashboards, alerts, and log analytics for network performance metrics (latency, packet drops, BGP routing health, throughput) and security telemetry (firewall logs, flow logs, IDS/IPS).
· Integrate observability tools with cloud networking services:
o Azure: Monitor ExpressRoute/VNet Gateway metrics, NSG Flow Logs, Traffic Analytics.
o GCP: Use the Operations Suite (formerly Stackdriver) for VPC Flow Logs, Firewall Insights, Network Intelligence Center.
o OCI: VCN Flow Logs, Network Visualizer, Service Connector Hub.
· Automate observability pipelines using Terraform, Python, or PowerShell to ingest, correlate, and visualize telemetry data.
· Troubleshoot network anomalies by analyzing packet captures (PCAP), NetFlow/sFlow, and distributed tracing data.
· Collaborate with SRE and DevOps teams to reduce mean time to resolution (MTTR) via AI/ML-driven anomaly detection (e.g., Microsoft Sentinel (formerly Azure Sentinel), GCP Chronicle, OCI AI Anomaly Detection).
· Optimize costs by right-sizing monitoring tools and eliminating redundant telemetry data.
Required Skills & Experience:
· 8+ years of experience in network observability, monitoring, or cloud operations, with expertise in Azure/GCP/OCI.
· Hands-on experience with:
o Grafana (dashboarding, Loki for logs, Mimir for metrics).
o Cloud-native tools: Azure Monitor, GCP Cloud Logging/Monitoring, OCI Observability & Management.
o Telemetry protocols and technologies: SNMP, gNMI, NetFlow/IPFIX, eBPF.
· Network diagnostics: Wireshark, tcpdump, traceroute, BGP route analytics.
· Automation/scripting: Python, Terraform, or equivalent IaC tools.
· Certifications (Preferred):
o Azure: AZ-120 (Monitoring), AZ-700 (Networking).
o GCP: Professional Cloud Network Engineer.
o OCI: Oracle Cloud Infrastructure Certified Architect.
o Grafana: Grafana Certified Associate (or higher).
Nice-to-Have:
· Experience with AIOps platforms (Dynatrace, New Relic, Splunk ITSI).
· Knowledge of Kubernetes networking observability (Calico, Cilium, Istio).
· Familiarity with compliance frameworks (ISO 27001, NIST CSF) for audit logging.