Post Job Free
Sign in

Site Reliability Engineer (SRE) Lead with Telecom Experience

Company:
Tekskills Inc.
Location:
Bellevue, WA, 98009
Posted:
May 24, 2025
Apply

Description:

Job Title : Site Reliability Engineer (SRE) Lead with Telecom Experience

Location : Bellevue, WA (Hybrid Opportunity)

Duration : 12+ months

Experience – 10+ years who can take command of bridges and drive

Role:

An SRE who leads incidents and manages the bridge until the issue is resolved and tracks action items to closures, including Change management and problem management.

Responsibilities:

Actively driving incident calls working with Technical, Product SMEs and Tier 2 SREs.

Establishing a timeline of the incident progression and action item follow-ups until closure during or after the call.

Summarizing the discussion into knowledge articles, action items and doing warm hand-offs to Tier 2 teams.

Being adopters and advocates of Best-practices collated from experience and SMEs like OTEL, App Availability and Resiliency.

Sending reports of progress from past incidents to leadership. Change management and problem management.

Posting updates on AHOD and providing regular updates to leadership.

Skills:

Ability to focus on incidents and work with SMEs and Tier 2 Leads.

Attention to detail and catching the minutest detail spoken on a call.

Diligence in follow-ups and driving the SRE mandate to every team and partnering with them to operationalize best practices.

Great at detailed communication with ICs and precise, succinct communication with leadership.

Attitude to chase down even outlier issues to resolution.

Experience:

Someone with a strong technical, project management background.

Has worked in telecom industry and SRE Ops.

Has experience working in the Digital space and products.

Knowledgeable to leverage tools quickly like Splunk, Otel, Grafana, PowerBI, AppD.

Outcomes:

Operationalizing all the initiatives as part of SRE Transformation.

Drives SRE objectives by reducing P1, P2 numbers for identical issues by ensuring teams learn from the past.

Contributes to Knowledge base by partnering with SMEs.

100% closure of tasks (incidents, problem tasks, vendor resiliency etc) within SLAs.

Apply