Site Reliability Engineering - CTJ - Poly

Company:
Microsoft Corporation
Location:
Reston, VA
Posted:
March 22, 2026

Description:

Overview

Microsoft has an exciting opportunity to join the Microsoft Sovereign Cloud organization as part of the Windows Cloud Experiences Sovereign team.

This team supports Windows 365 Cloud PC/Azure Virtual Desktop (AVD) technologies, which are redefining how people work by bringing secure, high-quality Windows experiences to the cloud.

Our work enables remote computing experiences that are more secure, resilient, and easier to manage for customers operating in highly regulated environments.

The Microsoft Sovereign Clouds organization is focused on delivering secure productivity solutions for some of the world's most critical and sensitive customers.

We tackle complex technical and operational challenges to deliver reliable, high-performance services at scale.

Our culture emphasizes a growth mindset, innovation, collaboration, and inclusion.

On the Windows Cloud Experiences Sovereign team, you will collaborate with engineers across disciplines to deliver and maintain Windows 365/Azure Virtual Desktop clients and infrastructure that power secure, high-quality remote work experiences.

You will also take part in core SRE responsibilities such as improving service reliability, enhancing monitoring and alerting, assisting with incident response, and building automation to reduce operational toil.

Microsoft's mission is to empower every person and every organization on the planet to achieve more.

As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals.

Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Responsibilities

* Responds to incidents during regular on-call rotations by identifying the level of impact, troubleshooting basic issues, taking appropriate action to mitigate impact, and deploying appropriate fixes to resolve root cause(s). Notifies product teams and owners of major customer-impacting issues and escalates the resolution of complex issues and/or those affecting multiple components or features to other engineers as needed. Contributes details and resolutions through post-mortem reports and review meetings.

* Uses existing tools to troubleshoot problems or flaws affecting the availability, security, reliability, performance, and/or efficiency of components or features with guidance from other engineers. Suggests potential solutions to resolve and prevent recurring issues and brings them to the attention of other engineers or team leads.

* Develops an understanding of how to safely and reliably manage changes in production by using existing tools and automation, including the safe deployment process (SDP), to enable product engineering teams to implement changes across a defined range of components or features, with direction from other engineers.

* Develops an understanding of the code, features, and operations of specific products at scale as required to contribute to incremental improvements in product availability, security, quality, observability, reliability, efficiency, and/or performance. Participates in on-boarding, code/design reviews, and regular meetings with the engineering teams that develop and/or manage those products.

* Supports ongoing engagements with product engineering teams by participating in code/design reviews, regular meetings, on-call rotations, and incident responses throughout product development and operations cycles. Draws insights from product engineering teams and basic analyses of telemetry data to propose potential improvements to code and designs for a defined set of product components or features with guidance from other engineers.

Qualifications

Required/Minimum Qualifications

* Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.

Other Requirements:

Security Clearance Requirements: Candidates must be able to meet Microsoft, customer, and/or government security screening requirements for this role.

These requirements include, but are not limited to, the following specialized security screenings:

* The successful candidate must have an active U.S. Government Top Secret Clearance with access to Sensitive Compartmented Information (SCI) based on a Single Scope Background Investigation (SSBI) with Polygraph. Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. Failure to maintain or obtain the appropriate U.S. Government clearance and/or customer screening requirements may result in employment action up to and including termination.

* Clearance Verification: This position requires successful verification of the stated security clearance to meet federal government customer requirements.

You will be asked to provide clearance verification information prior to an offer of employment.

* Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

* Citizenship & Citizenship Verification: This position requires verification of U.S. citizenship due to citizenship-based legal restrictions. Specifically, this position supports United States federal, state, and/or local government agency customers and is subject to certain citizenship-based restrictions where required or permitted by applicable law. To meet this legal requirement, citizenship will be verified via a valid passport, other approved documents, or a verified U.S. Government clearance.

Preferred/Additional Qualifications

* Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.

Site Reliability Engineering IC3 - The typical base pay range for this role across the U.S. is USD $100,600 - $199,000 per year.

There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $131,400 - $215,400 per year.

Certain roles may be eligible for benefits and other compensation.

Find additional benefits and pay information here:

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.

Microsoft is an equal opportunity employer.

All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances.

If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
