Post Job Free
Sign in

AWS Cloud Platform Engineer Lead

Company:
CYNET SYSTEMS
Location:
Reston, VA
Posted:
March 22, 2026
Apply

Description:

Job Overview:

Responsibilities:

Lead identification of program objectives and technical strategies; develop System Engineering Management Plans, Work Breakdown Structures, schedules, and performance metrics.

Design and implement risk and opportunity management plans, including mitigation strategies and disaster recovery planning.

Plan, design, and oversee system engineering projects and cloud platform initiatives.

Lead and manage systems engineering efforts across cross-functional teams.

Develop preventive techniques to avoid system failures and troubleshoot incidents to restore services efficiently.

Establish and maintain standard operating procedures to ensure service quality and consistency.

Gather and manage stakeholder requirements; perform functional analysis to meet business objectives.

Collaborate with network, development, and engineering teams to ensure reliable and scalable systems.

Install, configure, upgrade, and maintain cloud and system environments, including user management and networking configurations.

Mentor and guide junior and senior team members.

Stay current with emerging technologies, industry trends, and best practices through continuous learning and professional development.

Communicate architectural decisions, trade-offs, and long-term strategies effectively to stakeholders.

Support full lifecycle of services including design, deployment, and operations.

Enhance developer experience through automation and cloud-native solutions.

Provide system design consulting, capacity planning, and launch support.

Troubleshoot complex technical issues and engage with customers for resolution.

Lead cloud transformation and migration initiatives from legacy systems to modern architectures.

Implement cloud-native architectures including microservices, containers, and service mesh.

Ensure system scalability, reliability, and performance through automation and continuous improvement.

Monitor enterprise systems to identify trends, issues, and improvement opportunities.

Automate processes for patching, upgrades, and infrastructure maintenance.

Make data-driven decisions to improve delivery efficiency and system performance. Required Qualifications:

Bachelor s degree in Computer Science, Information Technology, or related field (or equivalent experience).

10+ years of overall IT experience with strong systems engineering background.

Minimum 5+ years of hands-on experience in AWS Cloud Platform engineering and administration.

3 5 years of experience in Site Reliability Engineering (SRE).

Strong experience with AWS services such as EC2, S3, RDS, Lambda, VPC, IAM, CloudFormation, CloudWatch, and EKS.

Hands-on expertise in Infrastructure as Code using Terraform, AWS CloudFormation, or CDK.

Experience with automation tools such as Ansible and scripting using Python and Bash.

Strong knowledge of DevOps practices, CI/CD pipelines, and automated provisioning.

Experience with containerization technologies such as Docker and Kubernetes.

Solid understanding of cloud architecture, design patterns, and best practices.

Experience with Linux system administration.

Strong problem-solving, troubleshooting, and analytical skills.

Excellent communication and collaboration skills. Preferred Qualifications:

Master s degree in a related field.

AWS certifications (e.g., AWS Certified Solutions Architect, AWS Certified DevOps Engineer).

Experience with enterprise cloud transformation and migration projects.

Familiarity with AI/ML services and patterns on AWS (e.g., Amazon Bedrock, Kendra, RAG models).

Experience with DevOps toolchains such as Jenkins, Bitbucket, Artifactory, and Jira.

Knowledge of service mesh, container security tools, and cloud-native monitoring solutions.

Experience working in Agile/Scrum environments. Key Skills:

AWS Cloud Architecture & Engineering.

DevOps & Automation (Terraform, Ansible, CI/CD).

Containerization & Orchestration (Docker, Kubernetes, EKS).

SRE & Reliability Engineering.

Python & Scripting.

Cloud Security & Compliance.

Microservices & Cloud-Native Design.

Monitoring, Observability & Performance Optimization.

Apply