Job Description
JOB SUMMARY
We are seeking a highly skilled Senior Linux/AWS Cloud Engineer to design, implement, and maintain robust cloud and Linux-based solutions in a dynamic, mission-critical environment. This role will focus on delivering scalable automation, ensuring 24/7 system availability, and supporting complex application ecosystems. The ideal candidate will collaborate with cross-functional teams to optimize AWS infrastructure, manage Linux server environments, and drive operational excellence through innovative tools and processes. MUST BE ABLE TO OBTAIN A PUBLIC TRUST
KEY RESPONSIBILITIES:
Design and implement scalable Linux and AWS-based infrastructure
solutions to support a dynamic application environment.
Develop and maintain automation and productivity tools for system
provisioning, configuration, patching, monitoring and redundancy
strategies to maintain 24/7 availability of mission-critical systems,
adhering to industry best practices.
Collaborate with development teams to test, troubleshoot, and optimize
Tomcat-based Java applications in AWS and EKS environments.
Administer and manage EKS clusters, including configuration, tuning, and
upgrades.
Own and manage AMI creation, lifecycle management, and EC2 instance
provisioning.
Ensure security compliance through system hardening, configuration
management, and proactive monitoring practices.
Develop and implement robust backup, recovery, and redundancy
strategies.
Participate in an on-call rotation for production support and incident
response.
Create and maintain clear documentation for operational processes and
environment configurations.
MUST HAVE SKILLS:
8+ years of hands-on experience administering RedHat Enterprise Linux
7/8/9 and/or Amazon Linux 2/2023 in enterprise environments.
3+ years of experience managing AWS infrastructure (EC2, EKS, S3, IAM,
etc.), with a solid grasp of cloud architecture and best practices.
Strong experience with automation tools such as Ansible, Puppet, AWS
Systems Manager, or CloudFormation.
Proficiency in Shell and Python scripting for automation and
troubleshooting.
Solid understanding of system monitoring, incident response, and
performance tuning in cloud environments.
Excellent problem-solving and analytical skills, with a detail-oriented and
proactive approach.
Experience in Federal Grants Management is a plus.
Full-time