Some things to note on the types of skills and candidates we need:
Senior-level candidate with at least 4 years of experience
Has experience with virtualization
Hands-on experience with VMware or a similar vendor or product is very beneficial
Knows how to manage a large fleet of virtual machines and which building blocks are involved
Must know the basics of networking
Must have experience with automation
Strength with Python as a scripting language
Ansible, Chef, Terraform: experience with at least one scripting language or automation tool
Basic admin skills, mostly in the Windows world, but should have some Linux as well. They run a lot of things on Windows/Biz Tech but are shifting towards Linux (70% Windows, 30% Linux)
Remote access technology protocols are a plus

Job Description:
Site Reliability Engineer
Periodic updates and maintenance of Windows-based golden images for ESX & AWS
Patching of software, systems, appliances, etc., through scripting or manual processes
Disaster recovery planning and testing of various products and distributed systems
Deployment and maintenance of infrastructure and applications in AWS using IaC
Automate the process of building VM images for ESX, AWS, OpenStack
Adoption of tooling and best practices in the space of on-prem and public cloud infrastructure and application management
Telemetry improvements (logging, monitoring, etc.) for various systems and applications
Performance tuning/optimization for systems and applications
Escalated user support
Skills required
Scripting (PowerShell, Python)
Automation tools such as Ansible, Chef, Terraform, Packer
A virtualization platform such as ESXi or OpenStack
AWS operations (EC2, S3, Lambda, ELB)
Windows administration and basic navigation skills with Linux
Basic networking concepts and troubleshooting
Basic knowledge of security concepts and protocols such as HTTPS, TLS, etc.
General understanding of distributed systems
Prior experience or working knowledge of remote access protocols such as RDP or Citrix will be a plus