Software Engineering Position
Required Qualifications:
• 8+ years of software engineering experience
• 4+ years of experience in site reliability engineering teams with continued focus on improving platform health
• Familiar with agile or other rapid application development practices
• Hands-on expertise in building dashboards using APM tools
• Experience with distributed (multi-tiered) systems, algorithms, relational databases, and NoSQL databases
• Knowledge and exposure to caching tools (Redis, memcache) or messaging tools such as MQ, Kafka
• Must have working knowledge of APM tools such as splunk, GCL, ELK, Grafana, Prometheus etc.
• Able to create dashboards using GCL/Splunk/ELK and setup alerts
• Working knowledge of CICD is a plus – source control like Git, continuous integration – Jenkins / UCD release etc.
• Ability to work with engineering teams across the ecosystem such as security, networking & infrastructure challenges which can impact platform health & resiliency
• Shell scripting / DevOps tools like Ansible with good knowledge of yaml file to write playbooks
• Experience with distributed storage technologies like NFS as well as dynamic resource management frameworks PCF, Kubernetes / OpenShift, AWS or Azure
• Tech stack: Java/J2EE (Spring, Spring Boot, Python, shell scripting, Kafka, Oracle, MongoDB etc.)
• A proactive approach to spotting problems, areas for improvement, and performance bottlenecks