Site Reliability Engineer (SRE)
Location: Columbus, OH Iselin, NJ (Onsite)
Job Type: Long Term Contract
Key Responsibilities
Enhance platform reliability, performance, and observability
Build dashboards and alerts using APM tools (Splunk, ELK, Grafana, Prometheus, GCL)
Proactively identify performance bottlenecks and system risks
Support incident management and root cause analysis
Collaborate with Engineering, Security, Networking, and Infrastructure teams
Automate operational tasks using Shell scripting and DevOps tools
Support CI/CD pipelines and release processes
Required Skills
8+ years of Software Engineering experience
4+ years in Site Reliability Engineering
Strong experience with APM / monitoring tools (Splunk, ELK, Grafana, Prometheus)
Experience with distributed systems, relational & NoSQL databases
Knowledge of Redis, Memcache, MQ, Kafka
Hands-on Shell scripting, Ansible (YAML)
Experience with CI/CD tools (Git, Jenkins, UCD or similar)
Experience with Kubernetes / OpenShift, PCF, AWS or Azure
Tech stack: Java/J2EE, Spring Boot, Python, Kafka, Oracle, MongoDB
Diverse Lynx LLC is an Equal Employment Opportunity employer. All qualified applicants will receive due consideration for employment without any discrimination. All applicants will be evaluated solely on the basis of their ability, competence and their proven capability to perform the functions outlined in the corresponding role. We promote and support a diverse workforce across all levels in the company.