Software Developer - Automation, IBM Corporation, Armonk, New York, and various unanticipated client sites throughout the US: Act as Site Reliability Engineer and design, develop, and templatize Grafana dashboards for analysis and visualizing cloud incidents, change requests, and customer cases for different platforms like cloud operations, infrastructure operations, and networking operations. Develop anomaly detection systems by introducing company cloud logs data and utilizing machine learning algorithms such as Deep Anomaly Detection (Deep AD), Reconstructive Anomaly Detection (Reconstruct AD), Deep Neural Network (DNN) Autoencoder, and Isolation Forest. Containerize applications for implementing health checks for compute, storage, and network tribes in cloud production data centers. Orchestrate container applications and monitor them for high availability by running health checks. Lead the creation, implementation, and systematic scheduling of data quality reports tailored to different types of cloud operations tickets, including incidents, change requests, and cases. Summarize and interpret incidents from Slack and ServiceNow to document and train on machine learning algorithms for providing a resolution assistant feature with a predictive operating procedure to users, reducing human errors. Design and implement RESTful services and APIs using Flask to efficiently fetch and query data from various IBM Cloud services. Categorize, explain, identify, and detect error logs using Generative AI models. Implement a REST API to fetch IBM Cloud Logs and integrate the queried logs with AI models, such as the Granite, Llama family of LLMs, to extract, explain, and identify key logs. Label logs and employ supervised machine learning techniques integrated into the anomaly detection system to enhance the detection of anomalies and improve the monitoring and reliability of company cloud services. Utilize: Python, Machine Learning Libraries, Natural Language Processing for Data Processing, Machine Learning Techniques, Database Management, Jupyter Notebooks, Visual Studio Code, GitHub, Docker. Required: Master’s degree or equivalent in Computer Science, Engineering or related (employer will accept a Bachelor's degree plus five (5) years of progressive experience in lieu of a Master’s degree) and one (1) year of experience as a Software Developer or related. One (1) year of experience must include utilizing Python, Machine Learning Libraries, Natural Language Processing for Data Processing, Machine Learning Techniques, Database Management, Jupyter Notebooks, Visual Studio Code, GitHub, Docker. $179982 to $235000 per year. Please send resumes to . Applicants must reference D124 in the subject line.
Minimum Salary: 179982
Maximum Salary: 235000
Salary Unit: Yearly