Post Job Free
Sign in

AI/ML Platform Manager

Company:
ScienceLogic
Location:
Reston, VA
Posted:
June 04, 2024
Apply

Description:

*This position can be remote within the US*

What we’re looking for…

ScienceLogic is seeking a AIML Platform Manager who will assume responsibility for leading multiple teams of highly talented software engineers in the design and development of our world class AI/ML hybrid cloud monitoring platform, SL1. The ideal candidate will have an established background in designing and building highly scalable software services, excellent project management skills, great communication skills and the motivation to achieve results in a fast-paced environment. We are looking for people with a very technical background in software development across multiple technology stacks in the development of highly scalable software services and systems. Domain experience in Network Monitoring, Infrastructure Monitoring, Machine Learning/AI, scalable architectures, Cloud (Private, Public, Hybrid) Technologies, Design and operational expertise in Software as a Service (Saas) systems is a big plus. The ideal candidate will have demonstrated design, development, coding skills in building scalable architecture systems designed for high throughput and scalability. Responsibilities include hiring, developing, mentoring, and managing a team of software development engineers, technical managers, and software architects.

What you’ll be doing…

Manage a team of talented software engineers responsible for the design and development of scalable services and software deliverables.

Manage and execute against program increments and fulfill delivery commitments.

Following an AGILE process to oversee and manage the day-to-day activities of a team of software developers in an open source environment using a technology stack that includes Linux, Python, Go, ClickHouse, OpenTelemetry, and microservices using containers (Kubernetes) and deployment through HELM.

Experience with latest ML technology stack including HuggingFace, Python libs, PostgresML, Pytorch, OpenAI would be highly preferred.

Designing algorithms, performing code reviews, and documenting complex designs for software assets.

Work in ensuring quality measures account for failure modes in design, and recovery.

Troubleshooting, debugging, maintaining and improve current software assets.

Manage multiple software development SCRUM teams and work closely with the architects and technical team leads to design and develop the optimal technical solution.

Work closely with the product management team and release management to ensure successful, on-time releases.

Work cohesively with product managers, project managers and engineering management to allocate engineering resources appropriately across various projects.

Manage expectations, set realistic goals, and achieve them on time and within budget.

Implement and report performance metrics to executive management.

Foster a culture of creativity, empowerment, collaboration, speed and innovation in a fun work environment while continuously elevating the quality and caliber of the overall software development organization.

Remove obstacles to create velocity, efficiency and greater team effectiveness.

Create a productive and healthy work/life balance, foster a flexible work environment, and keep the team focused, engaged, productive, and fulfilled.

Qualities you possess…

MS in Computer Science or related degree and 10 years of software development and management experience in building and scaling applications, ideally in an open-source environment and expert level understanding of Helm chart design, configuration and troubleshooting complex configuration/operational issues in Kubernetes.

Demonstrated experience in AGILE software development using JIRA, GitHub etc.

Demonstrated track record of successfully hiring, motivating, retaining and effectively deploying talented software developers to perform against aggressive delivery goals.

Experience managing both local and remote and international offshore software engineers.

Experience with a variety of software development processes, and ability to apply the right process for the project.

Strong interpersonal skills.

Proven ability to build productive relationships and motivate, develop, and mentor team members to grow their careers at ScienceLogic. Must have a positive, can-do attitude.

Start-up technical management experience is a plus.

Background and in-depth knowledge of network, cloud and application monitoring/management tools is also a plus.

Benefits & Perks

A remote-first culture - work from home or come into the office, it's totally up to you.

Comprehensive medical, dental and vision plans.

401(k) plan with employer match.

Flexible Paid Time Off (FTO) so that you can take the time that you need to re-energise.

Volunteer Time Off (VTO) - take two days off per calendar year to volunteer with your preferred charitable organization.

5-year Service Milestone Sabbatical.

Paid parental leave.

Generous employee referral bonus program.

Pet insurance.

HQ Office centrally located in Reston Town Center featuring a well-stocked kitchen with rotating snacks and beverages, and catered lunch on Thursdays.

Regular virtual company-wide events, including cooking classes, yoga, meditation and more.

The opportunity to learn and develop from some of the best and brightest minds in the industry!

Don’t meet every single requirement? Studies have shown that women and people of color are less likely to apply to jobs unless they meet every single qualification. At ScienceLogic, we are dedicated to building a diverse, inclusive and authentic workplace, so if you’re excited about this role but your past experience doesn’t align perfectly with every qualification in the job description, we encourage you to apply anyways. You may be just the right candidate for this or other roles.

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, or any other applicable legally protected characteristics in the location in which you are applying.

About ScienceLogic

We empower intelligent and automated IT operations.

The ScienceLogic SL1 platform enables companies to digitally transform themselves by removing the difficulty of managing complex, distributed IT services. We use patented discovery techniques to find everything in your IT environment, so you get visibility across all technologies and vendors running anywhere in your data centers or clouds

All ScienceLogic employees have the responsibility to protect information assets, adhere to access controls, report suspicious activity, and comply with security and privacy policies.

#LI-Remote

Apply