The chosen candidate will be an integral member of a dedicated team overseeing the engineering and management of expansive server infrastructure in a dynamic, multi-datacenter setting. This includes CPU and GPU compute resources, job scheduling, and associated infrastructure systems. The role involves creating solutions that empower our internal customers while collaborating with the team to ensure optimal system performance.
Responsibilities
Design, implement, and support HPC compute resources and related infrastructure.
Assist application teams in optimizing their workflows for enhanced performance.
Develop and maintain automation scripts (Python, Perl, Bash, Ansible) and the servers that support them (Satellite, Ansible Automation Platform).
Monitor systems and troubleshoot failures, with occasional on-call responsibilities.
Qualifications
Bachelor's degree in a relevant field or equivalent work experience.
5+ years of experience in Linux administration.
2+ years of experience in server automation.
Extensive knowledge of server and infrastructure automation.
Robust experience in architectural and engineering practices within enterprise Linux environments.
Proficiency in scripting with Python and Bash, and a willingness to learn additional languages.
Exhibits a high level of technical competence and professionalism.
Exceptional problem-solving and troubleshooting abilities.
Strong communication skills and familiarity with tools that support collaboration (Grafana, MkDocs, wikis, IM, MS Teams, etc.).
Proactive in seeking improvements to processes and services.
Ability to generate detailed technical documentation.
Adept at managing multiple complex projects simultaneously.
Nice to Have
Experience with configuration automation tools (Red Hat Satellite, Ansible Automation Platform, etc.).
Familiarity with HPC interconnect and communication technologies such as InfiniBand or MPI.
Knowledge of batch workload management systems.
Understanding of Site Reliability Engineering (SRE) practices and experience with Agile methodologies.
Experience managing Red Hat Enterprise Linux environments.
If you don’t meet every qualification but believe you can add value to our team at Ford Motor Company, we encourage you to apply!
As a global company, Ford offers you the flexibility to shape your career path. You can choose whether to expand your horizons internationally or stay closer to home. Whether you prefer to specialize in an area you love or explore diverse teams and skills, the choice is yours. We offer benefits that support work-life balance, including:
Comprehensive medical, dental, and prescription drug coverage.
Flexible family care, parental leave, and programs for new parents.
Employee discounts on vehicles, management leases, and more.
Tuition assistance for continuous learning.
Active employee resource groups promoting diversity.
Paid time off for individual and team community service activities.
A generous holiday schedule, including time off between Christmas and New Year's.
Option to purchase additional vacation time.
This role is predominantly remote, with the exception of those within 50 miles of Dearborn, MI, who are required to be on-site four days a week.
Visa sponsorship is not available for this position.
Relocation assistance is not provided for this position.
Candidates must be legally authorized to work in the United States and will need to verify employment eligibility during the hiring process.
Ford Motor Company is an Equal Opportunity Employer dedicated to a diverse workforce. All qualified applicants will be considered for employment irrespective of race, religion, color, age, sex, national origin, sexual orientation, gender identity, disability status, or protected veteran status. Reasonable accommodations are available to applicants with disabilities during the online application process.
#LI-Remote
#LI-DS2
SG7-SG8
Requisition ID: 57988