Job Description
The Data Engineer will leverage their development skills and experience, to support the successful designing, ingesting, cleansing, transformation, loading, and display of significant amounts of data.
Roles and Responsibilities
Designing, implementing, and optimizing data extraction, cleansing, transformation, loading, replication/distribution, and large-scale ingest systems in a Big Data environment
Optimizing all stages of the data lifecycle, from initial planning, to ingest, through final display and beyond
Developing custom solutions/code to ingest and exploit new and existing data sources
Developing data profiling, deduping logic, and matching logic for analysis
Organizing and maintaining Data Layer documentation, so others are able to understand and use it. Also, work closely with data scientists to craft data pipelines which serve the development of modern AI/ML workflows
Collaborating with teammates, other service providers, vendors, and users to develop new and more efficient methods
Effectively articulating the risks and constraints associated with software solutions, based on environmentRequired Skills
High School Diploma/GED with 2+ years of relevant software development/programming experience.
Demonstrated data analysis, parsing, and programming language experience (e.g. Python, Java) coupled with significant SQL/database experience.
Experience with the full data lifecycle, from ingest through display, in a Big Data environment.
Hands-on experience with Java-related technologies, such as JDK, J2EE, EJB, JDBC, and/or Spring, and experience with RESTful APIs.
Experience with data pipelining systems (e.g. Apache Airflow) and developing/performing ETL tasks in a Linux environment.
TS/SCI clearance with a polygraph Bachelor's degree in Computer ScienceDesired Skills
Experience deploying systems that leverage AI/ML technology
Experience publishing results in BI dashboards.
Full-time