Job Description
JOB DESCRIPTION
We are seeking a mid-level data engineer to support healthcare data transformation and integration work involving legacy clinical systems. You'll work closely with JSON and Parquet extracts from the RPMS/VistA ecosystem and help translate these into structured HL7v2 and FHIR-compliant representations. Your work will directly enable semantic interoperability across tribal and federal health systems, supporting public health delivery for some of the most underserved populations in the country.
REQUIRED SKILLS AND EXPERIENCE
- 2–5 years of experience in data engineering or healthcare data roles.
- Strong Python skills, especially with pandas, json, and transformation of semi-structured data.
- Experience with or strong interest in healthcare data standards — HL7v2, FHIR, or both.
- Familiarity with reading and transforming Parquet and JSON datasets.
- Ability to reason through and normalize undocumented, legacy healthcare data.
- Comfortable using Git and collaborating via GitLab or GitHub.
- Strong communication and documentation habits.
- Comfortable working in Docker-based environments for development or testing.
- Familiarity with Git-based CI workflows (e.g., GitHub Actions, GitLab CI).
- Writes modular, testable code and is comfortable debugging pipelines in containerized setups.
- Experience managing environments via requirements.txt, pyproject.toml, or similar.
NICE TO HAVE SKILLS AND EXPERIENCE
- Familiarity with HL7v2 segments (e.g., PID, OBR, OBX) and FHIR bundles/resources.
- Experience transforming clinical data to meet interoperability or public health reporting standards.
- Exposure to Azure Synapse Pipelines, Spark, or other big data frameworks.
- Experience with Nix, container-based dev environments, or CI/CD workflows.
- Prior experience with federal, tribal, or public health systems.
- Experience with AWS services (e.g., S3, Lambda, Glue) or container orchestration tools like Kubernetes.
- Bonus points for Linux-first workflows and familiarity with Neovim or terminal-based tooling.
- Experience using Nix or other reproducible development environments is a huge plus.
Full-time