Post Job Free
Sign in

Data Engineer - Document Data Structures Specialist

Company:
Sybal Corp
Location:
North Bethesda, MD
Pay:
110000USD - 130000USD per year
Posted:
May 05, 2025
Apply

Description:

Job Description

About Us:

We are a cutting-edge technology company focused on delivering secure, scalable data and governance solutions. We’re seeking a skilled Data Engineer who specializes in researching, defining, and modeling data structures embedded in diverse document types. This role is crucial to powering intelligent systems that extract and normalize structured and unstructured data across a variety of content sources.

Position Overview:

As a Data Engineer, you will be responsible for analyzing diverse document types (PDFs, XMLs, JSONs, Word docs, regulatory filings, contracts, etc.), identifying embedded data patterns, and designing efficient data structures to support parsing, transformation, and integration into enterprise systems. You will work closely with our data science, product, and engineering teams to ensure that extracted data is accurate, meaningful, and scalable.

Key Responsibilities:

Document Analysis & Research:

o Investigate and understand complex document formats and layouts across industries (legal, regulatory, financial, technical).

o Identify data patterns and relationships in structured and semi-structured documents.

Data Modeling & Structure Design:

o Define and document logical and physical data models that represent data found in various document types.

o Design schemas and structures optimized for downstream processing and retrieval.

Pipeline Development & Integration:

o Collaborate with engineering teams to integrate extracted data into processing pipelines and databases.

o Build modular code to support document parsing and normalization using scripting and automation tools.

Collaboration & Documentation:

o Work with product managers and analysts to define requirements and ensure accuracy and usability of data definitions.

o Maintain comprehensive documentation of all data structures, logic, and assumptions.

Qualifications:

Experience:

o 3–5+ years of experience in data engineering, information modeling, or related fields.

o Proven ability to work with document formats such as PDF, DOCX, XML, JSON, and HTML.

Technical Skills:

o Strong proficiency in Python or a similar language for data parsing and transformation.

o Experience with data modeling tools and techniques (e.g., ERD, schema design, normalization).

o Familiarity with document processing libraries such as PyMuPDF, pdfminer, textract, or similar.

o Knowledge of databases (PostgreSQL, NoSQL, etc.) and data warehousing concepts.

Analytical & Organizational Skills:

o Strong research skills and attention to detail when identifying data within unstructured or complex documents.

o Ability to design clear, reusable models that map to varied content formats and business requirements.

Bonus Skills (Preferred):

o Experience with NLP, OCR, or document classification techniques.

o Familiarity with regulatory or legal documents and compliance data extraction.Company Description

Founded in 2020 and HQ’d in North Bethesda, Maryland, Sybal® serves as a governance

innovation firm whose mission is to innovate governance for trusted outcomes. Our mission is

the driving force behind our work, pushing us to reshape and elevate the standards of

Governance so that others may thrive.

Our vision consists of a world where organizations

operate with unparalleled effectiveness, transparency, and accountability - elevating the quality

of missions and their impact on the communities they serve.

Our Company Culture is grounded in our ability to BE accountable, actionable, and agile while

continuously learning and operating with integrity. Your unique talents, desire to grow, and

success at Sybal® will directly impact the success of our culture and our people. Come ready to

achieve new heights and help others do the same!

As a team member, you will contribute to the enhancement and expansion of Sybal’s patented

enterprise software solution, Proof of Governance® - the world’s first computational

governance solution designed to independently measure the effectiveness of policy in real-time

and establish proof of wellness for organizations governed by policy.

As an Awardable solution to the United States, Department of Defense, your work will directly

contribute to helping our nation’s warfighters and civilian servants keep our nation secure.

Part-time

Apply