Senior Data Engineer
Location: DUMBO
Employment Type: Full-Time
Department: Engineering
About Us
We are a mission-driven technology company dedicated to enhancing transparency and accessibility in complex data ecosystems. Our AI-powered platform is designed to help organizations and individuals navigate intricate information landscapes, transforming fragmented data into actionable insights.
Our clientele spans various sectors, including large enterprises, legal firms, non-profits, and governmental bodies. Backed by industry-leading investors and advisors, our team is composed of passionate professionals committed to building impactful solutions.
The Role
We are seeking an experienced Senior Data Engineer to join our dynamic team. In this role, you will lead the design, development, and maintenance of robust data infrastructure and pipelines, contributing directly to our data platform and collaborating with cross-functional teams to deliver high-quality data solutions.
Responsibilities
Technical Leadership: Provide guidance and mentorship to data engineering team members, fostering a culture of continuous improvement and reliability.
Data Pipeline Development: Design and implement scalable batch and streaming data pipelines using tools like Apache Airflow.
Programming Expertise: Utilize Python to build modular, tested, and high-performance data processing solutions.
Cloud Infrastructure: Deploy and manage data systems on cloud platforms such as AWS or GCP, leveraging services like S3, EC2, Lambda, and Cloud Functions.
Database Management: Work with both relational and non-relational databases, optimizing for performance and scalability.
DevOps Practices: Implement CI/CD pipelines, infrastructure as code, and observability tools to ensure robust and reliable data operations.
Cross-Functional Collaboration: Partner with analysts, engineers, and stakeholders to deliver data solutions that meet organizational needs.
Qualifications
Experience: 5+ years in data engineering, with a strong portfolio of delivered projects.
Technical Skills: Proficiency in Python, experience with data pipeline tools (e.g., Airflow), and familiarity with cloud services (AWS/GCP).
Database Knowledge: Strong understanding of database systems, data warehousing, and performance optimization techniques.
DevOps & Infrastructure: Experience with CI/CD, infrastructure as code tools (e.g., Terraform), and monitoring solutions.
Soft Skills: Excellent communication skills and the ability to work effectively in a collaborative environment.
Passion: A strong interest in leveraging technology to enhance data transparency and accessibility.
Preferred Qualifications
Leadership Experience: Previous experience in a managerial or team lead role.
Advanced Tools: Familiarity with distributed data processing frameworks (e.g., Spark, Beam), data modeling tools (e.g., dbt), and search technologies (e.g., Elasticsearch).
Security & Compliance: Understanding of data security, privacy, and compliance best practices.
Compensation & Benefits
Salary: Competitive annually
Equity: Competitive stock options
Time Off: Unlimited PTO, sick leave, and paid federal holidays
Health Benefits: Comprehensive medical, dental, and vision insurance plans
Retirement Plan: 401(k) with company match
Work Environment: Autonomy in a high-growth startup setting, with a collaborative and passionate team