Title: DataStage ETL Lead Developer
Duration: 3-month contract to hire
Location: Onsite 4 days a week in Columbus Ohio
We are seeking a skilled DataStage ETL Lead Developer with experience in Python scripting and cloud-based data platforms. This role will be responsible for designing, developing, and maintaining ETL processes to support scalable and secure data pipelines across enterprise data systems.
Key Responsibilities:
Design and develop ETL workflows using IBM InfoSphere DataStage.
Build scalable data ingestion, transformation, and integration solutions.
Develop automation scripts using Python to optimize data processing and ETL jobs.
Deploy and support ETL solutions in cloud environments (e.g., AWS, Azure, GCP).
Collaborate with data architects, analysts, and other developers to ensure data quality and performance.
Troubleshoot and optimize existing ETL processes.
Document technical designs, processes, and data flows.
Required Qualifications:
4+ years of hands-on experience with IBM DataStage.
Proficiency in Python for scripting and data transformation tasks.
Experience with cloud platforms such as AWS, Azure, or GCP.
Strong understanding of data warehousing, data modeling, and SQL.
Experience working with large datasets in structured and semi-structured formats (e.g., CSV, JSON, Parquet).
Strong problem-solving and debugging skills.
Preferred Qualifications:
Experience integrating DataStage with cloud-native data services (e.g., S3, Azure Data Lake, BigQuery).
Familiarity with CI/CD pipelines for data workflows.
Knowledge of DevOps or infrastructure-as-code tools (e.g., Terraform, CloudFormation).
Experience with data governance, metadata management, and data lineage tools
Preferred:
IBM DataStage11.7, Zena, Unix, Shell Scripting, Databases - Db2, SQL, Kafka, SoapUI, IBM Data replication
Experience calling SOAP/RestAPI's from DataStage.
MettleCI for DataStage Code deployment.
Infogix for system balancing.