Post Job Free
Sign in

Python Developer Data Engineer

Location:
Annandale, VA
Posted:
November 12, 2024

Contact this candidate

Resume:

VALERI KALMYKOV

**** ********** ** ********* ** 22003

410-***-****, **********@*****.***

Secret, Public Trust

Security Plus Certificate

Python Developer 05/2023-10/2024

Accenture Federal / Fort Belvoir VA

Service Now, ICAM, Python Linux Prod support, Oracle, Git, testing, debugging.

1)Get data from Cloud every week by cron job on Linux by SFTP, SSH, Python

2)Unzip file to json type

3)Split complex json data file to various types of data CSV files by Python

4)Convert CSV files to Oracle tables

Python / Data Engineer 08/2021-04/2023

Deloitte / FDA CIDEROne project, uploading data on AWS Databricks, pipeline,

From S3 on boarding files to Delta Data Lake Bronze, Silver, Golden, Tableau, did analytics

which include model training, predictions, pivot tables, graphs, charts and statistical report, reverse engineering PALANTIR for pipelines and license cost

Python 3.9, PyCharm, Jupyter, Docker, Jenkins, Pandas, NumPy

1) make notebook extract file from remote url to S3 AWS

2) make notebook transfer file from S3 to Databricks

3) make AIRFLOW Orchestration with 2 jobs

4) make config schema for extract / ingest

5) make json for a job and schedule it

Deloitte/ Anthem Medical Insurance

Manual / Automated testing with Python, JAVA Selenium, Health Innovation Platform project

Did manual testing of HIP web application, Used Deloitte automation software COFTA

Python Developer 02/2020-06/2021

Maximus Federal / NOAA Suitland MD

Did POES, GOES satellite support of legacy code on C/C++, FORTRAN,Windows,VAX,Unix

Used Visual Studio C++ debugger, GNU autoconfig, Cross Platform Development by Make

Did Production Data Flow customer support for foreign and domestic users by FTP scripts

Did Python / Perl data extraction from satellite telemetry for support of system health

Python Developer 06/2019-10/2019

HighPoint / CFTC, Washington DC

Set data failover Linux server on AWS and installed Anaconda Python data package libs

for client group up to 70 Data Engineers

Python Developer 02/2019-03/2019

Computer Future /DS-Science/AEEC/USPTO.gov Alexandria VA

Helped client with management of volume for data storage servers.

Used FLASK proxy API to extract data from various sources by Perl, Python3, Linux, Windows, MySQL, Used web scraping with beautiful soup for data extraction by url

JAVA legacy code transfer to Python

Python Developer 03/2018-07/2018

Ekuber McLean VA

Did Production support for open source Ckan Datacenter for GSA

Used Python 2.7, Solr, Jetty, Nginx,Vagrant, Postgresql, SQLAlchemy, Macos, Pylons

Did Python code changes in Git then deployed application by Jenkins using Vagrant VB

Python Developer 02/2018-03/2018

Insight Global / Ventech Manassas VA

Automatic patching of Solaris 10,11 Unix by shell scripts using Perl and Python scripts

Did porting of Perl scripts to Python

Python Developer 01/2018-02/2018

Infotech Arlington VA

Used Python, Tornado framework for web access of Centos Linux servers with Oracle

Database bash installation scripts and run them.

These servers handle Symantec Data Loss Prevention software

Python Developer 10/2017-12/2017

Take2 Rockwell Collins, Arlington VA

Used Flask REST services to build demo for Airline customer. Demo deals with case when your frequency band is busy, so you can use other airline available bandwidth.

Python script send request as json message which on Tomcat server changes frequency band without interruption. Output goes to Google maps.

Used Mongo DB, Windows, Git, JS

Data Engineer 11/2016-10/2017

CFPB.gov Farragut DC

Vagrant, Linux, Mesos, Marathon, Ckan, Postgres, Solr, Ansible, github.

Open source web development Ckan on Pylons framework, doing user interface on ninja2, database part on SQLAlchemy, Controller, templates files on Python on Linux Centos Vagrant Virtual Box, Elastic Search, Pyspark, JAVA, Docker, AWS Lambda, Glue

Deployment Ckan web app on Network by Jenkins running and testing.

Deployment uses Mesos with scheduling by Marathon, software download and installation is done by Ansible.Changing on need basis yaml files which are ran by Ansible and Mesos.

Data migration MS SQL Server to Postgres SQL by Singer pipeline

Python Developer 06/2015-10/2016

PKW/ LEIDOS / SSA Falls Church VA

Scan denied disability applications judge’s decision by OCR using Python and analyzing

Scanned data with use of Natural Processing, Machine Learning scikit packages.

Python automatic data extraction from SSA disability files and save them in SQL database

Word document is converted to XML file then to CSV file and saved in SQLite database.

Optical scanning of older TIFF images of mental and physical disability cases using Python

code together with NIH team. SAS with Python was used for statistical purposes, created unit test plans, and did code optimization. Data analysis was done by visualization, model training, predictions, statistics. Multiprocessing and Memory management for Data manipulation

Java Developer 05/2015-06/2015

IBM Reston VA

Support legacy Java, Java Script Oracle code for FEMA emergency applications

Python Developer (contractor) 08/2014-12/2014

Foreign Language Institute UM College Park MD

Educational software

Used Python on Linux to develop monitoring scripts of Apache, MySQL servers,

and Nagios to detect network intrusion

Used Angular, Flask, Django frameworks, Java Script for web development

Used Bitbucket, Git, for source files storage

Used SQLite to keep result of servers log and error files

Citrix was used for remote access of Virtual Machines VM, Vagrant on Ubuntu Linux

Used FTP for remote file transfer from Python script

Software Developer 06/2013-06/2014

AmTote Hunt Valley MD

Computer Support of Racing for Maryland, New York Racing Association. Windows 7, VC++, MS SQL reports. I used Microsoft Visual Studio 2013 for Bug fixing, System enhancements, additions.

Developed Foreign Para Mutual Betting System,

Python was used for automated testing of several real racing events, Triple Crown 2014, Preakness 2013.

Python Developer 12/2012-03/2013

Blue Cross, Owings Mills MD

Medical insurance industry

Python was used to develop Linux script on Virtual Machine Ubuntu which did:

CGI script runs on Apache server, pulls data from Postgres SQL,

present data in web browser as HTML

present data in XML file in RSS Feed

updated results are send automatically to Managers Outlook Mail Box.

Pulled financial data from web by PERL/PYTHON script using XML TREE, DOM

Used Django framework

used OCR tools with 508 compliance for development and testing

SANS cyber security classes

NOAA Senior Production Analyst (contractor) 05/2011-10/2012

Camp Springs MD

I run Weather Forecasting Models on IBM supercomputer.

1) Deployed new models for automatic run on Unix machine by LoadLeveler

2) Monitored them by SMS Linux box.

3) Tested existed models for additional changes and enhancements, for example (SREF,WRF,GFS,WAVEWATCH).

4) I responded to night calls for troubleshooting of model crashes and delays. This include monitor of data ingest from satellites or data input from other models. I have knowledge of GRIB,GRIB2. I supported AWIPS.

5) I used PERL script 24x7 automatically collected errors from all sources and make log file.

C/FORTRAN were used for support and development of weather forecasting models.

C/C++,FORTRAN/PERL/Python/shell script. Linux, Unix AIX, HPC, Open MP/MPI

I used GPFS and Open MP which is DISTRIBUTED SYSTEM

used OCR tools with 508 compliance for development and testing

SCIENTIFIC PROGRAMMER (contractor) 06/2010-12/2010

Fugro Earth Data Frederick MD

Production of surface maps from 2 air born Synthetic Aperture Radars (Interefometry) data in X and P bands. SAR / IFSAR data processing SGI 3900 POSIX, Altix 4700, Irix, Linux, FORTRAN / C, parallel programming Open MP/MPI, Jurassic Prok (JP), Porting of JP from SGI Irix to Altix Linux box. Performance improvements by re- engineering of source code with possibility to reprocess data faster. Improvements in performance of making shape files on C/C++, breaking JP in part to dump results on disk on 1 run and pickup data at 2 run for smaller needed map area, what allows redo it much faster. PERL code run data comparison produced by different methods.

SOFTWARE ENGINEER 02/2009-10/2009

NASA Greenbelt MD Astrophysics Department

Support SWIFT mission PERL based pipeline which provide telescope/satellite data on the web for scientific community all over the world within 2 hour of any new Gamma Ray Burst. 2)

Main PERL script used modules which were developed on FORTRAN., C++

FORTRAN, MATLAB, PERL, C, C++, I used Linux cluster as Distributed System

3) used OCR tools with 508 compliance for development and testing

EDUCATION

Ph.D. Physical Oceanography, Marine Hydrophysical Institute, Sevastopol, Ukraine

B.S./M.S. Physics, Moscow State University, Moscow, Russia

US CITIZEN



Contact this candidate