Python and parallel processing algorithm
Excellent in pandas, numpy, scikitlearn, scipy, all types of multiple large file data processing, complex joins and business logic
Working knowledge of Linux including file manipulation
Working knowledge of PostgreSQL
Design, test, and modify data processing algorithms using real and/or surrogate data
Design, test, and modify big-data processing algorithms and enable implementation in either Python suitable for a cloud-based architecture
Engage with users to understand needs an analytic strength/weakness of developed algorithms
Perform data analysis to identify algorithms technical limitations or defects, document findings and suggest solutions
Process/manipulate data and provide recommendations to provide additional utility beyond manual exploitation Additional comments:
Good communication skills required.
Local Candidates Only
Must be a US Citizen
Candidate should be clearable and should be able to obtain a public trust.
Candidates will only start after the public trust is approved.
4-6 weeks wait for public trust.
Would like to see applicants work on Github