Description
SAIC is seeking a Data Scientist to develop Amazon Web Services (AWS)-based resources, which requires skills spanning many compute, storage, and networking services. This position is located in Chantilly, VA and requires an active TS/SCI clearance with Polygraph.
JOB RESPONSIBILITIES INCLUDE, BUT ARE NOT LIMITED TO:
* Architect, deploy, and maintain multiple fast-turnaround capabilities used to perform various highly visible, high-priority collection efforts.
* Strategically apply AI/ML to extract and format relevant content and expose it in indexed search tools. Content includes raw text, multimedia (audio, image, video, document), tabular (CSV, Parquet, Avro), nested (JSON, JSONL, XML), and other structured or unstructured data types.
Data is expected to be of varying formats, schemas, and structures.
* Provide Data Engineering support to include cleaning, modeling, and formatting data of unknown formats.
* Move data between different cloud storage environments for critical requests.
* Coordinate with multiple entities, including mission partners, to ensure capabilities and deliverables meet defined requirements and tradecraft needs.
* Create and maintain collection capabilities and deliverables within the Customer's Amazon Web Services environment using Customer-approved AWS services.
* Validate collected data to ensure it meets data format requirements.
* Maintain all source code in the Customer's GitHub repository.
* Document all source code, including how to execute the code.
* Perform operations and maintenance on the collection capabilities and deliverables to adapt to changes in collection targets, technologies, data formats, and naming conventions.
Qualifications
* Active TS/SCI with Polygraph.
* Bachelor's degree and 9 or more years of experience, or Master's degree and 7 or more years of experience.
* Demonstrated experience with Python.
* Experience with geospatial software, programming packages, and data formats.
* Ability to create and manage AWS resources, including provisioning EC2 instances, writing and deploying Lambda functions, creating and writing to S3 buckets, and managing authorization across resources with IAM policies.
* Experience using GitHub.
DESIRED SKILLS:
* Experience deploying AWS applications with the AWS Cloud Development Kit (CDK). Ansible and Terraform are NOT a substitute for CDK.
* Experience building and deploying containerized applications.
* Experience building, programmatically working with, and maintaining search engines such as Elasticsearch, Lucene, or AWS's OpenSearch.
* Ability to maintain SQL and NoSQL databases.
* Experience with non-AWS cloud services such as Google Cloud Platform or Microsoft Azure.
* AWS DevOps Engineer, Solutions Architect, or SysOps Administrator certifications.