 
Description
SAIC is seeking a Data Scientist to develop Amazon Web Services (AWS)-based resources. The role requires skills spanning many compute, storage, and networking services.
This position is located in Chantilly, VA and requires an active TS/SCI clearance with Polygraph.
Job responsibilities include, but are not limited to:
- Architect, deploy, and maintain multiple fast-turnaround capabilities used to perform various highly visible, high-priority collection efforts.
- Strategically apply AI/ML to extract and format relevant content and expose it in indexed search tools. Content includes raw text, multimedia (audio, image, video, document), tabular (CSV, Parquet, Avro), nested (JSON, JSONL, XML), and other structured/unstructured data types. Data is expected to be of varying formats, schemas, and structures.
- Provide Data Engineering support to include cleaning, modeling, and formatting data of unknown formats. 
- Move data between different cloud storage environments for critical requests. 
- Coordinate with multiple entities, including mission partners, to ensure capabilities and deliverables meet defined requirements and tradecraft needs. 
- Create and maintain collection capabilities and deliverables within the Customer's Amazon Web Services environment utilizing Customer-approved AWS services.
- Validate collected data to ensure it meets data format requirements. 
- Maintain all source code in Customer's GitHub repository. 
- Document all source code, including how to execute the code. 
- Perform operations and maintenance on the collection capabilities and deliverables to adapt to changes in collection targets, technologies, data formats, and naming conventions.
Qualifications
- Active TS/SCI with Polygraph.
- Bachelor's degree and 9 or more years of experience, or Master's degree and 7 or more years of experience.
- Demonstrated experience with Python.
- Experience with geospatial software, programming packages, and data formats.
- Ability to create and manage AWS resources, including provisioning EC2 instances, writing and deploying Lambda functions, creating and writing to S3, and managing authorization appropriately across resources with IAM policies.
- Experience using GitHub.
Desired Skills:
- Experience deploying AWS applications with AWS's Cloud Development Kit (CDK). Ansible and Terraform are NOT a substitute for CDK.
- Experience building and deploying containerized applications.
- Experience building, programmatically working with, and maintaining search engines such as Elasticsearch, Lucene, or AWS's OpenSearch.
- Ability to maintain SQL and NoSQL databases.
- Experience with non-AWS cloud services such as Google Cloud Platform or Microsoft Azure.
- AWS DevOps Engineer, Solutions Architect, or SysOps Administrator certifications.