• Location: USA, New York
  • Date Posted: 8th May, 2020
  • Reference: OO05082020AWS_1588976038
Job Description

Data is core to my clients strategy and business. You will work directly with their Chief Data Scientist and founding team to take ownership of developing a backend data infrastructure to both support the internal AI scientists and platform as well as external-facing partnerships. You will have the opportunity to work with cutting edge data tools and be part of a team working on the bleeding edge of machine learning and biology.

Role & Responsibilities

* Optimize and manage data and algorithm storage using commercially available cloud systems like AWS
* Work with a team consisting of computational biologists and machine learning scientists to make sure ingested data is easily accessible and useable for downstream machine learning methods
* Build efficient data ingress pipelines for both existing data assets and newly identified data sources
* Help set company's overall data strategy from developing technical infrastructure to building the data engineering team

Skills & Qualifications

* Experience building scalable data infrastructures using traditional, open source and cloud technologies
* Proficient working knowledge of relational and non-relational databases
* Experience building ETL pipelines and working knowledge of AWS
* Proficient knowledge of scripting languages, preferably Python

Bonus:

* Experience in Big Data Development including Hive, Hadoop, and/or SparkPast working
* Experience in a start-up environment
* Biology or healthcare experience

Benefits

* Comprehensive Healthcare, Dental, and Vision
* 25 days PTO
* 401K
* Flexible work environment