Data Engineer

Department: Data Science
Employment Type: Full-Time

Blackfynn is looking for a talented Data Engineer to join us in our mission to optimize therapeutics for Parkinson’s and neurodegeneration.




Blackfynn is a new kind of therapeutics company: one using high-quality clinical data, data modeling and analytics - first and always - to drive every aspect of drug development. We focus on optimizing clinical-stage assets for the near-term benefit of patients with Parkinson’s disease, with the aim of expanding into related neurodegenerative disease indications. It is our mission to improve the lives of everyone living with Parkinson’s and other neurodegenerative diseases across the globe, and to use our technology to democratize access to studies and medicines regardless of race, gender, socioeconomic status or geography.




  • Improve the core infrastructure and services that facilitate the acquisition, transformation, storage and retrieval of complex neurological datasets
  • Work with data scientists to produce fast and flexible computational infrastructure to support research projects
  • Develop the methods and tools for tracking data changes, mapping schemas across disparate datasets, and integrating analytic outputs
  • Create libraries, microservices, and infrastructure to streamline the development of data visualization dashboards and data analysis reports




To be successful as a Data Engineer at Blackfynn, you'll need a BS or MS in computer science, bioinformatics, or a related subject, with industry experience in biotech, pharma, or healthcare, preferably in neurology or directly with clinical trial data. You’ll also need to be:


  • Flexible, thriving in the face of ambiguity and finding appropriate solutions
  • Able to develop systematic, structured, and pragmatic approaches to tasks, with attention to detail
  • Proficient in Python and SQL
  • Proficient in working with and analyzing tabular data
  • Comfortable on the Unix command line
  • Comfortable with Docker
  • Experienced in developing, monitoring, and testing automated data pipelines
  • Experienced with one or more distributed data processing or computing systems (Spark, Hadoop, Kubernetes, etc.)
  • Experienced in handling PHI and ensuring HIPAA compliance
  • Familiar with Pachyderm, Scala, and Terraform
  • Experienced in building and maintaining RESTful APIs




  • Competitive salaries and equity participation – we all share in Blackfynn’s success
  • Generous vacation benefits – we want you to work hard but also rest and recharge
  • Excellent health benefits – at the end of the day, we are here to improve healthcare
  • Opportunity to work with smart people, solve complex problems, and make an impact in the world 




Each of us has been affected by Parkinson’s, dementia or another disease of the brain. We are the children, partners and friends of people living with neurodegenerative diseases. Sometimes, we are patients ourselves. Current drugs are marginally effective and can often make symptoms worse. New drugs keep failing in clinical trials. At Blackfynn, we do things differently. By applying a systematic approach that combines data, technology and deep domain expertise, we are developing a pipeline of targeted, clinical-stage therapeutics to improve the lives of those living with Parkinson’s and other neurodegenerative diseases.

Note: As a remote-first organization, the majority of our roles can be performed from anywhere in the United States. We encourage applicants from coast to coast to apply to our open positions. That said, our Philadelphia office will remain operational and accessible for anyone to use on a voluntary basis.
