Genentech a Roche Group of companies company logo

Data Engineer

 

Data Engineer

Research Informatics             ·             South San Francisco, California, United States of America        ·         Full time

 

Data Engineer

  • Location: South San Francisco, California, United States of America 
  • Full time

APPLY NOW 

 

The Position

Description

If you are a Big Data engineer and want to work on something that truly can change the world, this job is for you.  Biology is approaching an inflection where we can directly leverage data to understand the cellular basis of human diseases and from this generate therapeutics that can treat these diseases.  Our Translational Genomics initiative is spearheading this effort and bringing together data from human genetics, functional genomics, molecular biology, disease model engineering, and tissue and cellular profiling.   We need a Data Engineering Lead to help us create a next-generation data engine that scalably and rigorously ingests and transforms data generated from this initiative so they are ready for machine-driven analysis.    The Data Engineering Lead will act as an architect and engineering manager tasked to oversee the construction and operation of this data engine. This data engine will be used to help assemble an exabyte scale connected and computable data universe composed of high-value internally and externally generated data and results that we can build our data science efforts on top of.  Your efforts will therefore directly enable computational discovery of disease targets and from these potentially life-saving therapies.  

 

A person hired in this position will

  • Work on a team that will architect and deliver a next-generation data engine that enables scalable, flexible, and rigorous data transformations using modern data management practices.  
  • Help build and deliver data infrastructure that will enable machines to crawl and compute on and across all our data.  
  • Work with a cross-functional team of scientists and engineers to design and deliver these solutions.  
  • Collaborate across the informatics organization via presentations and collaborations.

Successful candidates will meet many of the following requirements

Must-have requirements
  • You have a BS in a computational discipline with 8 years of work experience or a Masters with 5 years of experience.  
  • 7+ years experience architecting and developing scalable pipelines, frameworks and platforms to power data science efforts in distributed cloud environments, 5 of which are on AWS.  
  • Multiple years of experience working on teams to software to deliver solutions.  
  • Exceptional communication skills.
 
Nice to haves:
  • Practical understanding of the data management practices required to power rigorous data science and enable advanced analytics like AI & ML. 
  • Hands-on experience working with the following technologies, frameworks, and languages:  Java, Scala, Python, Spark, Airflow, RabbitMQ, Spring (nice to have).
  • Experience working on projects focused on omics data (nice to have)

 

What to expect from us

  • A highly collaborative and dynamic research environment where we aim to advance the rate of scientific discovery using purposefully built solutions.
  • Access to large multimodal omic datasets focused on disease biology, samples and compute resources.
  • Access to state-of-the-art technologies and pioneering research.
  • Participation in seminar series featuring academic and industry scientists.
  • Campus-like lifestyle with a healthy work-life balance.
  • Mentored opportunities to further develop professional skills.

 

Who we are 

A member of the Roche Group, Genentech has been at the forefront of the biotechnology industry for more than 40 years, using human genetic information to develop novel medicines for serious and life-threatening diseases. We are a research-driven biotechnology company, whose medical innovations for cancer and other serious illnesses make a difference for patients across the globe. Please take this opportunity to learn about Genentech where we believe that our employees are our most important asset & are dedicated to remaining a great place to work.

Genentech is an equal opportunity employer & prohibits unlawful discrimination based on race, color, religion, gender, sexual orientation, gender identity/expression, national origin/ancestry, age, disability, marital & veteran status. For more information about equal employment opportunities, visit our Genentech Careers page. The expected salary range for this position based on the primary location of California is $130,100 – 241,500.  Actual pay will be determined based on experience, qualifications, geographic location, and other job-related factors permitted by law.  A discretionary annual bonus may be available based on individual and Company performance.  This position also qualifies for the benefits detailed at the link provided below.

 

Benefits

#gCS

 

Genentech is an equal opportunity employer, and we embrace the increasingly diverse world around us. Genentech prohibits unlawful discrimination based on race, color, religion, gender, sexual orientation, gender identity or expression, national origin or ancestry, age, disability, marital status and veteran status.

 

Job Facts

  • Job Sub Category: Research Informatics
  • Schedule: Full time
  • Job Type: Regular
  • Posted Date: Jan 29th 2024
  • Job ID: 202212-143670

 

Please click here to apply.

 

No Comments

Sorry, the comment form is closed at this time.