Senior Data Engineer
Genentech (South San Francisco, CA)
The Position
A healthier future. It’s what drives us to innovate. To continuously advance science and ensure everyone has access to the healthcare they need today and for generations to come. Creating a world where we all have more time with the people we love. That’s what makes us Roche.
Advances in AI, data and computational sciences are transforming drug discovery and development. Roche’s Research and Early Development organizations at Genentech (gRED) and Pharma (pRED) have demonstrated how these technologies accelerate R&D, leveraging data and novel computational models to drive impact. Seamless data sharing and access to models across gRED and pRED are essential to maximizing these opportunities. The Computational Sciences Center of Excellence (CS CoE) is a strategic, unified group whose goal is to harness the transformative power of data and Artificial Intelligence (AI) to assist our scientists in both pRED and gRED to deliver more innovative and life-changing medicines for patients worldwide.
Within the CS CoE organization, the Data and Digital Catalyst (DDC) organization leads the modernization of our computational and data ecosystems by integrating digital technologies across Research and Early Development to empower stakeholders, advance data-driven science and accelerate decision-making.
The Solutions team within the DDC Organization develops modernized and interconnected computational and data ecosystems. The Data Ecosystem is foundational to building solutions that accelerate the work done by our Computational and Bench Scientists and enable ML/AI tool creation and adoption. Our team specializes in building Data Pipelines and Applications for data acquisition, collection, storage, transformation, linkage and sharing.
The Opportunity:
+ Design and implement Data Engineering solutions that advance the Catalyst organization’s Data Fabric strategy.
+ Partner with product managers and scientists to understand user needs, shape requirements, and translate them into actionable technical specifications.
+ Develop and maintain systems for collecting, structuring, and storing diverse scientific data that support advanced analytics, machine learning, and other data-driven initiatives.
+ Deliver data flows and pipelines across gCS, Research Biology, Drug Discovery, Translational Medicine, Development Sciences, and beyond.
+ Contribute to architectural decisions, code reviews, and the evolution of our development processes with a focus on engineering best practices.
+ Adopt key trends and technologies with an Open Source, Cloud-first, API-first, and AI-first approach.
+ Adopt a culture of impact, scientific excellence, continuous learning, collaboration, and curiosity.
Who You Are:
+ Bachelor’s or Master’s degree in Computer Science or a similar technical field, or equivalent experience, plus 4+ years of professional experience in data engineering or related roles.
+ Strong proficiency in a high-level programming language such as Python or Java, strong proficiency in SQL, and experience with NoSQL databases, data warehouses, or data lakes.
+ Experience building end-to-end pipelines with AWS Glue, AWS Lambda, or similar services for ETL/ELT and serverless data processing workflows.
+ Experience building and operationalizing a large-volume data processing platform, and experience working on cloud-native architectures in public clouds (ideally AWS).
+ Proven understanding and application of engineering best practices.
+ Excellent communication skills and ability to build trusted partnerships with internal and external collaborators.
+ Ability to quickly learn new technologies and programming languages, and a passion for continuous learning.
Preferred But Not Required:
+ Experience with biological data and processes is a strong plus.
+ Experience working with scientists or in a research environment is advantageous.
Onsite presence, on our South San Francisco campus, is expected for at least 3 days a week.
Relocation benefits are available for this job posting.
The expected hiring range for this position, based on the primary location of California, is $142,500 - $264,700. Actual pay will be determined based on experience, qualifications, geographic location, and other job-related factors permitted by law. A discretionary annual bonus may be available based on individual and Company performance. This position also qualifies for the benefits detailed at the link provided below.
Benefits (https://roche.ehr.com/default.ashx?CLASSNAME=splash)
\#LI-JD1
\#ComputationCoE
Genentech is an equal opportunity employer. It is our policy and practice to employ, promote, and otherwise treat any and all employees and applicants on the basis of merit, qualifications, and competence. The company's policy prohibits unlawful discrimination, including, but not limited to, discrimination on the basis of protected veteran status or disability status, consistent with all federal, state, or local laws.
If you have a disability and need an accommodation in relation to the online application process, please contact us by completing the Accommodations for Applicants form (https://docs.google.com/forms/d/e/1FAIpQLSdZWlsbfQOvFVIQgHE\_iDzWUTlhZvj6FytIzjS7xq6IGh1H5g/viewform).