Senior Principal Data Engineer

Posted 18 Oct 2023

Bangalore, Karnataka - India

Req Id 264680


Work Your Magic with us!

Ready to explore, break barriers, and discover more? We know you’ve got big plans – so do we! Our colleagues across the globe love innovating with science and technology to enrich people’s lives with our solutions in Healthcare, Life Science, and Electronics. Together, we dream big and are passionate about caring for our rich mix of people, customers, patients, and planet. That's why we are always looking for curious minds that see themselves imagining the unimaginable with us.


Everything we do in Electronics is to help us deliver on our purpose of being the company behind the companies, advancing digital living. We are dedicated to being the trusted supplier of high-tech materials, services and specialty chemicals for the electronics, automotive and cosmetics industries. We foster a global collaborative organization made up of individuals who have the passion to win, obsess about the customer, are relentlessly curious and act with urgency. Together, we push the boundaries of science to make more possible for our customers.


Healthcare R&D Data Sciences seeks a data engineer who has substantial experience with biological/biomedical data types to support data analysis activities, build our FAIR data environment, and streamline our data handling.  The candidate will play a pioneering role in establishing this new group and a leading role in building tools and setting standards for biological data. 

‘Omics Data Engineers will work on clinical, pre-clinical, and public datasets but their focus is biological data types (e.g., genetics, proteomics, gene expression- collectively “omics data”) and medical data.  Thus, experience with these datatypes is mandatory.


While the group will be responsible for routine data engineering tasks, its success will be measured by the extent to which activities can be automated and attention focused on our data strategy.  Data engineers will collaborate daily with scientists at company's other R&D hubs and across departments. 

Your role:

Specific responsibilities for data engineers include

      • collaborating with data scientists on data ingestion and query optimization
      • developing and implementing standards for data formats, processing, and documentation to promote wider reuse of data assets
      • building tools to automate data transfers and QC
      • collaborating across functions to design and build an R&D-wide data catalog
      • training internal users on data standards, data formats, and data assets
      • arranging and managing data transfers with CROs and data providers

Who you are:

Core qualifications for ‘Omics Data Engineers include experience with

      • data engineering, data management, and cleaning in biology or biomedical domains
      • AWS tech stack, especially S3, EC2, EMR, Glue, and Lambda. 
      • Python, SQL, and tools such as Spark and PySpark
      • controlled vocabularies, ontologies, and data standards
      • at least one area of biological or medical data

And ideal candidate will also have experience with

      • R programming and RShiny
      • multiple biological and medical data types
      • experience in pharmaceutical R&D

What we offer: We are curious minds that come from a broad range of backgrounds, perspectives, and life experiences. We celebrate all dimensions of diversity and believe that it drives excellence and innovation, strengthening our ability to lead in science and technology. We are committed to creating access and opportunities for all to develop and grow at your own pace. Join us in building a culture of inclusion and belonging that impacts millions and empowers everyone to work their magic and champion human progress!
Apply now and become a part of our diverse team!

Apply Now