Randstadeos
Data engineer
POSITION DESCRIPTION & BUSINESS PRIORITIES
Job contents (responsibilities, tasks, amount of travel)
Job Title: Analyst
Job Location: Bangalore
In this role, you will be part of a growing, global team of data engineers, who collaborate in DevOps mode, in order to enable Merck's Life Science business with state-of-the-art technology to leverage data as an asset and to take better informed decisions.
The Life Science Data Engineering Team is responsible for designing, developing, testing, and supporting automated end-to-end data pipelines and applications on Life Science’s data management and analytics platform (Palantir Foundry, AWS and other components).
The Foundry platform comprises multiple different technology stacks, which are hosted on Amazon Web Services (AWS) infrastructure. Developing pipelines and applications on Foundry requires:
· Proficiency in SQL / Java / Python (Python required; all 3 not necessary)
· Proficiency in PySpark for distributed computation
· Familiarity with Postgres and Elasticsearch
· Familiarity with HTML, CSS, and JavaScript and basic design/visual competency
· Familiarity with common databases (e.g. JDBC, MySQL, Microsoft SQL). Not all types required.
· Familiarity with any cloud infrastructure/tools with respect to data engineering
This position will be project based and may work across multiple smaller projects or a single large project utilizing an agile project methodology.
Roles & Responsibilities:
· Develop data pipelines by ingesting various data sources – structured and un-structured – into Palantir Foundry
· Participate in end-to-end project lifecycle, from requirements analysis to go-live and operations of an application
· Acts as business analyst for developing requirements for Foundry pipelines
· Review code developed by other data engineers and check against platform-specific standards, cross-cutting concerns, coding and configuration standards and functional specification of the pipeline
· Document technical work in a professional and transparent way. Create high quality technical documentation
· Work out the best possible balance between technical feasibility and business requirements (the latter can be quite strict)
· Deploy applications on Foundry platform infrastructure with clearly defined checks
· Implementation of changes and bug fixes via Merck's change management framework and according to system engineering practices (additional training will be provided)
· DevOps project setup following Agile principles (e.g. Scrum)
· Besides working on projects, act as third level support for critical applications; analyze and resolve complex incidents/problems. Debug problems across a full stack of Foundry and code based on Python with Spark
· Work closely with business users, data scientists/analysts to design physical data models
Education
· B.Sc. (or higher) degree in Computer Science, Engineering, Mathematics, or related fields
Professional Experience
· 5+ years of experience in system engineering or software development
· 3+ years of experience in data and analytics.
0-2 years of experience for intern in data analytics