Senior Data Engineer (GCP)
About role:
A Data Engineer’s role involves crafting, constructing, and upholding the structure, tools, and procedures essential for an organization to gather, store, modify, and scrutinize extensive data amounts. This position involves creating data platforms using typically provided infrastructure and establishing a clear path for Analytics Engineers who utilize the system.
Responsibilities:
- Working alongside Platform Engineers to assess and choose suitable technologies and tools for the project
- R&D, maintenance, and monitoring of the platform’s components
- Implementing intricate data intake procedures
- Constructing efficient data models
- Implementing and executing policies aligned to the strategic plans of the company concerning used technologies, work organization, etc.
- Ensuring compliance with industry standards and regulations in terms of security and data privacy applied in the data processing layer
- Providing training and fostering knowledge-sharing
Job requirements:
- Proficiency in a programming language like Python and SQL
- Knowledge of the BigQuery DWH platform
- Working with Spark messaging systems
- Experience as a programmer and knowledge of software engineering, good principles, practices, and solutions
- Familiarity with cloud Google Cloud Platform (GCP) – a minimum of several years’ experience required (must have)
- Knowledge of at least one orchestration and scheduling tool, for example, Airflow, Prefect, Dragster, etc.
- Familiarity with DevOps area and tools – GKE, Docker
- Experience with Version Control System, preferably GIT
- Ability to actively participate/lead discussions with clients to identify and assess concrete and ambitious avenues for improvement
Client:
Our company is a prominent player in the data industry, serving global clients and delivering cutting-edge projects in areas such as Data, AI, Cloud, Analytics, Machine Learning, Large Language Models, and Generative AI.
We offer a wide range of projects where our experts can thrive. These include Advanced Analytics, Data Platforms, Streaming Analytics Platforms, Machine Learning Models, Generative AI, and more. We enjoy utilizing leading technologies and open-source solutions for Data, AI, and Machine Learning.