Data Engineer
Founded in 1913 in Italy, the Prada Group was built on a tradition of excellence and with a vision of innovation. The Group, a world leader in the luxury sector, operates in more than 45 countries with the PRADA, Miu Miu, Church’s and Car Shoe brands, and has employees of over 100 nationalities.
The acquisition of Pasticceria Marchesi 1824 marked the Group’s entry into the food sector, where it applies the same high quality criteria. Also very active in the art field, the Prada Group strengthens its presence through constantly evolving contemporary art projects.
Joining our Company means working in a creative and international environment, with teams of people motivated by curiosity and the quest for excellence. The engine of our success is the importance and value that we place on the talent and passion of our people, which leads to their own professional growth.
JOB PURPOSE
The Data Engineer will contribute to supervising our corporate analytics and data science platform. The ideal candidate will join the Cloud Data Platform team, based in Milan, which addresses data operations and modeling.
RESPONSIBILITIES
- Design and implement data operations and pipelines, modeling and integration, working with global stakeholders and end users
- Develop, deploy and govern highly efficient ETL and data curation processes using big data and cloud technologies and platforms (e.g., Azure, Databricks, Git)
- Model and govern a core of key data sets, making them available, certified, reliable and easy to access and integrate for other enterprise users, components and systems (e.g., normalized E/R models enriched by data catalogues/glossaries)
- Maintain and optimize existing data pipelines that integrate heterogeneous data from either internal or external sources (e.g., maintaining end-to-end data lineage)
- Maintain a unified historical record of data, in order to provide coherent time navigation of the data model over the long term
- Manage and maintain data access through role-based and group-based access control
KNOWLEDGE AND SKILLS
Excellent experience in processes related to data preparation/integration/modeling and governance.
Solid experience in designing and managing end-to-end data pipelines.
Deep knowledge of the SQL programming language.
Solid experience in DBMS/Data warehouse modeling, optimization and management.
Deep knowledge of Slowly Changing Dimension techniques.
Experience with the main big data technologies and tools (e.g., Spark, MLflow, Azure ML, Hive, Docker, Kubernetes).
Deep knowledge of developing microservices and APIs (e.g., using REST, GraphQL).
Advanced experience in enterprise development frameworks and programming languages (e.g., PySpark, Scala, Python, Java).
Knowledge of the use and configuration of one or more cloud infrastructures and platforms, with particular regard to Microsoft Azure.
Knowledge of and experience in stream-processing systems and lambda architectures (e.g., Azure Event Hubs, Spark Streaming, Kafka).
Expertise in developing and integrating advanced analytics and machine learning tools and libraries (e.g., PyTorch, Keras, TensorFlow).
Bachelor's or Master's degree in Software Engineering, Computer Science, Mathematics or a related STEM discipline.
3+ years of experience in intensive data operations in the context of big data and cloud infrastructures/platforms.
Fluent in written and spoken Italian and English.
Organizational and self-management skills during peak workloads.
Proactive self-starter with the ability to manage multiple initiatives at once.
Good knowledge of retail and/or logistics processes and back-end operational systems will be considered a plus.