Build and manage the data asset using some of the most scalable and resilient open source big data technologies like Airflow, Spark, Kafka, etc
Build and manage a highly scalable, efficient Data and ML Infrastructure by adopting microservices driven design and architecture with proper DevOps principles and practices
Design and deliver the next-gen data lifecycle management suite of tools/frameworks, including ingestion and consumption on the top of the data lake to support real-time as well as batch use cases
Help the team in integrating various data sources across GFGs group vertical
Build and expose metadata catalog for the Data Lake for easy exploration, profiling as well as lineage requirements
JOB REQUIREMENT
At least 3+ years of relevant experience in developing scalable, secured, fault tolerant, resilient mission-critical Big Data platform.
Ab...
★ Ready to Start Your European Career?
Take the next step and apply for this exciting opportunity