Senior Data Science Engineer
dunnhumby
- Gurgaon, Haryana
- Permanent
- Full-time
- Develop and maintain engineering tools/products needed for simple and efficient data science development
- Analyse complex data pipelines to identify performance bottlenecks, and suggest robust ways to optimize the workload in reasonable costs
- Work with the infra team to ensure robust, performant and scalable platform is available for data science development work
- Proficiency in Hadoop, PySpark, Pandas, NumPy and python version > 3.5
- Good Working experience of Partitions, Joins, cache, HDFS, handling data
- Experience in developing web applications using modern frontend and backend technologies, e.g Node Js, React, Dash, Django, Voila, Streamlit
- Good knowledge of Airflow or any other data process orchestration tools like Nifi , Lungi
- Good knowledge of SQL
- Proficiency in shell scripting and has working knowledge of devops/dataops
- Proficiency in web application development and application integration (REST APIs, Web services)
- Experience in re-engineering, automating and productionising code
- Should have experience in handling/optimising various file formats like parquet, Avro etc
- Should have good experience in working with Hadoop Ecosystem components (e.g. YARN, Spark UI, Hive, HDFS etc.) and cloud equivalents possibly in GCP