1. Deep understanding of Hadoop ecosystem technologies.
2. Strong proficiency in PySpark for data processing and analytics.
3. In-depth knowledge of Hive for data warehousing and querying.
4. Experience with Livy for interactive job submission.
5. Proficiency in shell scripting for automation and orchestration.
6. Hands-on experience with AWS SageMaker for machine learning model development and deployment.
7. Excellent communication and interpersonal skills, with the ability to collaborate with cross-functional teams.
8. Strong problem-solving skills.
9. Ability to work independently and effectively prioritize tasks in a fast-paced environment.