Excellent hands-on experience in AWS Data technologies Glue, S3, Athena, EMR, IAM etc.
• Mandatory - Hands on experience in Python and PySpark.
Python as a language is practically usable for anything, we are looking for application Development and Extract/Transform/Load and Datalake curation experience using Python.
• Hands on experience in version control tools like Git.
• Worked on Amazon’s Analytics services like Amazon Athena, DynamoDB and AWS Glue • Worked on Amazon’s Compute services like Amazon Lambda, Amazon EC2 and Amazon’s Storage service like S3 and few other services.
• Experience/knowledge of bash/shell scripting will be a plus.
• Has built ETL processes to take data, copy it, structurally transform it etc.
involving a wide variety of formats like CSV, fixed width, XML and JSON.
• Have worked with columnar storage formats- Parquet, Avro, and ORC etc.
• Hands on experience in tools like Jenkins to build, test and deploy the applications.
• Excellent debugging skills.
• Ability to quickly perform critical analysis and use creative approaches for solving complex problems.
• Strong academic background.
• Excellent written and verbal communication skills, and strong relationship building skills'