About the job Cloud Data Platform Engineer Cloud Data Platform Engineer needs 5+ years implementing data applications or data platforms with BigData/Hadoop, Python/Java/Spark full stack, etc. 5+ year support experience of Big Cloud Data Platform Engineer requires:
- 3 days a week, hybrid
- Locations: Iselin, NJ, Charlotte, NC, N Y, NY
- Data Engineering 60-70 %.
- Infrastructure 30-40%. (Moving Data in & Out of cloud)
- Snowflake exp is preferred
- Masters or Bachelor's degree in computer science or related discipline
- Manage and ensure delivery of project tasks as required
- Excellent documentation skills to create and manage design, implementation and automation related documentation
- Implementation experience for Hadoop distribution platforms like Cloudera or AWS EMR.
- Extensive experience in designing, engineering and managing data lake ingestion, validation, transformation and consumption services leveraging cloud data tools like Hive, Spark, EMR, Glue ETL and Catalog, Snowflake etc.
- Experience on implementing solutions with Worflow orchestration/scheduling tools such as Airflow, Autosys etc.
- Experience building CI/CD pipelines for infrastructure implementations with Teraform, Gitlab etc.
- Must be ability to multi-task and prioritize tasks for self
- Must experience working in agile environment with multiple priorities
Preferred Skills:
- 5+ years implementing data applications or data platforms with BigData/Hadoop, Python/Java/Spark full stack, etc.
- 5+ year support experience of Big Data technologies in Hadoop ecosystem Hive, HDFS, MapReduce, Spark, Yarn, Kafka, Pig, HBase, Sqoop, Elastic Search, Kerberos.
- 5+ years experience with ETL/ELT tools such as AWS Glue ETL, Talend, Datastage etc.
- 5+ years experience with Orchestration tools such as Airflow, Autosys etc.
- Experience with Jira and Agile methodology is a plus;
- Experience with understanding DevOps or CI/CD process w.r.t to infrastructure management activities
Cloud Data Platform Engineer duties:
- Design, develop and deliver cloud datastore solutions and develop automation pipelines to migrate data sets from On-prem to Cloud platforms. Practice Infrastructure as code to develop automation routines and integration flows to manage state of the datastore platform systems
- Provision secures from start datastores and enable them with required security controls including encryption, masking, certificate/keys rotation etc.
- Collaborates with developers, analysts, various system administrators to identify business requirements in designing efficient datastore solutions and interfaces.
- Identifies and documents all system constraints, implications, and consequences of various proposed system changes.
- Reviews technical documentation to guide system users and to assist with the ongoing operation, maintenance, and development of the system. Evaluates the efficiency and effectiveness of application operations and troubleshooting problems.
- Provide expert level IT technical lead services, including the direction, evaluation, selection, configuration, implementation, and integration of new and existing technologies and tools in a cloud platform.