Lead Data Engineer (Tarrytown -NY) Day1 Onsite at Tarrytown, New York, USA |
Email: [email protected] |
http://bit.ly/4ey8w48 https://jobs.nvoids.com/job_details.jsp?id=456144&uid= From: James Smith, Rivago Infotech [email protected] Reply to: [email protected] Role: Onsite Lead Data Engineer Location: 777 Old Saw Mill River Rd, Tarrytown, NY 10591 (100% onsite role) Ideal to look for local profiles. Here are the key expectations from the tech perspective: Primary skills are PySpark, RedShift, Airflow, AWS Job Description: Candidate should have 15+ years of experience in Data Engineering Designing, creating, testing and maintaining the complete data management & processing systems. Working closely with the stakeholders & solution architect. Building highly scalable, robust & fault-tolerant systems. Knowledge of Hadoop ecosystem and different frameworks inside it HDFS, YARN, MapReduce, Apache Pig, Hive, Flume, Sqoop, ZooKeeper, Oozie, Impala and Kafka Must have knowledge and working experience in Real-time processing Framework (Apache Spark), PySpark and in AWS Redshift, Apache Airflow and EMR Must have experience on SQL-based technologies (e.g. MySQL/ Oracle DB) and NoSQL technologies (e.g. Cassandra and MongoDB) Should have Python/Scala/Java Programming skills Discovering data acquisitions opportunities Finding ways & methods to find value out of existing data. Improving data quality, reliability & efficiency of the individual components & the complete system. Problem solving mindset working in agile environment Keywords: database information technology New York http://bit.ly/4ey8w48 https://jobs.nvoids.com/job_details.jsp?id=456144&uid= |
[email protected] View All |
09:59 PM 27-Jul-23 |