GuruSchools LLC

Data Engineer

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Data Engineer in New York, 100% onsite, with a contract length of "unknown" and a pay rate of $70/hr. Requires expertise in Spark, Python, AWS, data warehousing (Redshift/Snowflake), and CI/CD tools.
🌎 - Country
United States
πŸ’± - Currency
$ USD
-
πŸ’° - Day rate
560
-
πŸ—“οΈ - Date
May 21, 2026
πŸ•’ - Duration
Unknown
-
🏝️ - Location
On-site
-
πŸ“„ - Contract
Unknown
-
πŸ”’ - Security
Unknown
-
πŸ“ - Location detailed
New York, NY
-
🧠 - Skills detailed
#Compliance #VPC (Virtual Private Cloud) #Data Bricks #Cloud #Migration #Regression #AWS Kinesis #"ETL (Extract #Transform #Load)" #SQL (Structured Query Language) #Data Integration #AWS (Amazon Web Services) #GitLab #GCP (Google Cloud Platform) #Documentation #Automation #Kafka (Apache Kafka) #Data Pipeline #Snowflake #Scripting #DynamoDB #Security #Dataflow #AWS Glue #DevOps #Data Privacy #Spark (Apache Spark) #Data Migration #Project Management #Python #Deployment #Java #Jenkins #Kubernetes #Automated Testing #NoSQL #Azure #Agile #Data Engineering #Azure Virtual Machines #Data Processing #AWS EMR (Amazon Elastic MapReduce) #Amazon EMR (Amazon Elastic MapReduce) #AWS DevOps #dbt (data build tool) #Redshift #Scala #Disaster Recovery #Data Warehouse #Version Control #Databases #Lambda (AWS Lambda) #Amazon Redshift #AWS Lambda #GIT #BigQuery #Docker
Role description
Role: Data Engineer Location: New York (100% onsite) (Need local to NY) Interview mode: (1 Video interview and Client will be in person) Rate: $70/hr. on C2C Candidates need to have colab set up ready with Gmail account so they can code on the L1 interview Must have: Languages & Scripting:Β Spark, Python, Java, Scala, Hive, Kafka, SQL Cloud Platforms:Β AWS Data Warehousing & Analytics:Β Redshift or Snowflake or Big Query Data Integration & ETL:Β AWS Glue, Aws EMR, Spark, Data Bricks CI/CD:Β AWS Code Pipeline, Jenkins, CloudFormation, Docker, Kubernetes Job Description: β€’ Results-driven Data Engineer with a decade of expertise in Data engineering across cloud platforms with a total of 12 years in IT. β€’ Extensive experience utilizing Google Cloud Platform (GCP) services, including BigQuery, Dataflow, Dataprep, and Pub/Sub, for data engineering solutions. β€’ Proficient in building and managing GCP data pipelines with tools like Cloud Composer and Cloud Dataflow. β€’ Proven ability in developing and deploying applications on Google Kubernetes Engine (GKE). β€’ Strong background in implementing security and compliance on GCP, ensuring data privacy and regulatory adherence. β€’ Track record of optimizing cost and resource usage within GCP environments. β€’ Skilled in AWS services such as Amazon EMR, Redshift, and Glue for efficient data processing. β€’ Expertise in architecting scalable, cost-effective solutions on AWS, with proficiency in configuring AWS Lambda for serverless computing. β€’ Adept at setting up AWS Kinesis streams to process real-time data, enhancing system responsiveness and data-driven decision-making. β€’ Proficient in leveraging AWS DynamoDB to create scalable, low-latency NoSQL databases for dynamic applications. β€’ Deep expertise in optimizing and managing Amazon Redshift data warehouses to deliver high-performance analytics and business insights. β€’ Experienced in integrating AWS services into CI/CD pipelines, streamlining automation for continuous integration, delivery, and deployment. β€’ Skilled in setting up and securing AWS Virtual Private Cloud (VPC) environments. β€’ Proficient in managing Azure virtual machines (VMs) for cloud infrastructure operations. β€’ Extensive experience managing on-premises data infrastructure, including data warehouses and databases. β€’ Familiar with AWS DevOps practices for continuous integration and deployment. β€’ Expertise in using Git for version control in DBT projects, ensuring proper tracking and documentation of data model changes. β€’ Skilled in performance optimization and tuning of on-premises data systems. β€’ Proficient in data migration strategies between on-premises and cloud environments. β€’ Strong troubleshooting skills in resolving issues within on-premises data systems. β€’ Proven ability to maintain high availability and disaster recovery solutions in on-premises environments. β€’ Experienced in implementing CI/CD pipelines using tools like Jenkins and GitLab CI/CD. β€’ Adept in automated testing processes, including unit, integration, and regression testing. β€’ Skilled in gathering and analyzing project requirements to ensure alignment with business goals. β€’ Experienced in Agile project management, contributing to successful outcomes through data-driven analytics and collaborative teamwork