

Rivago Infotech Inc
Data Engineer
⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is a Data Engineer position in New York, 100% onsite, with a contract length of "X months" and a pay rate of "$X/hour." Key skills required include Spark, Python, AWS, GCP, and experience with data warehousing solutions like Redshift or Snowflake.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
Unknown
-
🗓️ - Date
June 25, 2026
🕒 - Duration
Unknown
-
🏝️ - Location
On-site
-
📄 - Contract
Unknown
-
🔒 - Security
Unknown
-
📍 - Location detailed
New York, United States
-
🧠 - Skills detailed
#Deployment #Disaster Recovery #Project Management #Scala #Docker #Azure Virtual Machines #Lambda (AWS Lambda) #Migration #Python #Cloud #BigQuery #Data Engineering #Security #GIT #Data Pipeline #Snowflake #Amazon EMR (Amazon Elastic MapReduce) #Java #AWS Glue #GitLab #Compliance #Kafka (Apache Kafka) #AWS DevOps #Data Warehouse #Regression #Spark (Apache Spark) #"ETL (Extract #Transform #Load)" #Documentation #Dataflow #Jenkins #DynamoDB #Kubernetes #DevOps #AWS (Amazon Web Services) #Redshift #Agile #Amazon Redshift #Data Processing #Databases #Automated Testing #Data Privacy #Data Migration #Version Control #Scripting #AWS Lambda #VPC (Virtual Private Cloud) #NoSQL #Azure #Data Integration #Automation #AWS EMR (Amazon Elastic MapReduce) #AWS Kinesis #GCP (Google Cloud Platform) #dbt (data build tool)
Role description
Interview mode : ( 1 Video interview and CI will be in person)
Role : Data Enginee
rLocation : New York (100% onsite) (need local to NY
)Interview mode : ( 1 Video interview and CI will be in person
)Contrac
t
Candidates need to have colab set up ready with Gmail account so they can code on the L1 intervie
w
Must have
:Languages & Scripting: Spark, Python, Java, Scala, Hive, Kafka, SQ
LCloud Platforms: AW
SData Warehousing & Analytics: Redshift or Snowflake or Big Quer
yData Integration & ETL: AWS Glue, Aws EMR, Spark, Data Brick
sCI/CD: AWS Code Pipeline, Jenkins, CloudFormation, Docker, Kubernete
s
JD
• :Results-driven Data Engineer with a decade of expertise in Data engineering across cloud platforms with a total of 12 years in IT
• .Extensive experience utilizing Google Cloud Platform (GCP) services, including BigQuery, Dataflow, Dataprep, and Pub/Sub, for data engineering solutions
• .Proficient in building and managing GCP data pipelines with tools like Cloud Composer and Cloud Dataflow
• .Proven ability in developing and deploying applications on Google Kubernetes Engine (GKE)
• .Strong background in implementing security and compliance on GCP, ensuring data privacy and regulatory adherence
• .Track record of optimizing cost and resource usage within GCP environments
• .Skilled in AWS services such as Amazon EMR, Redshift, and Glue for efficient data processing
• .Expertise in architecting scalable, cost-effective solutions on AWS, with proficiency in configuring AWS Lambda for serverless computing
• .Adept at setting up AWS Kinesis streams to process real-time data, enhancing system responsiveness and data-driven decision-making
• .Proficient in leveraging AWS DynamoDB to create scalable, low-latency NoSQL databases for dynamic applications
• .Deep expertise in optimizing and managing Amazon Redshift data warehouses to deliver high-performance analytics and business insights
• .Experienced in integrating AWS services into CI/CD pipelines, streamlining automation for continuous integration, delivery, and deployment
• .Skilled in setting up and securing AWS Virtual Private Cloud (VPC) environments
• .Proficient in managing Azure virtual machines (VMs) for cloud infrastructure operations
• .Extensive experience managing on-premises data infrastructure, including data warehouses and databases
• .Familiar with AWS DevOps practices for continuous integration and deployment
• .Expertise in using Git for version control in DBT projects, ensuring proper tracking and documentation of data model changes
• .Skilled in performance optimization and tuning of on-premises data systems
• .Proficient in data migration strategies between on-premises and cloud environments
• .Strong troubleshooting skills in resolving issues within on-premises data systems
• .Proven ability to maintain high availability and disaster recovery solutions in on-premises environments
• .Experienced in implementing CI/CD pipelines using tools like Jenkins and GitLab CI/CD
• .Adept in automated testing processes, including unit, integration, and regression testing
• .Skilled in gathering and analyzing project requirements to ensure alignment with business goals
• .Experienced in Agile project management, contributing to successful outcomes through data-driven analytics and collaborative teamwor
k
Interview mode : ( 1 Video interview and CI will be in person)
Role : Data Enginee
rLocation : New York (100% onsite) (need local to NY
)Interview mode : ( 1 Video interview and CI will be in person
)Contrac
t
Candidates need to have colab set up ready with Gmail account so they can code on the L1 intervie
w
Must have
:Languages & Scripting: Spark, Python, Java, Scala, Hive, Kafka, SQ
LCloud Platforms: AW
SData Warehousing & Analytics: Redshift or Snowflake or Big Quer
yData Integration & ETL: AWS Glue, Aws EMR, Spark, Data Brick
sCI/CD: AWS Code Pipeline, Jenkins, CloudFormation, Docker, Kubernete
s
JD
• :Results-driven Data Engineer with a decade of expertise in Data engineering across cloud platforms with a total of 12 years in IT
• .Extensive experience utilizing Google Cloud Platform (GCP) services, including BigQuery, Dataflow, Dataprep, and Pub/Sub, for data engineering solutions
• .Proficient in building and managing GCP data pipelines with tools like Cloud Composer and Cloud Dataflow
• .Proven ability in developing and deploying applications on Google Kubernetes Engine (GKE)
• .Strong background in implementing security and compliance on GCP, ensuring data privacy and regulatory adherence
• .Track record of optimizing cost and resource usage within GCP environments
• .Skilled in AWS services such as Amazon EMR, Redshift, and Glue for efficient data processing
• .Expertise in architecting scalable, cost-effective solutions on AWS, with proficiency in configuring AWS Lambda for serverless computing
• .Adept at setting up AWS Kinesis streams to process real-time data, enhancing system responsiveness and data-driven decision-making
• .Proficient in leveraging AWS DynamoDB to create scalable, low-latency NoSQL databases for dynamic applications
• .Deep expertise in optimizing and managing Amazon Redshift data warehouses to deliver high-performance analytics and business insights
• .Experienced in integrating AWS services into CI/CD pipelines, streamlining automation for continuous integration, delivery, and deployment
• .Skilled in setting up and securing AWS Virtual Private Cloud (VPC) environments
• .Proficient in managing Azure virtual machines (VMs) for cloud infrastructure operations
• .Extensive experience managing on-premises data infrastructure, including data warehouses and databases
• .Familiar with AWS DevOps practices for continuous integration and deployment
• .Expertise in using Git for version control in DBT projects, ensuring proper tracking and documentation of data model changes
• .Skilled in performance optimization and tuning of on-premises data systems
• .Proficient in data migration strategies between on-premises and cloud environments
• .Strong troubleshooting skills in resolving issues within on-premises data systems
• .Proven ability to maintain high availability and disaster recovery solutions in on-premises environments
• .Experienced in implementing CI/CD pipelines using tools like Jenkins and GitLab CI/CD
• .Adept in automated testing processes, including unit, integration, and regression testing
• .Skilled in gathering and analyzing project requirements to ensure alignment with business goals
• .Experienced in Agile project management, contributing to successful outcomes through data-driven analytics and collaborative teamwor
k






