

Senior Data Engineer
β - Featured Role | Apply direct with Data Freelance Hub
This role is for a Senior Data Engineer with expertise in Java, Apache Spark, and Cloudera, located in Austin, TX or Sunnyvale, CA (Hybrid). Contract length and pay rate are unspecified. Key skills include ETL pipeline optimization and experience with big data technologies.
π - Country
United States
π± - Currency
$ USD
-
π° - Day rate
-
ποΈ - Date discovered
May 21, 2025
π - Project duration
Unknown
-
ποΈ - Location type
Hybrid
-
π - Contract type
Unknown
-
π - Security clearance
Unknown
-
π - Location detailed
Austin, TX
-
π§ - Skills detailed
#Microservices #GIT #Distributed Computing #Security #Storage #Kafka (Apache Kafka) #Cloud #Data Ingestion #Hadoop #Business Analysis #Data Science #Data Engineering #Data Integration #Data Storage #"ETL (Extract #Transform #Load)" #Apache Spark #HBase #NiFi (Apache NiFi) #Databases #NoSQL #HDFS (Hadoop Distributed File System) #Java #Spark (Apache Spark) #Jenkins #PostgreSQL #Big Data #GitLab #SQL (Structured Query Language) #Batch #Spark SQL #Data Processing #Impala #HTTP & HTTPS (Hypertext Transfer Protocol & Hypertext Transfer Protocol Secure) #Scala #Deployment #Version Control #Sqoop (Apache Sqoop) #Data Lake #R #Cloudera #Data Integrity #BitBucket
Role description
Hi #connections,
We do have a Job Opening for,
Role-Senior Data engineer with Java, Spark & Cloudera
Locations: Austin, TX and Sunnyvale, CA (Hybrid)
Please share the suitable resumes to ram.r@vysystems.com & https://www.linkedin.com/in/ramkumarjhen/
Job Description:
We are seeking a Senior Java Spark Developer with expertise in Java, Apache Spark, and the Cloudera Hadoop Ecosystem to design and develop large-scale data processing applications. The ideal candidate will have strong hands-on experience in Java-based Spark development, distributed computing, and performance optimization for handling big data workloads.
Key Responsibilities:
β
Java & Spark Development:
-Develop, test, and deploy Java-based Apache Spark applications for large-scale data processing.
-Optimize and fine-tune Spark jobs for performance, scalability, and reliability.
-Implement Java-based microservices and APIs for data integration.
β
Big Data & Cloudera Ecosystem:
-Work with Cloudera Hadoop components such as HDFS, Hive, Impala, HBase, Kafka, and Sqoop.
-Design and implement high-performance data storage and retrieval solutions.
-Troubleshoot and resolve performance bottlenecks in Spark and Cloudera platforms.
β
Collaboration & Data Engineering:
-Collaborate with data scientists, business analysts, and developers to understand data requirements.
-Implement data integrity, accuracy, and security best practices across all data processing tasks.
-Work with Kafka, Flume, Oozie, and Nifi for real-time and batch data ingestion.
β
Software Development & Deployment:
-Implement version control (Git) and CI/CD pipelines (Jenkins, GitLab) for Spark applications.
-Deploy and maintain Spark applications in cloud or on-premises Cloudera environments.
Required Skills & Experience:
-Application development, with a strong background in Java and Big Data processing.
-Strong hands-on experience in Java, Apache Spark, and Spark SQL for distributed data processing.
-Proficiency in Cloudera Hadoop (CDH) components such as HDFS, Hive, Impala, HBase, Kafka, and Sqoop.
-Experience building and optimizing ETL pipelines for large-scale data workloads.
-Hands-on experience with SQL & NoSQL databases like HBase, Hive, and PostgreSQL.
-Strong knowledge of data warehousing concepts, dimensional modeling, and data lakes.
-Proven ability to troubleshoot and optimize Spark applications for high performance.
-Familiarity with version control tools (Git, Bitbucket) and CI/CD pipelines (Jenkins, GitLab).
-Exposure to real-time data streaming technologies like Kafka, Flume, Oozie, and Nifi.
-Strong problem-solving skills, attention to detail, and ability to work in a fast-paced environment.
Please attach the Updated Resume:
Please share the suitable resumes to ram.r@vysystems.com & https://www.linkedin.com/in/ramkumarjhen/
Kindly fill details,
1. Years of Exp----
1. Visa Status-----
1. Current Location--------
1. Linkedin ID-----
1. Share Updated Resume
Thanks & Regards.,
Ramkumar.R || Sr.Technical Recruiter
Email: ram.r@vysystems.com
Linkedin ID: https://www.linkedin.com/in/ramkumarjhen/
4701 Patrick Henry Drive Building 16 Santa Clara CA 95054, USA.
Hi #connections,
We do have a Job Opening for,
Role-Senior Data engineer with Java, Spark & Cloudera
Locations: Austin, TX and Sunnyvale, CA (Hybrid)
Please share the suitable resumes to ram.r@vysystems.com & https://www.linkedin.com/in/ramkumarjhen/
Job Description:
We are seeking a Senior Java Spark Developer with expertise in Java, Apache Spark, and the Cloudera Hadoop Ecosystem to design and develop large-scale data processing applications. The ideal candidate will have strong hands-on experience in Java-based Spark development, distributed computing, and performance optimization for handling big data workloads.
Key Responsibilities:
β
Java & Spark Development:
-Develop, test, and deploy Java-based Apache Spark applications for large-scale data processing.
-Optimize and fine-tune Spark jobs for performance, scalability, and reliability.
-Implement Java-based microservices and APIs for data integration.
β
Big Data & Cloudera Ecosystem:
-Work with Cloudera Hadoop components such as HDFS, Hive, Impala, HBase, Kafka, and Sqoop.
-Design and implement high-performance data storage and retrieval solutions.
-Troubleshoot and resolve performance bottlenecks in Spark and Cloudera platforms.
β
Collaboration & Data Engineering:
-Collaborate with data scientists, business analysts, and developers to understand data requirements.
-Implement data integrity, accuracy, and security best practices across all data processing tasks.
-Work with Kafka, Flume, Oozie, and Nifi for real-time and batch data ingestion.
β
Software Development & Deployment:
-Implement version control (Git) and CI/CD pipelines (Jenkins, GitLab) for Spark applications.
-Deploy and maintain Spark applications in cloud or on-premises Cloudera environments.
Required Skills & Experience:
-Application development, with a strong background in Java and Big Data processing.
-Strong hands-on experience in Java, Apache Spark, and Spark SQL for distributed data processing.
-Proficiency in Cloudera Hadoop (CDH) components such as HDFS, Hive, Impala, HBase, Kafka, and Sqoop.
-Experience building and optimizing ETL pipelines for large-scale data workloads.
-Hands-on experience with SQL & NoSQL databases like HBase, Hive, and PostgreSQL.
-Strong knowledge of data warehousing concepts, dimensional modeling, and data lakes.
-Proven ability to troubleshoot and optimize Spark applications for high performance.
-Familiarity with version control tools (Git, Bitbucket) and CI/CD pipelines (Jenkins, GitLab).
-Exposure to real-time data streaming technologies like Kafka, Flume, Oozie, and Nifi.
-Strong problem-solving skills, attention to detail, and ability to work in a fast-paced environment.
Please attach the Updated Resume:
Please share the suitable resumes to ram.r@vysystems.com & https://www.linkedin.com/in/ramkumarjhen/
Kindly fill details,
1. Years of Exp----
1. Visa Status-----
1. Current Location--------
1. Linkedin ID-----
1. Share Updated Resume
Thanks & Regards.,
Ramkumar.R || Sr.Technical Recruiter
Email: ram.r@vysystems.com
Linkedin ID: https://www.linkedin.com/in/ramkumarjhen/
4701 Patrick Henry Drive Building 16 Santa Clara CA 95054, USA.