

RIIM
Big Data Developer (Scala & Java)
β - Featured Role | Apply direct with Data Freelance Hub
This role is for a Big Data Developer (Scala & Java) with a contract length of "unknown" and a pay rate of "unknown". Key skills include expertise in Scala, Java, Apache Spark, Hadoop, Kafka, and cloud platforms. Experience in finance, healthcare, telecom, or e-commerce is preferred.
π - Country
United States
π± - Currency
$ USD
-
π° - Day rate
Unknown
-
ποΈ - Date
May 22, 2026
π - Duration
Unknown
-
ποΈ - Location
Unknown
-
π - Contract
Unknown
-
π - Security
Unknown
-
π - Location detailed
Alpharetta, GA
-
π§ - Skills detailed
#Azure #Databricks #Data Engineering #Apache Spark #GCP (Google Cloud Platform) #Docker #Code Reviews #GIT #NoSQL #Scala #Hadoop #HBase #Kafka (Apache Kafka) #"ETL (Extract #Transform #Load)" #Programming #AWS (Amazon Web Services) #Cloud #Airflow #Big Data #Spark (Apache Spark) #Jenkins #Microservices #MongoDB #ML (Machine Learning) #Java #Databases #Data Processing #Python #Terraform #Data Architecture #Data Pipeline #Scrum #Infrastructure as Code (IaC) #AI (Artificial Intelligence) #Data Integration #Snowflake #Delta Lake #Kubernetes #Computer Science #Data Modeling #Distributed Computing #Agile #SQL (Structured Query Language)
Role description
Big Data Developer (Scala & Java) β Job Description
We are looking for an experienced Big Data Developer with strong Scala and Java expertise to build scalable data processing systems and high-performance data pipelines. The ideal candidate should have hands-on experience with distributed computing, real-time data streaming, and cloud-based big data technologies.
Responsibilities
β’ Design, develop, and maintain large-scale big data applications using Scala and Java.
β’ Build scalable ETL/ELT pipelines for processing structured and unstructured data.
β’ Develop distributed data processing solutions using Apache Spark, Hadoop, and Kafka.
β’ Implement real-time streaming applications and event-driven architectures.
β’ Optimize big data applications for performance, scalability, and reliability.
β’ Work closely with data engineers, architects, analysts, and business teams to deliver data solutions.
β’ Develop RESTful APIs and microservices for data integration and processing.
β’ Perform data validation, cleansing, transformation, and aggregation activities.
β’ Monitor and troubleshoot production data pipelines and resolve performance bottlenecks.
β’ Participate in code reviews, design discussions, and Agile development activities.
Required Skills
β’ Strong programming experience in Scala and Java.
β’ Hands-on expertise with Apache Spark (Core, SQL, Streaming) and Hadoop ecosystem.
β’ Experience with Kafka or other messaging/streaming platforms.
β’ Knowledge of distributed systems and parallel data processing concepts.
β’ Strong SQL skills and experience with relational and NoSQL databases.
β’ Experience building ETL/data pipeline solutions.
β’ Familiarity with Hive, HBase, Cassandra, MongoDB, or Snowflake.
β’ Experience with cloud platforms such as AWS, Azure, or GCP.
β’ Hands-on experience with Docker, Kubernetes, Jenkins, Git, and CI/CD pipelines.
β’ Understanding of data modeling, data warehousing, and big data architecture.
Preferred Qualifications
β’ Experience with Databricks, Delta Lake, or cloud-native data platforms.
β’ Knowledge of Airflow, Oozie, or workflow orchestration tools.
β’ Experience in real-time analytics and streaming data architectures.
β’ Exposure to machine learning pipelines or AI/ML data processing.
β’ Experience working in Agile/Scrum environments.
β’ Bachelorβs degree in Computer Science, Engineering, or related field.
Nice to Have
β’ Experience with Python for data engineering tasks.
β’ Knowledge of Terraform or Infrastructure as Code.
β’ Cloud certifications in AWS, Azure, or GCP.
β’ Experience in finance, healthcare, telecom, or e-commerce domains.
Big Data Developer (Scala & Java) β Job Description
We are looking for an experienced Big Data Developer with strong Scala and Java expertise to build scalable data processing systems and high-performance data pipelines. The ideal candidate should have hands-on experience with distributed computing, real-time data streaming, and cloud-based big data technologies.
Responsibilities
β’ Design, develop, and maintain large-scale big data applications using Scala and Java.
β’ Build scalable ETL/ELT pipelines for processing structured and unstructured data.
β’ Develop distributed data processing solutions using Apache Spark, Hadoop, and Kafka.
β’ Implement real-time streaming applications and event-driven architectures.
β’ Optimize big data applications for performance, scalability, and reliability.
β’ Work closely with data engineers, architects, analysts, and business teams to deliver data solutions.
β’ Develop RESTful APIs and microservices for data integration and processing.
β’ Perform data validation, cleansing, transformation, and aggregation activities.
β’ Monitor and troubleshoot production data pipelines and resolve performance bottlenecks.
β’ Participate in code reviews, design discussions, and Agile development activities.
Required Skills
β’ Strong programming experience in Scala and Java.
β’ Hands-on expertise with Apache Spark (Core, SQL, Streaming) and Hadoop ecosystem.
β’ Experience with Kafka or other messaging/streaming platforms.
β’ Knowledge of distributed systems and parallel data processing concepts.
β’ Strong SQL skills and experience with relational and NoSQL databases.
β’ Experience building ETL/data pipeline solutions.
β’ Familiarity with Hive, HBase, Cassandra, MongoDB, or Snowflake.
β’ Experience with cloud platforms such as AWS, Azure, or GCP.
β’ Hands-on experience with Docker, Kubernetes, Jenkins, Git, and CI/CD pipelines.
β’ Understanding of data modeling, data warehousing, and big data architecture.
Preferred Qualifications
β’ Experience with Databricks, Delta Lake, or cloud-native data platforms.
β’ Knowledge of Airflow, Oozie, or workflow orchestration tools.
β’ Experience in real-time analytics and streaming data architectures.
β’ Exposure to machine learning pipelines or AI/ML data processing.
β’ Experience working in Agile/Scrum environments.
β’ Bachelorβs degree in Computer Science, Engineering, or related field.
Nice to Have
β’ Experience with Python for data engineering tasks.
β’ Knowledge of Terraform or Infrastructure as Code.
β’ Cloud certifications in AWS, Azure, or GCP.
β’ Experience in finance, healthcare, telecom, or e-commerce domains.






