RIIM

Big Data Developer (Scala & Java)

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Big Data Developer (Scala & Java) with a contract length of "unknown" and a pay rate of "unknown". Key skills include expertise in Scala, Java, Apache Spark, Hadoop, Kafka, and cloud platforms. Experience in finance, healthcare, telecom, or e-commerce is preferred.
🌎 - Country
United States
πŸ’± - Currency
$ USD
-
πŸ’° - Day rate
Unknown
-
πŸ—“οΈ - Date
May 22, 2026
πŸ•’ - Duration
Unknown
-
🏝️ - Location
Unknown
-
πŸ“„ - Contract
Unknown
-
πŸ”’ - Security
Unknown
-
πŸ“ - Location detailed
Alpharetta, GA
-
🧠 - Skills detailed
#Azure #Databricks #Data Engineering #Apache Spark #GCP (Google Cloud Platform) #Docker #Code Reviews #GIT #NoSQL #Scala #Hadoop #HBase #Kafka (Apache Kafka) #"ETL (Extract #Transform #Load)" #Programming #AWS (Amazon Web Services) #Cloud #Airflow #Big Data #Spark (Apache Spark) #Jenkins #Microservices #MongoDB #ML (Machine Learning) #Java #Databases #Data Processing #Python #Terraform #Data Architecture #Data Pipeline #Scrum #Infrastructure as Code (IaC) #AI (Artificial Intelligence) #Data Integration #Snowflake #Delta Lake #Kubernetes #Computer Science #Data Modeling #Distributed Computing #Agile #SQL (Structured Query Language)
Role description
Big Data Developer (Scala & Java) – Job Description We are looking for an experienced Big Data Developer with strong Scala and Java expertise to build scalable data processing systems and high-performance data pipelines. The ideal candidate should have hands-on experience with distributed computing, real-time data streaming, and cloud-based big data technologies. Responsibilities β€’ Design, develop, and maintain large-scale big data applications using Scala and Java. β€’ Build scalable ETL/ELT pipelines for processing structured and unstructured data. β€’ Develop distributed data processing solutions using Apache Spark, Hadoop, and Kafka. β€’ Implement real-time streaming applications and event-driven architectures. β€’ Optimize big data applications for performance, scalability, and reliability. β€’ Work closely with data engineers, architects, analysts, and business teams to deliver data solutions. β€’ Develop RESTful APIs and microservices for data integration and processing. β€’ Perform data validation, cleansing, transformation, and aggregation activities. β€’ Monitor and troubleshoot production data pipelines and resolve performance bottlenecks. β€’ Participate in code reviews, design discussions, and Agile development activities. Required Skills β€’ Strong programming experience in Scala and Java. β€’ Hands-on expertise with Apache Spark (Core, SQL, Streaming) and Hadoop ecosystem. β€’ Experience with Kafka or other messaging/streaming platforms. β€’ Knowledge of distributed systems and parallel data processing concepts. β€’ Strong SQL skills and experience with relational and NoSQL databases. β€’ Experience building ETL/data pipeline solutions. β€’ Familiarity with Hive, HBase, Cassandra, MongoDB, or Snowflake. β€’ Experience with cloud platforms such as AWS, Azure, or GCP. β€’ Hands-on experience with Docker, Kubernetes, Jenkins, Git, and CI/CD pipelines. β€’ Understanding of data modeling, data warehousing, and big data architecture. Preferred Qualifications β€’ Experience with Databricks, Delta Lake, or cloud-native data platforms. β€’ Knowledge of Airflow, Oozie, or workflow orchestration tools. β€’ Experience in real-time analytics and streaming data architectures. β€’ Exposure to machine learning pipelines or AI/ML data processing. β€’ Experience working in Agile/Scrum environments. β€’ Bachelor’s degree in Computer Science, Engineering, or related field. Nice to Have β€’ Experience with Python for data engineering tasks. β€’ Knowledge of Terraform or Infrastructure as Code. β€’ Cloud certifications in AWS, Azure, or GCP. β€’ Experience in finance, healthcare, telecom, or e-commerce domains.