Cliff Services Inc

W2 Only--Data Engineer--F2F Interview (No C2C)

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a W2 Data Engineer with expertise in Python, PySpark, and AWS, requiring 3-4 days onsite in McLean VA, Richmond VA, or Dallas TX. Key skills include ETL/ELT pipeline development and AWS data services experience.
🌎 - Country
United States
💱 - Currency
$ USD
-
💰 - Day rate
Unknown
-
🗓️ - Date
December 23, 2025
🕒 - Duration
Unknown
-
🏝️ - Location
Hybrid
-
📄 - Contract
Corp-to-Corp (C2C)
-
🔒 - Security
Unknown
-
📍 - Location detailed
Richmond, VA
-
🧠 - Skills detailed
#"ETL (Extract #Transform #Load)" #Schema Design #Cloud #Spark (Apache Spark) #Apache Spark #Data Architecture #PySpark #Python #Redshift #SQL (Structured Query Language) #Kafka (Apache Kafka) #Apache Airflow #Data Processing #Data Science #Code Reviews #BI (Business Intelligence) #Lambda (AWS Lambda) #DynamoDB #Libraries #Pandas #S3 (Amazon Simple Storage Service) #Documentation #Data Transformations #Programming #Scala #Version Control #Snowflake #Data Modeling #Data Engineering #Data Pipeline #GIT #Athena #Data Quality #Security #Airflow #Data Ingestion #Data Storage #Storage #AWS (Amazon Web Services) #Data Governance
Role description
Job Title: Data Engineers Type: Onsite (Hybrid 3 to 4 days to office) Interview: In Person Locations: McLean VA, Richmond VA, Dallas TX Job Description: A Data Engineer with Python, PySpark, and AWS expertise is responsible for designing, building, and maintaining scalable and efficient data pipelines in cloud environment Responsibilities: Design, develop, and maintain robust ETL/ELT pipelines using Python and PySpark for data ingestion, transformation, and processing. Work extensively with AWS cloud services such such as S3, Glue, EMR, Lambda, Redshift, Athena, and DynamoDB for data storage, processing, and warehousing. Build and optimize data ingestion and processing frameworks for large-scale data sets, ensuring data quality, consistency, and accuracy. Collaborate with data architects, data scientists, and business intelligence teams to understand data requirements and deliver effective data solutions. Implement data governance, lineage, and security best practices within data pipelines and infrastructure. Automate data workflows and improve data pipeline performance through optimization and tuning. Develop and maintain documentation for data solutions, including data dictionaries, lineage, and technical specifications. Participate in code reviews, contribute to continuous improvement initiatives, and troubleshoot complex data and pipeline issues Required Skills: Strong programming proficiency in Python, including libraries like Pandas and extensive experience with PySpark for distributed data processing. Solid understanding and practical experience with Apache Spark/PySpark for large-scale data transformations. Demonstrated experience with AWS data services, including S3, Glue, EMR, Lambda, Redshift, and Athena. Proficiency in SQL and a strong understanding of data modeling, schema design, and data warehousing concepts. Experience with workflow orchestration tools such as Apache Airflow or AWS Step Functions. Familiarity with CI/CD pipelines and version control systems (e.g., Git). Excellent problem-solving, analytical, and communication skills, with the ability to work effectively in a team environment. Preferred Skills: Experience with streaming frameworks like Kafka or Kinesis. Knowledge of other data warehousing solutions like Snowflake Thanks & regards, K Hemanth | Recruitment Specialist Email: hemanth.k@cliff-services.com