Insight Global

Data Engineer

⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Senior Data Engineer with a contract length of "unknown," offering a pay rate of "unknown," and is remote. Key skills include Python, SQL, and Spark, with a focus on building data pipelines for AI/ML applications.
🌎 - Country
United Kingdom
💱 - Currency
£ GBP
-
💰 - Day rate
Unknown
-
🗓️ - Date
November 25, 2025
🕒 - Duration
Unknown
-
🏝️ - Location
Unknown
-
📄 - Contract
Unknown
-
🔒 - Security
Unknown
-
📍 - Location detailed
Chester, England, United Kingdom
-
🧠 - Skills detailed
#"ETL (Extract #Transform #Load)" #Data Engineering #Data Ingestion #AI (Artificial Intelligence) #Python #DevOps #SQL (Structured Query Language) #SharePoint #Data Science #Scala #Data Pipeline #Spark (Apache Spark) #ML (Machine Learning)
Role description
Insight Global are seeking a Senior Data Engineer with expertise in building scalable data pipelines for AI/ML applications. This role focuses on sourcing, processing, and preparing structured and unstructured data for Generative AI solutions. Responsibilities: • Build and maintain robust data pipelines to support AI/ML workflows. • Source data from multiple channels, including regulatory websites, internal policy documents, and SharePoint repositories. • Process, manipulate, and store data in formats optimized for Data Scientists, enabling effective LLM-based prompting. • Work with diverse data types (PDFs, Word documents) to extract, structure, and create logical data units for downstream use. • Design and implement solutions where no existing systems are in place for these processes. • End-to-end responsibility for sourcing, formatting, and preparing data for AI/ML applications. • Handle structured and unstructured data, ensuring readiness for advanced modelling and analysis. Qualifications: • Proven track record in complex Generative AI projects, with expertise in: • Data ingestion, vectorization, and chunking. • Modelling structured and unstructured data. • Strong knowledge of DevOps, CI/CD pipelines, and modern development principles (TDD, BDD). • Demonstrated ability to influence design decisions within collaborative development teams. • Advanced proficiency in Python, SQL, and distributed processing frameworks (e.g., Spark).