

Insight Global
Data Engineer
⭐ - Featured Role | Apply direct with Data Freelance Hub
This role is for a Senior Data Engineer with a contract length of "unknown," offering a pay rate of "unknown," and is remote. Key skills include Python, SQL, and Spark, with a focus on building data pipelines for AI/ML applications.
🌎 - Country
United Kingdom
💱 - Currency
£ GBP
-
💰 - Day rate
Unknown
-
🗓️ - Date
November 25, 2025
🕒 - Duration
Unknown
-
🏝️ - Location
Unknown
-
📄 - Contract
Unknown
-
🔒 - Security
Unknown
-
📍 - Location detailed
Chester, England, United Kingdom
-
🧠 - Skills detailed
#"ETL (Extract #Transform #Load)" #Data Engineering #Data Ingestion #AI (Artificial Intelligence) #Python #DevOps #SQL (Structured Query Language) #SharePoint #Data Science #Scala #Data Pipeline #Spark (Apache Spark) #ML (Machine Learning)
Role description
Insight Global are seeking a Senior Data Engineer with expertise in building scalable data pipelines for AI/ML applications. This role focuses on sourcing, processing, and preparing structured and unstructured data for Generative AI solutions.
Responsibilities:
• Build and maintain robust data pipelines to support AI/ML workflows.
• Source data from multiple channels, including regulatory websites, internal policy documents, and SharePoint repositories.
• Process, manipulate, and store data in formats optimized for Data Scientists, enabling effective LLM-based prompting.
• Work with diverse data types (PDFs, Word documents) to extract, structure, and create logical data units for downstream use.
• Design and implement solutions where no existing systems are in place for these processes.
• End-to-end responsibility for sourcing, formatting, and preparing data for AI/ML applications.
• Handle structured and unstructured data, ensuring readiness for advanced modelling and analysis.
Qualifications:
• Proven track record in complex Generative AI projects, with expertise in:
• Data ingestion, vectorization, and chunking.
• Modelling structured and unstructured data.
• Strong knowledge of DevOps, CI/CD pipelines, and modern development principles (TDD, BDD).
• Demonstrated ability to influence design decisions within collaborative development teams.
• Advanced proficiency in Python, SQL, and distributed processing frameworks (e.g., Spark).
Insight Global are seeking a Senior Data Engineer with expertise in building scalable data pipelines for AI/ML applications. This role focuses on sourcing, processing, and preparing structured and unstructured data for Generative AI solutions.
Responsibilities:
• Build and maintain robust data pipelines to support AI/ML workflows.
• Source data from multiple channels, including regulatory websites, internal policy documents, and SharePoint repositories.
• Process, manipulate, and store data in formats optimized for Data Scientists, enabling effective LLM-based prompting.
• Work with diverse data types (PDFs, Word documents) to extract, structure, and create logical data units for downstream use.
• Design and implement solutions where no existing systems are in place for these processes.
• End-to-end responsibility for sourcing, formatting, and preparing data for AI/ML applications.
• Handle structured and unstructured data, ensuring readiness for advanced modelling and analysis.
Qualifications:
• Proven track record in complex Generative AI projects, with expertise in:
• Data ingestion, vectorization, and chunking.
• Modelling structured and unstructured data.
• Strong knowledge of DevOps, CI/CD pipelines, and modern development principles (TDD, BDD).
• Demonstrated ability to influence design decisions within collaborative development teams.
• Advanced proficiency in Python, SQL, and distributed processing frameworks (e.g., Spark).






