

Data Engineer
β - Featured Role | Apply direct with Data Freelance Hub
This role is for a Data Engineer with a contract length of "Unknown", offering a pay rate of "Unknown", located in "Unknown". Key skills include AWS Glue, Elasticsearch, and Python. Experience in data engineering and familiarity with LLMs is essential.
π - Country
United Kingdom
π± - Currency
Β£ GBP
-
π° - Day rate
-
ποΈ - Date discovered
June 26, 2025
π - Project duration
Unknown
-
ποΈ - Location type
Unknown
-
π - Contract type
Unknown
-
π - Security clearance
Unknown
-
π - Location detailed
United Kingdom
-
π§ - Skills detailed
#Data Pipeline #Data Wrangling #MongoDB #Monitoring #Spark (Apache Spark) #Datasets #Lean #AWS (Amazon Web Services) #Data Processing #Elasticsearch #Pandas #PySpark #Normalization #AI (Artificial Intelligence) #S3 (Amazon Simple Storage Service) #Data Quality #JavaScript #Python #AWS Glue #"ETL (Extract #Transform #Load)" #Data Engineering
Role description
Heading 1
Heading 2
Heading 3
Heading 4
Heading 5
Heading 6
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur.
Block quote
Ordered list
- Item 1
- Item 2
- Item 3
Unordered list
- Item A
- Item B
- Item C
Bold text
Emphasis
Superscript
Subscript
About Instantly.ai
Instantly.ai is a leading AI-driven sales outreach and lead intelligence platform, powering over 35K B2B companies.
The Role
Weβre hiring a Data Engineer to lead the backend infrastructure of SuperSearch, our B2B lead intelligence platform. You will own and maintain large-scale data pipelines using AWS Glue (PySpark), S3, Elasticsearch, and MongoDB. This role is central to improving how users search, filter, and find leads, applying everything from algorithm tuning to semantic enhancements with LLMs and embeddings. Youβll have full ownership of a critical system in a fast-moving and high-growth startup environment.
Responsibilities
β’ Own and maintain our data processing pipelines using AWS Glue (PySpark) and S3
β’ Work with large-scale datasets stored in Elasticsearch and MongoDB
β’ Build robust data transformation, cleaning, and normalization workflows
β’ Improve the performance and relevance of our search system through algorithmic tuning and semantic enhancements (e.g. LLMs, DeepL, embeddings)
Must-Have Skills
β’ Solid experience with data engineering on AWS (Glue, S3)
β’ Strong knowledge of Elasticsearch (query design, aggregations, performance tuning...)
β’ Proficiency in Python, especially in data wrangling (PySpark, Pandas)
β’ Experience with data quality, schema evolution, and operational monitoring
β’ Familiarity with LLMs, embeddings, or search ranking improvements is a plus
β’ Backend development skills with Node.js or general JavaScript knowledge would be advantageous
Why Instantly.ai?
β’ High-growth environment: join us on the path to unicorn status.
β’ Impact & autonomy: youβll own a marquee area critical to our success. No slow processes of big enterprise companies. We move fast like a young lean startup.
β’ Collaborative culture: work with top developers and seasoned operators.
Apply now and we'll be in touch shortly