

Performance Benchmarking Engineer - RISC-V Workloads
β - Featured Role | Apply direct with Data Freelance Hub
This role is for a Performance Benchmarking Engineer focused on RISC-V workloads, offering a contract of more than 6 months at a competitive pay rate. Key skills include performance benchmarking, Linux, scripting (Python, Bash), and data analysis.
π - Country
United Kingdom
π± - Currency
Β£ GBP
-
π° - Day rate
-
ποΈ - Date discovered
June 8, 2025
π - Project duration
More than 6 months
-
ποΈ - Location type
Unknown
-
π - Contract type
Unknown
-
π - Security clearance
Unknown
-
π - Location detailed
United Kingdom
-
π§ - Skills detailed
#Elasticsearch #Visualization #Datadog #Scripting #Grafana #Data Analysis #NoSQL #Linux #Scala #MongoDB #"ETL (Extract #Transform #Load)" #Automation #Python #Bash #Redis
Role description
Job Description
Weβre looking for a hands-on engineer to support benchmarking and performance analysis of real workloads running on a custom RISC-V processor, implemented in both silicon and FPGA-based environments.
Youβll be responsible for executing, automating, and analyzing application-level benchmarks to help evaluate system throughput, scalability, and efficiency. Your work will directly influence hardware optimization and shape how we present performance data to external partners and customers.
Responsibilities:
β’ Run and monitor Redis, MongoDB, ElasticSearch, and other similar workloads on pre-configured systems.
β’ Vary workload parameters (e.g. thread/core counts, memory configurations) to explore performance boundaries.
β’ Capture and log key metrics like:
β’
β’ Operations per second
β’ Power usage
β’ Latency
β’ Ops/sec per watt
β’ Compare results across configurations and flag anomalies or bottlenecks.
β’ Automate test runs and data collection where possible.
β’ Work with internal engineering to interpret patterns and iterate on performance hypotheses.
Must-Have:
β’ Running performance benchmarks (e.g., Redis, MongoDB, ElasticSearch)
β’ Linux system experience (Fedora 29 or similar)
β’ Scripting for automation (Python, Bash, or Go)
β’ Basic data analysis (e.g., aggregating logs, calculating throughput, visualizing metrics)
β’ Experience working in CI/CD environments or remote test infrastructure
Nice-to-Have:
β’ Redis/Mongo/Elastic performance tuning
β’ Experience collecting metrics like ops/sec, latency, power usage
β’ Familiarity with performance visualization tools (Grafana, Datadog, or similar)
β’ Comfort with custom or constrained environments (e.g., no package managers, legacy OS versions)
Bonus:
β’ Exposure to FPGA-based platforms like Zebu, FireSim, or Veloce
β’ Understanding of many-thread architectures or massively parallel hardware
β’ Prior experience benchmarking on non-x86 or pre-silicon systems
β’ Familiarity with profiling tools (e.g., perf, top, htop, or low-level instrumentation)
β’ Experience working with distributed NoSQL systems under load
About the Hardware Platform:
Youβll be benchmarking real workloads on a custom RISC-V processor, built in-house and currently deployed in both FPGA and silicon environments. The architecture includes:
β’ High thread count per core
β’ A design optimized for efficiency per watt, not peak single-threaded performance
β’ Unique memory behavior and thread scheduling characteristics that affect how workloads scale
You wonβt need to debug RTL or low-level drivers, but understanding how workload behavior maps to threads, memory traffic, and performance bottlenecks will be key to interpreting results and guiding follow-up tests.
This is a short-term contract role with room to extend or expand scope if things go well.
Job Description
Weβre looking for a hands-on engineer to support benchmarking and performance analysis of real workloads running on a custom RISC-V processor, implemented in both silicon and FPGA-based environments.
Youβll be responsible for executing, automating, and analyzing application-level benchmarks to help evaluate system throughput, scalability, and efficiency. Your work will directly influence hardware optimization and shape how we present performance data to external partners and customers.
Responsibilities:
β’ Run and monitor Redis, MongoDB, ElasticSearch, and other similar workloads on pre-configured systems.
β’ Vary workload parameters (e.g. thread/core counts, memory configurations) to explore performance boundaries.
β’ Capture and log key metrics like:
β’
β’ Operations per second
β’ Power usage
β’ Latency
β’ Ops/sec per watt
β’ Compare results across configurations and flag anomalies or bottlenecks.
β’ Automate test runs and data collection where possible.
β’ Work with internal engineering to interpret patterns and iterate on performance hypotheses.
Must-Have:
β’ Running performance benchmarks (e.g., Redis, MongoDB, ElasticSearch)
β’ Linux system experience (Fedora 29 or similar)
β’ Scripting for automation (Python, Bash, or Go)
β’ Basic data analysis (e.g., aggregating logs, calculating throughput, visualizing metrics)
β’ Experience working in CI/CD environments or remote test infrastructure
Nice-to-Have:
β’ Redis/Mongo/Elastic performance tuning
β’ Experience collecting metrics like ops/sec, latency, power usage
β’ Familiarity with performance visualization tools (Grafana, Datadog, or similar)
β’ Comfort with custom or constrained environments (e.g., no package managers, legacy OS versions)
Bonus:
β’ Exposure to FPGA-based platforms like Zebu, FireSim, or Veloce
β’ Understanding of many-thread architectures or massively parallel hardware
β’ Prior experience benchmarking on non-x86 or pre-silicon systems
β’ Familiarity with profiling tools (e.g., perf, top, htop, or low-level instrumentation)
β’ Experience working with distributed NoSQL systems under load
About the Hardware Platform:
Youβll be benchmarking real workloads on a custom RISC-V processor, built in-house and currently deployed in both FPGA and silicon environments. The architecture includes:
β’ High thread count per core
β’ A design optimized for efficiency per watt, not peak single-threaded performance
β’ Unique memory behavior and thread scheduling characteristics that affect how workloads scale
You wonβt need to debug RTL or low-level drivers, but understanding how workload behavior maps to threads, memory traffic, and performance bottlenecks will be key to interpreting results and guiding follow-up tests.
This is a short-term contract role with room to extend or expand scope if things go well.