Tekgence Inc

Senior HPC/AI Engineer

⭐ - Featured Role | Apply direct with Data Freelance Hub

This role is for a Senior HPC/AI Engineer, offering a 12+ month contract with a pay rate of "unknown". It requires expertise in advanced Ethernet networking, InfiniBand, and automation languages. A Master's or PhD in a relevant field and 12+ years of experience are essential.

🌎 - Country

United States

💱 - Currency

$ USD

💰 - Day rate

Unknown

🗓️ - Date

October 15, 2025

🕒 - Duration

More than 6 months

🏝️ - Location

Remote

📄 - Contract

Unknown

🔒 - Security

Unknown

📍 - Location detailed

United States

🧠 - Skills detailed

#Scala #Ansible #Network Engineering #Data Science #Deployment #Monitoring #Network Security #Security #Ruby #"ETL (Extract #Transform #Load)" #Puppet #CHEF #Automation #Terraform #Mathematics #AI (Artificial Intelligence) #VPN (Virtual Private Network) #Computer Science #Statistics #Python

Role description

Role: Senior HPC/AI Engineer Location: Ideally they sit in Santa Clara but open to US Remote Duration: 12 months+ Ideally candidates have experience in advanced ethernet networking including using ROCE and NICKL as well as experience building high performance networks specifically for Data Centers. They don't need all the stuff, just at least one or two of these things. We are seeking a highly skilled Principal Network Engineer to join our dynamic team to build the next generation of IT AI Clusters and help lead the team through a major technology transformation into running AI on-prem and build infrastructure by integrating Enterprise ready platforms while building a solid foundation with automation. We are looking for a passionate engineer who will solve networking problems for scalable AI clusters. This is a hands-on network engineering position focused on the architecture, design, development and deployment of ultra-high-speed, resilient, and scalable DC AI Clusters and Interconnects for GPU-accelerated data centers and compute clusters. Outstanding problem-solving abilities and a comprehensive understanding of the network security protocols & standards, routing, switching, automation and deep understanding of fundamental network theory is also critical to your success. What you will be doing: • Lead the architecture, design, and deployment of global-scale DCs inter-connects and fabric for HPC, AI, and GPU computing clusters. • Develop high-performance data center fabric using InfiniBand, Ultra Ethernet and related technologies. • Optimize carrier interconnects, intra and inter DC routing, and dark fiber deployments to ensure low latency and high reliability. • Partner with system, OS, GPU, and HPC teams to deliver scalable, highly available networks for extreme-performance workloads. • Implement network monitoring, telemetry, solving, and continuous performance improvement processes. • Drive technology selection, vendor engagement, and lifecycle management for Data Center hardware and software. • Collaborate with internal product managers develop solutions What we need to see: • MS or PhD in Electrical Engineering, Computer Science, Computer Engineering, Artificial Intelligence, Data Science, Mathematics, Statistics, or equivalent experience. • 12+ years of experience in building, managing and supporting large scale hybrid networks, developing automation pipelines with Python, Ruby, Go or other languages used in infrastructure automation. • Expert in networking technologies: InfiniBand, Ultra Ethernet, ROCEv2, DCQCN, TCP/UDP, IPv4/IPv6, BGP/MP-BGP, VPN, L2 switching, EVPN, VxLAN, Segment Routing, MPLS. • Experience automating network infrastructure • Experience using an automated configuration management system (Python,Terraform, Chef, Puppet, Ansible, Salt, etc.) • Develop, optimize, and deploy high-performance computing workloads using NVIDIA NICKL to accelerate AI and scientific simulation workflows on multi-GPU clusters.

Apply now Apply with DFH Sign up

Tekgence Inc

Senior HPC/AI Engineer

Data Science Lead Trainer/Instructor

Data Compliance Engineer

Oracle EBS Procure to Pay (P2P) - Business Analyst - Remote

Data and Analytics Consultant (NSAW Track)

Book a

chat

with us

Company