

Triveni IT
Machine Learning Engineer
β - Featured Role | Apply direct with Data Freelance Hub
This role is for a Machine Learning Engineer focusing on real-time avatar lip synchronization. It's a long-term remote contract with a pay rate of "unknown." Key skills include proficiency in Python, C++, and ML frameworks, with a focus on multi-language speech synchronization and facial animation integration. Certifications in AI/Machine Learning are required.
π - Country
United States
π± - Currency
$ USD
-
π° - Day rate
800
-
ποΈ - Date
December 11, 2025
π - Duration
Unknown
-
ποΈ - Location
Remote
-
π - Contract
Unknown
-
π - Security
Unknown
-
π - Location detailed
United States
-
π§ - Skills detailed
#Visualization #Cloud #Scripting #AI (Artificial Intelligence) #C++ #Python #ML (Machine Learning)
Role description
Role: Lip Sync Engineer β Real-Time Avatar Synchronization
Duration: long term contract
Location: Remote
About the Role
Weβre seeking a technically skilled Lip Sync Engineer to bring our avatars to life with natural, real-time speech synchronization. This role focuses on building systems that seamlessly integrate text, audio, and facial animationβensuring avatars speak convincingly across languages and platforms. Youβll work at the intersection of AI, animation, and engineering to deliver lip-sync generation in under 3 seconds, enabling immersive experiences for clients and partners. The Lip Sync Engineer role focuses on developing real-time, high-accuracy lip synchronization systems for digital avatars, enabling natural speech and facial movement across multiple languages and platforms. The candidate will work at the intersection of AI, animation, and engineering to optimize performance and integration. This position emphasizes technical fluency and collaboration rather than pure development, aiming to deliver seamless, immersive user experiences.
Responsibilities
β’ Strong technical knowledge of lip-sync technologies, phoneme/viseme mapping, and facial animation integration.
β’ Experience working with multi-language speech synchronization and ensuring phoneme accuracy across diverse languages.
β’ Proficiency in scripting and system optimization using Python, C++, or related languages to achieve 1β3 second lip-sync generation times.
β’ Familiarity with ML/AI frameworks applied to speech-to-animation synchronization.
β’ Ability to design and optimize pipelines for real-time face animation and voice integration.
β’ Demonstrable experience in system performance tuning to meet speed and accuracy benchmarks.
β’ Experience liaising with product, creative, or client teams to align technical outputs with user experience goals.
β’ Knowledge of avatar expression and emotion blending techniques.
β’ Familiarity with multilingual phoneme/viseme mapping challenges and solutions.
β’ Experience with animation/visualization pipelines including image, audio, and video input formats.
β’ Certifications or coursework in AI/Machine Learning applied to animation or speech processing.
β’ Exposure to cloud-based or distributed systems supporting real-time avatars.
Qualifications
Strong technical knowledge of lip-sync technologies, phoneme/viseme mapping, and facial animation integration.
Required Skills
β’ Strong technical knowledge of lip-sync technologies, phoneme/viseme mapping, and facial animation integration.
β’ Experience working with multi-language speech synchronization and ensuring phoneme accuracy across diverse languages.
β’ Proficiency in scripting and system optimization using Python, C++, or related languages to achieve 1β3 second lip-sync generation times.
β’ Familiarity with ML/AI frameworks applied to speech-to-animation synchronization.
β’ Ability to design and optimize pipelines for real-time face animation and voice integration.
β’ Demonstrable experience in system performance tuning to meet speed and accuracy benchmarks.
β’ Experience liaising with product, creative, or client teams to align technical outputs with user experience goals.
β’ Knowledge of avatar expression and emotion blending techniques.
β’ Familiarity with multilingual phoneme/viseme mapping challenges and solutions.
β’ Experience with animation/visualization pipelines including image, audio, and video input formats.
β’ Certifications or coursework in AI/Machine Learning applied to animation or speech processing.
β’ Exposure to cloud-based or distributed systems supporting real-time avatars.
Preferred Skills
β’ Exposure to cloud-based or distributed systems supporting real-time avatars.
Role: Lip Sync Engineer β Real-Time Avatar Synchronization
Duration: long term contract
Location: Remote
About the Role
Weβre seeking a technically skilled Lip Sync Engineer to bring our avatars to life with natural, real-time speech synchronization. This role focuses on building systems that seamlessly integrate text, audio, and facial animationβensuring avatars speak convincingly across languages and platforms. Youβll work at the intersection of AI, animation, and engineering to deliver lip-sync generation in under 3 seconds, enabling immersive experiences for clients and partners. The Lip Sync Engineer role focuses on developing real-time, high-accuracy lip synchronization systems for digital avatars, enabling natural speech and facial movement across multiple languages and platforms. The candidate will work at the intersection of AI, animation, and engineering to optimize performance and integration. This position emphasizes technical fluency and collaboration rather than pure development, aiming to deliver seamless, immersive user experiences.
Responsibilities
β’ Strong technical knowledge of lip-sync technologies, phoneme/viseme mapping, and facial animation integration.
β’ Experience working with multi-language speech synchronization and ensuring phoneme accuracy across diverse languages.
β’ Proficiency in scripting and system optimization using Python, C++, or related languages to achieve 1β3 second lip-sync generation times.
β’ Familiarity with ML/AI frameworks applied to speech-to-animation synchronization.
β’ Ability to design and optimize pipelines for real-time face animation and voice integration.
β’ Demonstrable experience in system performance tuning to meet speed and accuracy benchmarks.
β’ Experience liaising with product, creative, or client teams to align technical outputs with user experience goals.
β’ Knowledge of avatar expression and emotion blending techniques.
β’ Familiarity with multilingual phoneme/viseme mapping challenges and solutions.
β’ Experience with animation/visualization pipelines including image, audio, and video input formats.
β’ Certifications or coursework in AI/Machine Learning applied to animation or speech processing.
β’ Exposure to cloud-based or distributed systems supporting real-time avatars.
Qualifications
Strong technical knowledge of lip-sync technologies, phoneme/viseme mapping, and facial animation integration.
Required Skills
β’ Strong technical knowledge of lip-sync technologies, phoneme/viseme mapping, and facial animation integration.
β’ Experience working with multi-language speech synchronization and ensuring phoneme accuracy across diverse languages.
β’ Proficiency in scripting and system optimization using Python, C++, or related languages to achieve 1β3 second lip-sync generation times.
β’ Familiarity with ML/AI frameworks applied to speech-to-animation synchronization.
β’ Ability to design and optimize pipelines for real-time face animation and voice integration.
β’ Demonstrable experience in system performance tuning to meet speed and accuracy benchmarks.
β’ Experience liaising with product, creative, or client teams to align technical outputs with user experience goals.
β’ Knowledge of avatar expression and emotion blending techniques.
β’ Familiarity with multilingual phoneme/viseme mapping challenges and solutions.
β’ Experience with animation/visualization pipelines including image, audio, and video input formats.
β’ Certifications or coursework in AI/Machine Learning applied to animation or speech processing.
β’ Exposure to cloud-based or distributed systems supporting real-time avatars.
Preferred Skills
β’ Exposure to cloud-based or distributed systems supporting real-time avatars.






