Staff/Principal Machine Learning Engineer, Speech
Inworld AI - mountain view, CA
Apply NowJob Description
Why Join InworldInworld is the best-funded startup in AI and games with a $500 million valuation and backing from top tier investors like Intel, Microsoft, Lightspeed, Bitkraft, Founders Fund, Kleiner Perkins, and more. Inworld was recognized by CB Insights as one of the 100 most promising AI companies in the world and was nominated alongside Anthropic, DeepMind, OpenAI and Nvidia for Generative AI Innovator of the Year at the VentureBeat Awards world is the leading AI engine for games, enabling developers to build groundbreaking game mechanics, dynamic NPCs and worlds that evolve with each action. Inworld powers experiences built by Ubisoft, NVIDIA, Niantic, NetEase Games and LG, among others, and has partnerships with key industry players such as MicrosoftXbox, Epic Games, and Unity. We are seeking Staff and Principal Machine Learning Speech Engineers with extensive experience in R&D of text-to-speech (TTS) and speech-to-text (STT) technologies. In this role, you will be at the forefront of building generative AI stack to power next-generation AI characters.Minimum QualificationsBachelor's degree in Computer Science, Engineering, or a similar technical field.6+ years of experience with software development in one or more programming languages, machine learning algorithms and tools (e.g., PyTorch), artificial intelligence, deep learning andor natural language processing.Excellent problem solving skills and the ability to work independently and as part of a team.Preferred QualificationsMaster's degree or PhD in speech synthesisrecognition or adjacent fields.5+ years of experience with design and architecture; and testinglaunching software products.1+ years of experience in working with sourcing and curating speech datasets.1+ years of experience in a technical leadership role leading project teams and setting technical direction.1+ years of experience in building end-to-end speech processing systems and real-time applications.ResponsibilitiesResearch and experiment with cutting edge ML techniques for TTS and STT applications.Develop and test production-grade training and inference pipelines for TTS and STT applications.Understand optimization problems in the area of speech, signals, and natural language processing.Collaborate with cross-functional teams to integrate speech technologies into -office location: Mountain View, CA, United States.Remote locations: United States and Canada.The US base salary range for this full-time position is $240,000 - $385,000. In addition to base pay, total compensation includes equity and benefits. Within the range, individual pay is determined by work location, level, and additional factors, including competencies, experience, and business needs. The base pay range is subject to change and may be modified in the future.
Created: 2024-07-07