Senior Machine Learning Engineer - LLM
What You Will Do
We are looking for a Machine Learning Engineer to help build cutting edge ML infrastructure for building and serving LLM’s at Moveworks. This role will be critical in building, optimizing and scaling end-to-end machine learning systems. The ML infra team covers a variety of responsibilities including distributed training and inference pipeline for large language models(LLM), model evaluation and monitoring framework, LLM latency optimization, etc. These frameworks serve as a strong foundation for our hundreds of ML and NLP models in production serving hundreds of millions of enterprise employees. We are solving many challenges on scalability of services as well as optimization of core algorithms.
In this role you will work closely with our machine learning team, data infrastructure team and every core skill. Above all, your work will impact the way our customers experience AI. Put another way, this role is absolutely critical to the long term scalability of our core AI product and ultimately the company. You will be responsible for building and productionizing ML infrastructure that runs state of the art models. If you are looking for a high-impact, fast-moving role to take your work to the next level, we should have a conversation.
- Design, build and optimize scalable machine learning infrastructure to support training, evaluation, and deployment of large language models.
- Build abstractions to automate various steps in different ML workflows
- Collaborate with cross functional teams of engineers, data analytics, machine learning experts, and product to build new features
- Leverage your experience to drive best practices in ML and data engineering
What You Bring To The Table
- 2+ years of industry experience in Machine Learning, Infrastructure or related fields
- Experience with deep learning framework such as Pytorch or Huggingface or LLM serving frameworks such as vLLM or TensorRT-LLM.
- Experience with building and scaling end-to-end machine learning systems
- Experience building scalable micro services and ETL pipelines
- Expertise in Python and experience with performant language such as C++ or GoLang
- Bachelor's in Computer Science, Computer Engineering, Mathematics, or equivalent field.
- A love of research publications in the machine learning and software engineering communities
- Effective communicator with experience collaborating cross-functionally with other teams
Nice To Haves
- Experience with ML Inference optimization using TensorRT.
- Experience with distributed training frameworks such as Deepspeed.
- Experience in managing and scaling GPU Inference services via Kubernetes
Base salary compensation range: $200,000 - $275,000
*Our compensation package includes a market competitive salary, equity for all full time roles, exceptional benefits, and, for applicable roles, commissions or bonus plans.
Ultimately, in determining pay, final offers may vary from the amount listed based on geography, the role’s scope and complexity, the candidate’s experience and expertise, and other factors.
Moveworks Is An Equal Opportunity Employer
*Moveworks is proud to be an equal opportunity employer. We provide employment opportunities without regard to age, race, color, ancestry, national origin, religion, disability, sex, gender identity or expression, sexual orientation, veteran status, or any other characteristics protected by law.
Who We Are
Moveworks is an AI Assistant that helps all employees find information, automate tasks, and be more productive. We give the entire workforce one interface to get answers and take action across every enterprise system. And for developers, we make it easy to build and deploy AI agents that bring the power of Moveworks to every business process or workflow.
It’s all powered by a pioneering Reasoning Engine paired with an Agentic Automation Engine that, together, are able to handle even the most complex requests by understanding queries, then building and executing intelligent plans to fulfill them — in seconds.
Founded in 2016, Moveworks has raised $315M in funding, and eclipsed $100M in ARR in 2024 thanks to our award-winning product and team. Along the way, we’ve earned recognition as a leader in the Forrester Wave for Conversational AI Platforms for Employee Services, as a member of the Forbes Cloud 100 and AI 50 lists, and as one of America’s Most Loved Workplaces according to Newsweek.
Today, Moveworks has over 500 employees in six offices globally, and is backed by some of the world's most prominent investors including Kleiner Perkins, Lightspeed, Bain Capital Ventures, Sapphire Ventures, Iconiq, and more.
Over 350 leading organizations like Marriott, Databricks, Toyota, CVS Health, and Honeywell trust Moveworks to increase operational efficiency, enhance the employee experience, and drive lasting AI transformation.
Come join one of the most innovative teams on the planet!