AI Algorithms - Hardware Codesign Lead (Senior AI Software Engineer)

Role/Responsibilities:

EnCharge AI is looking for an exceptional technical leader who can envision, build, and lead algorithm-hardware codesign efforts for the next generation of AI Inference Hardware. You must have a strong track record of AI algorithm-hardware codesign innovations as well as providing technical leadership to teams in AI Algorithms & Hardware codesign space. This position is a strong growth opportunity, with the possibility of transitioning into further leadership roles in the future.

  • Enabling quantization-aware training (QAT) and post-training quantization (PTQ) techniques/packages that work with open-source frameworks, including Tensorflow (TF) and/or PyTorch (PT).
  • Driving key innovations that enable quantized AI models to work without loss of accuracy on EnCharge hardware.
  • Building Python-based SDKs that enable fully automated quantization schemes for AI models.
  • Work with software teams to build and enable AI services & SDKs that exploit Neural Architecture Search (NAS) for model builds – with the goal of designing and discovering EnCharge HW optimized models for customer-specific datasets.
  • Drive AI algorithms-hardware roadmap features, project management and schedules.
  • Working with clients to understand the specifics of their AI models and to translate those requirements to EnCharge teams working in algorithms, hardware, and software.
  • Mentor/lead junior engineers across the company.

Qualifications/Required Skills:

  • Masters / Ph.D. in EE/CS with >5 years of industry experience in AI algorithms & hardware.
  • At least 3-5 years of experience with C++, Python, Tensorflow & PyTorch and >1-2 years of experience in the code structure (and modifying operators) in Tensorflow & PyTorch.
  • > 1-2 years of experience with Cuda.
  • Deep experience with state-of-the-art neural network topologies in various application domains.
  • Knowledge and implementation of advanced QAT and PTQ techniques used to build highly quantized AI models.
  • 3-5 years of experience with algorithms-hardware codesign.
  • Excellent verbal and communication skills. 
  • Client engagement experience.

Preferred/Beneficial Skills:

  • Knowledge of the end-to-end runtime stack for AI applications including TVM.
  • Knowledge of AI Hardware libraries and compiler stacks (including LLVM and MLIR).
Apply for this job
logo EnCharge AI Engineering Full-time Onsite 📍 Santa Clara, CA or Princeton, NJ Apply Now
Your subscription could not be saved. Please try again.
Your subscription has been successful.

Newsletter

Subscribe and stay updated.

Your subscription could not be saved. Please try again.
Your subscription has been successful.

Join our newsletter