Machine Learning Research Engineer

Lightmatter


 220k - 230k
 Full-Time
 United States  (Mountain View)
 Hybrid   

Description

The AI age is upon us, and high-performance computing is the underlying platform powering everything from large language models (LLMs) to text-to-image synthesis. However, with the demise of Moore’s law and Dennard scaling, we are at an inflection point. At Lightmatter, we are leading the transition of computing from traditional electronic transistors to photonic technologies, which can operate at mind-blowing efficiency and throughput.

In this role, you will support all the activities of the ML team as it guides the development of a new class of computing infrastructure. This includes fine-tuning LLMs, enabling new models on custom architectures, evaluating the performance of models at scale, developing abstract models of the hardware for evaluating accuracy and throughput, and helping co-design novel hardware in a new paradigm of computing. Working in a small team of highly talented engineers, you will be able to move fast and develop well-architected solutions.

If you are passionate about advanced AI technology and would like to develop scalable algorithms, hardware and ML techniques, join us!

 

Responsibilities

  • Develop software for evaluating accuracy and throughput of models on custom hardware.
  • Develop performance models for a novel class of hardware.
  • Develop scalable algorithms for large-scale inference and training.
  • Train LLMs and other models on a large cluster of GPUs.
  • Support ML researchers as they explore accuracy on a wide variety of models on custom hardware.
  • Publish and present new research at premier ML/CS conferences.

Requirements

  • MS in Computer Science or a related field; PhD strongly preferred.
  • Minimum of 15 years of related experience with a Bachelor’s degree, or 12 years with a Master’s degree.
  • Experience developing and shipping ML/HPC software.
  • Understanding of deep learning, parallel computing, compilers and/or hardware architecture.
  • Experience with developing and modifying machine learning models for scalability.
  • Experience with or understanding of low-precision training and inference.

 

Technical expertise

  • Experience with scalable frameworks such as MPI, PyTorch distributed, CUDA, and NCCL.
  • Highly proficient in deep learning programming languages and frameworks, e.g. Python, C++, CUDA, TensorFlow, PyTorch, JAX.
  • Experience with practical problem solving with innovative algorithmic solutions.

 

Preferred qualifications

  • Understanding of advanced techniques used in parallel computing, deep learning and HPC.
  • Ability to model complex workloads on different architecture proposals.
  • Understanding of parallel computing architectures.
  • Experience contributing firsthand to important software or ML algorithms deployed in industry.
  • You are enthusiastic about new technologies, algorithms, and mathematics.

Benefits

  • Comprehensive Health Care Plan (Medical, Dental & Vision)
  • 401k matching
  • Life Insurance (Basic, Voluntary & AD&D)
  • Generous Time Off (Vacation, Sick & Public Holidays)
  • Paid Family Leave
  • Short Term & Long Term Disability
  • Training & Development
  • Flexible, hybrid workplace model
  • Stock Option Plan

 

Base Compensation Range: $220,000 to $230,000. In accordance with Colorado, California, and New York law, the range provided is Lightmatter's reasonable estimate of the compensation for this role. Actual pay will be based on several factors, including work experience, location, and education.

Lightmatter recruits, employs, trains, compensates and promotes regardless of race, religion, color, national origin, sex, disability, age, veteran status, and other protected status as required by applicable law.
