Machine Learning Engineer

Machine Learning Engineer Job Opportunity at Neural Magic

About Neural Magic

Neural Magic, based in Somerville, Massachusetts, is an innovative Series A startup backed by top-tier investors such as Andreessen Horowitz, NEA, VMware, Verizon Ventures, Comcast Ventures, and Amdocs. Our mission is to democratize the power of open-source LLMs and vLLM, accelerating AI for enterprises and simplifying GenAI deployments. As a leading developer and maintainer of the vLLM project and inventor of advanced techniques for model quantization and sparsification, Neural Magic offers a robust platform for enterprises to build, optimize, and scale LLM deployments.

Our Mission

We aim to bring the transformative power of open-source LLMs and vLLM to every enterprise on the planet.

Your Role as a Machine Learning Engineer

As a Machine Learning Engineer at Neural Magic, you will work closely with our product and research teams to develop state-of-the-art deep learning software. You will establish training and deployment pipelines, implement model compression algorithms, and productize deep learning research. If you are passionate about solving challenging technical problems at the forefront of deep learning, this role is perfect for you!

Key Responsibilities

  • Utilize your expertise in machine learning to address significant technical challenges.
  • Collaborate with research and product development teams to build ML products.
  • Prototype and implement appropriate ML algorithms, tools, and pipelines.
  • Create and manage training and deployment pipelines.
  • Work with cross-functional teams to understand market requirements and best practices.
  • Stay updated on the latest developments in the field.

Requirements

  • Proven experience as a machine learning engineer or in a similar role.
  • Solid understanding of machine learning and deep learning fundamentals, with expertise in areas like computer vision, NLP, speech, reinforcement learning, and generative models.
  • Knowledge of common ML frameworks (e.g., PyTorch or Keras) and libraries (e.g., NumPy and scikit-learn).
  • Strong programming skills, particularly in Python, with experience implementing machine learning solutions.
  • Experience engineering and supporting ML pipelines in popular frameworks such as PyTorch, TensorFlow, or JAX.
  • Experience engineering and maintaining training and/or deployment pipelines for generative models, NLG, or LLMs.
  • Ability to interpret and implement research ideas and algorithms.
  • Creative, collaborative, and innovation-focused mindset.
  • Strong sense of project ownership and personal responsibility.
  • Bachelor's degree in Computer Science, Mathematics, or a related field.

Benefits

  • Health Care Plan (Medical, Dental & Vision)
  • Retirement Plan (401k, IRA)
  • Paid Time Off (Vacation, Sick, & Public Holidays)
  • Family Leave (Maternity, Paternity)
  • Short Term & Long Term Disability
  • Training & Development opportunities
  • Work From Home options
  • Free Food & Snacks
  • Wellness Resources
  • Stock Option Plan

We are an equal opportunity employer. All applicants will be considered for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran status, or disability status.

Join us at Neural Magic and contribute to shaping the future of AI!