Neural Magic is an early-stage AI software company that wants to democratize high performance for deep learning models. Our aim is to lower the cost and improve the performance of end users deploying deep learning applications. Leveraging years of research at MIT, Neural Magic has built a software platform that enables developers to sparsify deep learning models to reduce footprint and achieve GPU-like speeds on CPUs. Please visit our website and GitHub repositories for a better understanding of our work.
We were founded by a team of award-winning computer scientists and researchers from MIT and our venture-backed company is located in Davis Square, Somerville, MA. Our investors include Amdocs, Andreessen Horowitz, Comcast Ventures, NEA, and Pillar VC.
We are looking for a machine learning research scientist with a track record of publications on model compression techniques such as pruning and quantization. This individual will work closely with our research team to identify, report on, and develop new algorithms in the realm of deep learning. If you are eager to tackle complex technical challenges at the front lines of deep learning, this is the right role for you!
Responsibilities:
- Use your deep understanding of machine learning to solve significant technical issues
- Cooperate with product development teams to transform your ideas into product solutions
- Conduct basic research by defining, designing, implementing, and assessing algorithms
- Actively engage with the academic community through university collaborations, publishing and presenting your work, and attending conferences
Requirements:
- Demonstrable experience as a machine learning researcher with research publications in the model compression (quantization/pruning) or generative AI / NLG / LLMs space
- Deep understanding of deep learning with expertise in one or more areas such as computer vision, NLP, speech, reinforcement learning, generative models, etc.
- Familiarity with common ML frameworks (like PyTorch or Keras) and libraries (like NumPy and scikit-learn)
- Experience in formulating/prototyping algorithms in a popular ML framework such as PyTorch, TensorFlow, jax, etc.
- Strong programming skills with proven experience prototyping and delivering advanced algorithmic solutions
- Ability to clearly explain and present analysis and machine learning concepts to a broad technical audience
- Creativity, collaboration, and a focus on innovation
- Strong sense of project ownership and personal accountability
- Ph.D. in Computer Science, Mathematics, or a similar field
Benefits:
- Health care plan (Medical, Dental & Vision)
- Retirement plan (401k, IRA)
- Paid time off (vacation, sick & public holidays)
- Family leave (maternity, paternity)
- Short-term & long-term disability
- Training & development
- Work from home opportunities
- Free food & snacks
- Wellness resources
- Stock option plan
We are an equal opportunity employer. All applicants will be considered for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.