Internship for Machine Learning Engineer, Data Processing - Remote for EMEA

  • Internship
Job expired!

Here at Hugging Face, we're on a journey to advance quality Machine Learning and make it more accessible. Along the way, we contribute to the development of better technology.

We've built the fastest-growing, open-source, library of pre-trained models globally. With over 500K+ models and 250K+ stars on GitHub, more than 15,000 companies are using HF technology in production, including leading AI organizations like Google, Elastic, Salesforce, Algolia, and Grammarly.

About the Role

This internship delves into what makes LLMs effective currently: the quality of the data that drives them. In collaboration with lead authors of renowned web-scale datasets (like Guilherme, the lead author of the RefinedWeb dataset), we will enhance LLM model performances through meticulous data engineering and processing at scale. You should be fond of large-scale data processing projects.

About You

If you have a passion for open-source, are keen about making complex technology more accessible to engineers and artists, and desire to contribute to one of the fastest-growing ML ecosystems, then we can't wait to receive your application!

If you're interested in joining us but don't meet all the criteria above, we still encourage you to apply! We're creating a diverse team whose skills, experiences, and backgrounds complement each other. We're willing to consider where you might be able to make the most significant impact.

More about Hugging Face

We're actively striving to establish a culture that values diversity, equity, and inclusivity. We're systematically creating a workplace where everyone feels respected and supported, regardless of who you are or where you come from. We believe this is essential to building a great company and community. Hugging Face is an equal opportunity employer, and we do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

We value development. You'll work with some of our industry's smartest people. We are an organization that emphasizes impact and always motivates ourselves to continue growing. We reimburse all employees for relevant conferences, training, and education.

We care about your well-being. We offer flexible working hours and remote options. We support our employees wherever they are. Although we have office spaces worldwide, particularly in the US, Canada, and Europe, we're very distributed, and all remote employees have the chance to visit our offices. If necessary, we'll also equip your workstation to ensure you succeed.

We support the community. We believe significant scientific advancements result from collaboration across the field. Join a community that supports the ML/AI community.