English
- Spanish
- French
- Ukrainian
- Polish
- Russian
- Japanese
- Egyptian

Internship for Machine Learning Engineer, Data Processing - Remote in the US

Internship

Job expired!

Here at Hugging Face, we're embarking on a mission to improve Machine Learning and make it more accessible. Along this path, we contribute to the advancement of technology for the greater good.

We have created the world's fastest-growing open-source library of pre-trained models. With over 500K+ models and 250K+ stars on GitHub, more than 15,000 companies are using HF technology in production, including leading AI organizations such as Google, Elastic, Salesforce, Algolia, and Grammarly.

About the Role

In this internship, we delve into what makes LLMs potent today: the quality of data that fuels them. In collaboration with the main authors of renowned web-scale datasets (like Guilherme, the primary author of the RefinedWeb dataset), we will enhance the performance of LLM models by meticulous data engineering and processing on a large scale. You should be fond of large-scale data processing projects.

About You

If you're passionate about open-source, enthused about making intricate technology more accessible to engineers and artists, and eager to contribute to one of the fastest-growing ML ecosystems, then we can't wait to see your application!

Even if you don't hit every mark above, we still encourage you to apply! We're putting together a diverse team whose skills, experiences, and backgrounds complement each other. We're eager to see where you could have the most significant impact.

More about Hugging Face

We're diligently working to foster a culture that values diversity, equity, and inclusivity. We are purposefully cultivating a workplace where everyone feels respected and supported—irrespective of who you are or where you come from. We believe this is the cornerstone of building a great company and community. Hugging Face is an equal opportunity employer, and we do not discriminate based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Professional development is important to us. You'll work alongside some of the brightest minds in our field. We're an organization inclined towards impact, always pushing ourselves to keep growing. We reimburse all employees for relevant conferences, training, and educational pursuits.

Your well-being is crucial to us. We provide flexible working hours and remote options, supporting our employees wherever they are. Despite having office spaces globally, particularly in the US, Canada, and Europe, we're widely dispersed and all remote employees have opportunities to visit our offices. If necessary, we'll also equip your workstation to ensure your success.

We uphold the community. We believe that significant scientific advancements stem from cross-field collaborations. Join a community that supports the ML/AI community.