PhD Multimodal AI Intern (Fall 24)

Job expired!

Join Dolby Laboratories - PhD Multimodal AI Intern (Fall 2024)

Are you ready to shape the future of entertainment technology? Join Dolby Laboratories as a PhD Multimodal AI Intern and be part of our pioneering innovation in entertainment. Our Dolby U internship program offers unparalleled project-based work in a collaborative and creative environment, working alongside industry leaders.

About the Internship Program

The Dolby U internship program is designed to amplify your insatiable curiosity by implementing real-world solutions that revolutionize how people communicate and enjoy entertainment. At Dolby, we foster a collegial culture with challenging projects, excellent compensation, and benefits, including a flexible work approach to support where, when, and how you do your best work.

Why Join Us?

  • First-hand exposure to groundbreaking Dolby technology.
  • A diverse, open, and welcoming culture.
  • Practical experience working on real-world projects.
  • Opportunities to make an impact: your work will be used by millions of people daily.
  • The potential to publish and/or patent your innovations.

About the Advanced Technology Group (ATG)

The Advanced Technology Group (ATG) is Dolby’s research division, tasked with driving insights and technological solutions to propel Dolby’s growth. Our team of researchers specializes in various fields including AI/ML, algorithms, digital signal processing, audio engineering, image processing, computer vision, data science & analytics, distributed systems, cloud, edge & mobile computing, computer networking, and IoT.

Responsibilities

As a member of the Multimodal Processing Team, your role will involve creating novel AI algorithms that utilize audio, video, text, or other input modalities. These algorithms aim to enhance audiovisual experiences and intelligently analyze or process content, building innovative technologies that revolutionize entertainment.

Candidate Profile

What We’re Looking For:

  • Solid technical skills and a passion for problem-solving.
  • Strong analytical abilities, good communication, and collaboration skills.
  • Curiosity about how things work and enthusiasm for audio, video, movies, music, or game technology.

Areas of Focus:

  • Multimodal machine learning and deep learning.
  • Adversarial machine learning.
  • Multimodal Large Language Models (LLMs).
  • Audiovisual content analysis and enhancement.
  • Multimodal representation learning.
  • Generative AI for audio and video.

Qualifications

  • Working towards a Master’s or Ph.D. degree in Artificial Intelligence, Electrical Engineering, Computer Science, or a related field.
  • Experience developing and training deep learning architectures, particularly for audio and/or video applications.
  • Experience with representation learning problems and adversarial machine learning is a plus.
  • First-author publications in peer-reviewed AI conferences (e.g., CVPR, ICCV, ECCV, NeurIPS, ICML, InterSpeech, ICASSP).
  • Programming experience in Python and working with frameworks like PyTorch or TensorFlow.
  • Ability to prototype quickly and strong critical thinking skills.
  • Excellent communication skills and a team-oriented work ethic.

Eligibility

Applicants must be currently working towards a Ph.D. degree in Computer Science, Electrical Engineering, or a related field, or be recent graduates within six months of graduation. The internship is full-time, Monday to Friday, for 3 months (September 2024 – December 2024).

Start Date: Monday, September 23, 2024 (Non-flexible)

Compensation

The San Francisco/Bay Area base hourly range for this internship position is $44-57/hr, with variations based on location. Your recruiter will provide more details about compensation and perks during the hiring process.

Equal Employment Opportunity

Dolby Laboratories is an equal opportunity employer. Our success depends on the combined skills and talents of all our employees. We are committed to making