Deep Learning for MIR (TWO WEEKS)

Schedule

Mon, 20 Jul, 2026 at 10:00 am to Fri, 31 Jul, 2026 at 05:00 pm

UTC-07:00

Location

The Knoll | Stanford, CA

Music Information Retrieval, starting with the basics and ending with state-of-the-art algorithms.
About this Event

Deep Learning for Music Information Retrieval

This workshop offers a fast-paced introduction to audio and music processing with deep learning to bring you up to speed with the state-of-the-art practice in 2025. Participants will learn to build tools to analyze and manipulate digital audio signals with PyTorch. Both the theory and practice of digital audio processing will be discussed, with hands-on exercises on algorithm implementation. These concepts will be applied to various topics in music information retrieval. Some knowledge of Python, linear algebra, and object-oriented programming is assumed.

In-person (CCRMA, Stanford) and online enrollment options are available. Students will receive the same teaching materials and have access to the same tutorials in either format. However, students who take the course in person will get more in-depth, hands-on 1:1 discussion and feedback from the instructors.



Schedule

Day 1
• Review: Fundamentals of audio signals, key mathematical concepts (linear algebra, calculus), and common music/audio features (MFCCs, chroma, spectral contrast).
• Theory: Overview of time-frequency representations (STFT, mel-spectrogram), feature extraction pipelines.
• Hands-on: Audio feature extraction using Librosa and TorchAudio.
Day 2
• Review: Feedforward neural networks and the fundamentals of deep learning (backpropagation, loss functions).
• Theory: Introduction to the Transformer architecture; comparison with traditional sequence models.
• Hands-on: Training a simple Transformer for sequence classification (e.g., audio command recognition).
Day 3
• Theory: Convolutional Neural Networks (CNNs) for audio classification; Recurrent Neural Networks (RNNs) for temporal modeling.
• Hands-on: Spectrogram-based genre or instrument classification using CNNs and/or RNNs in PyTorch.
Day 4
• Theory: Generative models for audio — Variational Autoencoders (VAEs), diffusion models, and their applications in audio/music synthesis.
• Hands-on: Musical tone generation using a pitch- or timbre-conditioned VAE; exploration of a pre-trained diffusion model for audio generation.
Day 5
• Literature: Guided reading and discussion on recent papers (e.g., AudioCLIP, Jukebox, AudioLM, MusicLM, MusicGen).
• Hands-on: Group project presentations and demos (e.g., semantic audio tagging, music synthesis, or creative audio applications using models explored during the week).
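
As a taste of the Day 1 material, the sketch below computes a short-time Fourier transform (STFT) magnitude for a synthetic 440 Hz tone. It is an illustrative example only, not workshop material: it uses plain Python and a naive DFT rather than Librosa or TorchAudio, and the function names, frame length, hop size, and sample rate are all assumptions chosen for illustration.

```python
import cmath
import math


def hann(n):
    """Periodic Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2 * math.pi * i / n) for i in range(n)]


def stft_magnitude(signal, frame_len=256, hop=128):
    """Return one list of magnitudes (bins 0..frame_len // 2) per frame."""
    window = hann(frame_len)
    frames = []
    # Slide a window over the signal, keeping only complete frames.
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + i] * window[i] for i in range(frame_len)]
        mags = []
        for k in range(frame_len // 2 + 1):
            # Naive DFT, O(N^2) per frame; real libraries use an FFT instead.
            acc = sum(
                frame[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                for n in range(frame_len)
            )
            mags.append(abs(acc))
        frames.append(mags)
    return frames


# Hypothetical test signal: 1024 samples of a 440 Hz sine at 8 kHz.
sample_rate = 8000
tone_hz = 440.0
signal = [math.sin(2 * math.pi * tone_hz * t / sample_rate) for t in range(1024)]

spec = stft_magnitude(signal)

# Locate the strongest frequency bin in the first frame and convert it to Hz.
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
peak_hz = peak_bin * sample_rate / 256
print(len(spec), len(spec[0]), peak_bin, peak_hz)
```

With these settings each bin is 31.25 Hz wide, so the peak lands in the bin closest to 440 Hz. A mel-spectrogram would add one more step, pooling these linear-frequency bins into perceptually spaced bands.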


About the instructors

Kitty Shi is an accordionist, pianist, bagpipe player, and music technologist. She received her PhD from CCRMA in 2021 and is now a machine learning engineer at Pinterest. Kitty’s research interest is in computer-assisted expressive musical performance.

Iran R. Roman is a faculty member at Queen Mary University of London, leading research in theoretical neuroscience and machine perception. He holds a PhD from CCRMA. Iran is a passionate instructor and mentor, with extensive experience teaching AI and signal processing at institutions such as Stanford University, New York University, and the National Autonomous University of Mexico. He has worked with companies such as Plantronics, Apple, Oscilloscape, Tesla, and Raytheon/BBN to build and deploy AI models. iranroman.github.io



Where is it happening?

The Knoll, 660 Lomita Court, Stanford, United States


Tickets

USD 312.89 to USD 1038.79

Host or Publisher: CCRMA Summer Workshops
