Open for Enrollment

You can also start immediately after joining!

Join Now

Go at your own pace

5 Sessions / 10 hours of work per session

Price

Premium membership $20/month (Preview session 1 free)

Certificate

Included w/ premium membership ($20/month)

Program

Music Information Retrieval

Skill Level

Expert

Video Transcripts

English

Topics

Music, Machine Learning, Music Information Retrieval, Audio Signal Processing, Feature Extraction

Not available for purchase in India

Open for Enrollment

Extracting Information From Music Signals

Would you like to enroll?

Enrollment for this course has closed. But you can enroll in a future offering (please select)

Enrollment has closed

Go at your own pace

5 Sessions / 10 hours of work per session

Price

Premium membership $20/month (Preview session 1 free)

Certificate

Included w/ premium membership ($20/month)

Program

Music Information Retrieval

Skill Level

Expert

Video Transcripts

English

Topics

Music, Machine Learning, Music Information Retrieval, Audio Signal Processing, Feature Extraction

Not available for purchase in India

Course Description

The course introduces audio signal processing concepts motivated by examples from MIR research. More specifically students will learn about spectral analysis and time-frequency representations in general, monophonic pitch estimation, audio feature extraction, beat tracking, and tempo estimation.

Reviews

schedule

This course is in adaptive mode and is open for enrollment. Learn more about adaptive courses here.

Session 1: Time, Frequency, and Sinusoids (July 30, 2024)

In this session, we will cover Phasors, Sinusoids, and Complex Numbers.

11 lessons

1. Welcome

2. About, Background, and Learning Outcomes

3. MIR History and Tasks

4. Importance of DSP, Digital Audio Recordings and Time Domain Waveforms, Sampling and Quantization

5. Pitch, Time, Music Notation, and Time-Frequency Representations

6. Spectrum and Spectrograms

7. Sinusoids

8. Sound of Tuning Fork, Physics of Sound Projection, LTI Systems (Premium Exclusive)

9. Measuring Amplitude, Frequency and Phase of Sinusoids (Premium Exclusive)

10. Phasors and Complex Numbers (Premium Exclusive)

11. DSP Concepts Using Phasors (Premium Exclusive)

Session 2: DFT and Time-Frequency Representations (August 6, 2024)

In This session, we will learn about Sampling, Quantization, RMS, and Loudness. We will also cover DFT, Hilbert Spaces, and Spectrograms.

10 lessons

1. Welcome and Overview (Premium Exclusive)

2. A Geometric View of Frequency Representations (Premium Exclusive)

3. Fourier Series (Premium Exclusive)

4. The Discrete Fourier Transform and the FFT (Premium Exclusive)

5. Understanding the Basis Functions, Magnitude and Phase Spectrum (Premium Exclusive)

6. Plotting the Spectrum and Interpreting it (Premium Exclusive)

7. Windowing, The Short-Time Fourier Transform, and Spectrograms (Premium Exclusive)

8. Filters (Premium Exclusive)

9. Amplitude in dB, Loudness (Premium Exclusive)

10. Summary (Premium Exclusive)

Session 3: Monophonic Pitch Detection (August 13, 2024)

Pitch vs Fundamental Frequency, Time-domain, Frequency-domain, Perceptual Models, Overview of applications (Query-by-Humming, Auto-tunining) will be covered in this session.

8 lessons

1. Welcome and Overview (Premium Exclusive)

2. Pitch and Fundamental Frequency (Premium Exclusive)

3. Time-Domain Pitch Extraction Using Zero-Crossings (Premium Exclusive)

4. Frequency-Domain Pitch Extracting Using Magnitude Spectra (Premium Exclusive)

5. Autocorrelation and Average Magnitude Difference Function (Premium Exclusive)

6. Perceptually Informed Hearing Models (Premium Exclusive)

7. Query-by-Humming (Premium Exclusive)

8. Auto-Tuning (Premium Exclusive)

Session 4: Audio Feature Extraction (August 20, 2024)

We will go over Spectral Features, Mel-Frequency Cepstral Coefficients, temporal aggregation, chroma and pitch profiles.

8 lessons

1. Welcome and Overview (Premium Exclusive)

2. State Space Representations for Music Tracks (Premium Exclusive)

3. Introduction to Audio Features (Premium Exclusive)

4. Frequency and Temporal Summarization (Premium Exclusive)

5. Spectral Descriptors and MFCCs (Premium Exclusive)

6. Temporal Summarization (Premium Exclusive)

7. Pitch Histograms and Chroma Vectors (Premium Exclusive)

8. Summary (Premium Exclusive)

Session 5: Rhythm Analysis (August 27, 2024)

This session is about Tempo estimation, beat tracking, drum transcription, pattern detection.

8 lessons

1. Overview (Premium Exclusive)

2. Rhythm Analysis Terminology (Premium Exclusive)

3. Tempo Estimation (Premium Exclusive)

4. Beat Tracking (Premium Exclusive)

5. Beat Strength and Rhythm Features (Premium Exclusive)

6. Drum Transcription and Pattern Analysis (Premium Exclusive)

7. Multi-Modal Real-Time Beat Tracking (Premium Exclusive)

8. Summary (Premium Exclusive)

Read More Read Less

Learning Outcomes

Below you will find an overview of the Learning Outcomes you will achieve as you complete this course.

Spectral Analysis

• Understanding of the ideas, notation and intuition behind the short-time Fourier Transform (STFT) arguably the most fundamental technique in audio signal processing.

• Understanding of the general concept of a time-frequency representations and how audio features are computed from such representations.

• Ability to discuss how spectral analysis and audio features are used in MIR tasks such as audio classification, tagging, and recommendation.

Pitch Detection

• Understanding of various types of monophonic pitch detection algorithms based on time-domain, frequency-domain and perceptual modeling.

• Ability to illustrate how pitch detection can be used in applications such as query-by-humming and auto-tuning.

Rhythmic Analysis

• Understanding of the terminology used to characterize rhythm in music as well as concepts used in rhythm analysis by computers such as onsets, onset strength function, and inter-onset intervals.

• Understanding of the fundamental ideas behind rhythm related MIR tasks such as tempo estimation, beat tracking, rhythm features, swing analysis, and drum transcription.

Instructors And Guests

George Tzanetakis

instructor

gtzan@ieee.org

George Tzanetakis is a Professor in the Department of Computer Science with cross-listed appointments in ECE and Music at the University of Victoria, Canada. He is the Canada Research Chair (Tier II) in the Computer Analysis of Audio and Music and received the Craigdarroch research award in artistic expression at the University of Victoria in 2012. In 2011 he was Visiting Faculty at Google Research. He received his PhD in Computer Science at Princeton University in 2002 and was a Post-Doctoral fellow at Carnegie Mellon University in 2002-2003. His research spans all stages of audio content analysis such as feature extraction, segmentation, classification with specific emphasis on music information retrieval. He is also the primary designer and developer of Marsyas an open source framework for audio processing with specific emphasis on music information retrieval applications. His pioneering work on musical genre classification received a IEEE signal processing society young author award and is frequently cited. More recently he has been exploring new interfaces for musical expression, music robotics, computational ethnomusicology, and computer-assisted music instrument tutoring.

View More View Less

What You Need to Take This Course

Prior Knowledge

Good knowledge of programming, basic linear algebra, probability, and statistics.

Equipment

Computer with installation privileges.

Software

The course is mostly software agnostic but existing frameworks for MIR and audio will be used. All software will be freely available and typically also open source. Examples include: Audacity, Marsyas, Sonic Visualizer, and VAMP plugins.

Open for Enrollment

Extracting Information From Music Signals

Would you like to enroll?

Enrollment has closed

This course is in adaptive mode and is open for enrollment. Learn more about adaptive courses here.

OH NO!

OH NO!

Starting Soon

Hang Tight!