Open for Enrollment

You can also start immediately after joining!
Join Now
Go at your own pace
5 Sessions / 10 hours of work per session
Price
Premium membership $20/month (Preview session 1 free)
Included w/ premium membership ($20/month)
Skill Level
Expert
Video Transcripts
English
Topics
Music, Machine Learning, Music Information Retrieval, Audio Signal Processing, Feature Extraction

Not available for purchase in India

Open for Enrollment

Extracting Information From Music Signals

Open for Enrollment
You can also start immediately after joining!

Would you like to enroll?

Enrollment for this course has closed. But you can enroll in a future offering (please select)

Enrollment has closed

Go at your own pace
5 Sessions / 10 hours of work per session
Price
Premium membership $20/month (Preview session 1 free)
Included w/ premium membership ($20/month)
Skill Level
Expert
Video Transcripts
English
Topics
Music, Machine Learning, Music Information Retrieval, Audio Signal Processing, Feature Extraction

Not available for purchase in India

Course Description

The course introduces audio signal processing concepts motivated by examples from MIR research. More specifically students will learn about spectral analysis and time-frequency representations in general, monophonic pitch estimation, audio feature extraction, beat tracking, and tempo estimation.

Reviews
schedule

This course is in adaptive mode and is open for enrollment. Learn more about adaptive courses here.

Session 1: Time, Frequency, and Sinusoids (October 8, 2024)
In this session, we will cover Phasors, Sinusoids, and Complex Numbers.
11 lessons
1. Welcome
2. About, Background, and Learning Outcomes
3. MIR History and Tasks
4. Importance of DSP, Digital Audio Recordings and Time Domain Waveforms, Sampling and Quantization
5. Pitch, Time, Music Notation, and Time-Frequency Representations
6. Spectrum and Spectrograms
7. Sinusoids
8. Sound of Tuning Fork, Physics of Sound Projection, LTI Systems (Premium Exclusive)
9. Measuring Amplitude, Frequency and Phase of Sinusoids (Premium Exclusive)
10. Phasors and Complex Numbers (Premium Exclusive)
11. DSP Concepts Using Phasors (Premium Exclusive)
Session 2: DFT and Time-Frequency Representations (October 15, 2024)
In This session, we will learn about Sampling, Quantization, RMS, and Loudness. We will also cover DFT, Hilbert Spaces, and Spectrograms.
10 lessons
1. Welcome and Overview (Premium Exclusive)
2. A Geometric View of Frequency Representations (Premium Exclusive)
3. Fourier Series (Premium Exclusive)
4. The Discrete Fourier Transform and the FFT (Premium Exclusive)
5. Understanding the Basis Functions, Magnitude and Phase Spectrum (Premium Exclusive)
6. Plotting the Spectrum and Interpreting it (Premium Exclusive)
7. Windowing, The Short-Time Fourier Transform, and Spectrograms (Premium Exclusive)
8. Filters (Premium Exclusive)
9. Amplitude in dB, Loudness (Premium Exclusive)
10. Summary (Premium Exclusive)
Session 3: Monophonic Pitch Detection (October 22, 2024)
Pitch vs Fundamental Frequency, Time-domain, Frequency-domain, Perceptual Models, Overview of applications (Query-by-Humming, Auto-tunining) will be covered in this session.
8 lessons
1. Welcome and Overview (Premium Exclusive)
2. Pitch and Fundamental Frequency (Premium Exclusive)
3. Time-Domain Pitch Extraction Using Zero-Crossings (Premium Exclusive)
4. Frequency-Domain Pitch Extracting Using Magnitude Spectra (Premium Exclusive)
5. Autocorrelation and Average Magnitude Difference Function (Premium Exclusive)
6. Perceptually Informed Hearing Models (Premium Exclusive)
7. Query-by-Humming (Premium Exclusive)
8. Auto-Tuning (Premium Exclusive)
Session 4: Audio Feature Extraction (October 29, 2024)
We will go over Spectral Features, Mel-Frequency Cepstral Coefficients, temporal aggregation, chroma and pitch profiles.
8 lessons
1. Welcome and Overview (Premium Exclusive)
2. State Space Representations for Music Tracks (Premium Exclusive)
3. Introduction to Audio Features (Premium Exclusive)
4. Frequency and Temporal Summarization (Premium Exclusive)
5. Spectral Descriptors and MFCCs (Premium Exclusive)
6. Temporal Summarization (Premium Exclusive)
7. Pitch Histograms and Chroma Vectors (Premium Exclusive)
8. Summary (Premium Exclusive)
Session 5: Rhythm Analysis (November 5, 2024)
This session is about Tempo estimation, beat tracking, drum transcription, pattern detection.
8 lessons
1. Overview (Premium Exclusive)
2. Rhythm Analysis Terminology (Premium Exclusive)
3. Tempo Estimation (Premium Exclusive)
4. Beat Tracking (Premium Exclusive)
5. Beat Strength and Rhythm Features (Premium Exclusive)
6. Drum Transcription and Pattern Analysis (Premium Exclusive)
7. Multi-Modal Real-Time Beat Tracking (Premium Exclusive)
8. Summary (Premium Exclusive)
Learning Outcomes

Below you will find an overview of the Learning Outcomes you will achieve as you complete this course.

Instructors And Guests
What You Need to Take This Course

Prior Knowledge

  • Good knowledge of programming, basic linear algebra, probability, and statistics.

Equipment

  • Computer with installation privileges.

Software

  • The course is mostly software agnostic but existing frameworks for MIR and audio will be used. All software will be freely available and typically also open source. Examples include: Audacity, Marsyas, Sonic Visualizer, and VAMP plugins.
Additional Information

PLEASE NOTE: Taking part in a Kadenze course as a Premium Member does not affirm that you have been enrolled or accepted for enrollment by the institution offering this course.

In order to receive college credit for these program courses, you must successfully complete and pass all 3 courses in this program. If a student signs up for the Music Information Retrieval program, it is recommended that these courses are taken sequentially.

*Partial credit will not be awarded for completion of only one course.

Peer Assessment Code of Conduct: Part of what makes Kadenze a great place to learn is our community of students. While you are completing your Peer Assessments, we ask that you help us maintain the quality of our community. Please:

  • Be Polite. Show your fellow students courtesy. No one wants to feel attacked - ever. For this reason, insults, condescension, or abuse will not be tolerated.
  • Show Respect. Kadenze is a global community. Our students are from many different cultures and backgrounds. Please be patient, kind, and open-minded when discussing topics such as race, religion, gender, sexual orientation, or other potentially controversial subjects.
  • Post Appropriate Content. We believe that expression is a human right and we would never censor our students. With that in mind, please be sensitive of what you post in a Peer Assessment. Only post content where and when it is appropriate to do so.

Please understand that posts which violate this Code of Conduct harm our community and may be deleted or made invisible to other students by course moderators. Students who repeatedly break these rules may be removed from the course and/or may lose access to Kadenze.

Students with Disabilities: Students who have documented disabilities and who want to request accommodations should refer to the student help article via the Kadenze support center. Kadenze is committed to making sure that our site is accessible to everyone. Configure your accessibility settings in your Kadenze Account Settings.