Open for Enrollment (In Development)Join Now
Open for Enrollment (In Development)
Would you like to enroll?
Enrollment for this course has closed. But you can enroll in a future offering (please select)
Enrollment has closed
Enrollment for this course is currently closed, but the next offering will be available shortly. Check back soon!
The course introduces audio signal processing concepts motivated by examples from MIR research. More specifically students will learn about spectral analysis and time-frequency representations in general, monophonic pitch estimation, audio feature extraction, beat tracking, and tempo estimation.
This course is in scheduled mode. Learn more about scheduled courses here.
Session 1: Overview and Introduction to DSP
In this session, we will cover Phasors, Sinusoids, and Complex Numbers.
Session 2: Time-Frequency Representations
In This session, we will learn about Sampling, Quantization, RMS, and Loudness. We will also cover DFT, Hilbert Spaces, and Spectrograms.
Session 3: Monophonic pitch analysis/autocorrelation
Pitch vs Fundamental Frequency, Time-domain, Frequency-domain, Perceptual Models, Overview of applications (Query-by-Humming, Auto-tunining) will be covered in this session.
Session 4: Audio feature extraction
We will go over Spectral Features, Mel-Frequency Cepstral Coefficients, temporal aggregation, chroma and pitch profiles.
Session 5: Rhythm Analysis
This session is about Tempo estimation, beat tracking, drum transcription, pattern detection.
Below you will find an overview of the Learning Outcomes you will achieve as you complete this course.
• Understanding of the ideas, notation and intuition behind the short-time Fourier Transform (STFT) arguably the most fundamental technique in audio signal processing.
• Understanding of the general concept of a time-frequency representations and how audio features are computed from such representations.
• Ability to discuss how spectral analysis and audio features are used in MIR tasks such as audio classification, tagging, and recommendation.
• Understanding of various types of monophonic pitch detection algorithms based on time-domain, frequency-domain and perceptual modeling.
• Ability to illustrate how pitch detection can be used in applications such as query-by-humming and auto-tuning.
• Understanding of the terminology used to characterize rhythm in music as well as concepts used in rhythm analysis by computers such as onsets, onset strength function, and inter-onset intervals.
• Understanding of the fundamental ideas behind rhythm related MIR tasks such as tempo estimation, beat tracking, rhythm features, swing analysis, and drum transcription.
Instructors & Guests
George Tzanetakis is a Professor in the Department of Computer Science with cross-listed appointments in ECE and Music at the University of Victoria, Canada. He is the Canada Research Chair (Tier II) in the Computer Analysis of Audio and Music and received the Craigdarroch research award in artistic expression at the University of Victoria in 2012. In 2011 he was Visiting Faculty at Google Research. He received his PhD in Computer Science at Princeton University in 2002 and was a Post-Doctoral fellow at Carnegie Mellon University in 2002-2003. His research spans all stages of audio content analysis such as feature extraction, segmentation, classification with specific emphasis on music information retrieval. He is also the primary designer and developer of Marsyas an open source framework for audio processing with specific emphasis on music information retrieval applications. His pioneering work on musical genre classification received a IEEE signal processing society young author award and is frequently cited. More recently he has been exploring new interfaces for musical expression, music robotics, computational ethnomusicology, and computer-assisted music instrument tutoring.
What You Need to Take This Course
- Good knowledge of programming, basic linear algebra, probability, and statistics.
- Computer with installation privileges.
- The course is mostly software agnostic but existing frameworks for MIR and audio will be used. All software will be freely available and typically also open source. Examples include: Audacity, Marsyas, Sonic Visualizer, and VAMP plugins.
PLEASE NOTE: Taking part in a Kadenze course as a Premium Member does not affirm that you have been enrolled or accepted for enrollment by the institution offering this course.
In order to receive college credit for these program courses, you must successfully complete and pass all 3 courses in this program. If a student signs up for the Music Information Retrieval program, it is recommended that these courses are taken sequentially.
*Partial credit will not be awarded for completion of only one course.
Peer Assessment Code of Conduct: Part of what makes Kadenze a great place to learn is our community of students. While you are completing your Peer Assessments, we ask that you help us maintain the quality of our community. Please:
- Be Polite. Show your fellow students courtesy. No one wants to feel attacked - ever. For this reason, insults, condescension, or abuse will not be tolerated.
- Show Respect. Kadenze is a global community. Our students are from many different cultures and backgrounds. Please be patient, kind, and open-minded when discussing topics such as race, religion, gender, sexual orientation, or other potentially controversial subjects.
- Post Appropriate Content. We believe that expression is a human right and we would never censor our students. With that in mind, please be sensitive of what you post in a Peer Assessment. Only post content where and when it is appropriate to do so.
Please understand that posts which violate this Code of Conduct harm our community and may be deleted or made invisible to other students by course moderators. Students who repeatedly break these rules may be removed from the course and/or may lose access to Kadenze.
Students with Disabilities: Students who have documented disabilities and who want to request accommodations should refer to the student help article via the Kadenze support center. Kadenze is committed to making sure that our site is accessible to everyone. Configure your accessibility settings in your Kadenze Account Settings.