Machine Learning for Signal Processing: EE698V (Fall 2020)

Vipul Arora
Department of Electrical Engineering, IIT Kanpur

warning The focus will be on AUDIO signals

Course link: https://hello.iitk.ac.in/course/ee698v

TAs:

Vishal Kumar - vishalku@iitk.ac.in
Sumit Kumar - krsumit@iitk.ac.in
Vikas Kanaujia - kvikas@iitk.ac.in
Adhiraj Banerjee - adhiraj@iitk.ac.in
Swati Singh - swatisn@iitk.ac.in
Akash Anand Apare - aaapare@iitk.ac.in
Sagnik - sagnikm@iitk.ac.in

Course Objectives:

This course aims at introducing the students to machine learning (ML) techniques used for various signal processing applications. There will be spectral processing techniques for analysis and transformation of audio signals. The lectures will focus on mathematical principles, and there will be coding based assignments for implementation. Prior exposure to ML is not required. The course will be focused on applications in audio signal processing, and the theory will be tailored towards that end.

Pre-requisites:

The course will need a strong background in linear algebra and probability theory.

Topics:

Total 40 classes of 50 minutes each

Topic # lectures
Introduction 1
Linear Algebra Refresher 2
Programming Basics1 1
Digital Signal Processing for audio 6
Probability Theory Refresher 3
Machine Learning basics 6
Neural Networks 9
Music Information Retrieval 3
Speech Recognition 3
Other applications in audio processing2 6

1 Python and bash scripting; 2 E.g., acoustic event detection, speaker diarization, music genre classification, auto-tagging, query by humming, melody estimation, etc.

Grading Scheme

  1. Continuous Assessment – 50%
    Assignments, Quizzes
  2. Mid-semester Exam – 20%
    Written and/or oral exam
  3. End-semester Exam – 30%
    Written exam and/or project submission

There may be oral exams or viva via video chat or phone call.

Plagiarism Penalty:
As heavy as possible. Zero-tolerance policy.

References:

This course will take excerpts from some standard books on machine learning and signal processing. But it will largely be based on articles and research papers in ML and SP conferences (e.g., ICASSP, NeurIPS, ICML, Interspeech, ISMIR, etc.) and journals (e.g., IEEE TASLP, JMLR, IEEE PAMI, etc.).

Books:

Articles: