Learn practical skills, build real-world projects, and advance your career

Introduction to Audio Processing for Machine Learning

Use the 'Clone' button if you want to run this notebook on a local/cloud machine, or use the 'Run' button to run it on BinderHub or Kaggle.

Data dowload & exploration

We'll use a sample audio dataset from the Open Speech & Language Resouces (http://www.openslr.org/resources.php) for our analysis. We begin by downloading our data.

Next, let's download and unzip the data:

# Only on Linux, Mac and Windows WSL
!rm -rf openslr-sample.tgz openslr-sample
!wget https://www.dropbox.com/s/u5lkq2f64ljw7au/openslr-sample.tgz?dl=1 -O openslr-sample.tgz
!tar -zxf openslr-sample.tgz
!rm openslr-sample.tgz
--2019-06-16 10:11:23-- https://www.dropbox.com/s/u5lkq2f64ljw7au/openslr-sample.tgz?dl=1 Resolving www.dropbox.com (www.dropbox.com)... 162.125.81.1 Connecting to www.dropbox.com (www.dropbox.com)|162.125.81.1|:443... connected. HTTP request sent, awaiting response... 301 Moved Permanently Location: /s/dl/u5lkq2f64ljw7au/openslr-sample.tgz [following] --2019-06-16 10:11:23-- https://www.dropbox.com/s/dl/u5lkq2f64ljw7au/openslr-sample.tgz Reusing existing connection to www.dropbox.com:443. HTTP request sent, awaiting response... 302 Found Location: https://ucc39bb4954b693fd7d64c4f1498.dl.dropboxusercontent.com/cd/0/get/Ai6PgSGZtJcxBK-E4eTWFhpmWi3LjzJp8dbGZGrbWT5qwh47tkwxrtfyz3EN_kj2iSC5H1GZao6aSzPWAOy6KehZVGhhUmb5qx-6fUiKmVsZAg/file?dl=1# [following] --2019-06-16 10:11:24-- https://ucc39bb4954b693fd7d64c4f1498.dl.dropboxusercontent.com/cd/0/get/Ai6PgSGZtJcxBK-E4eTWFhpmWi3LjzJp8dbGZGrbWT5qwh47tkwxrtfyz3EN_kj2iSC5H1GZao6aSzPWAOy6KehZVGhhUmb5qx-6fUiKmVsZAg/file?dl=1 Resolving ucc39bb4954b693fd7d64c4f1498.dl.dropboxusercontent.com (ucc39bb4954b693fd7d64c4f1498.dl.dropboxusercontent.com)... 162.125.81.6 Connecting to ucc39bb4954b693fd7d64c4f1498.dl.dropboxusercontent.com (ucc39bb4954b693fd7d64c4f1498.dl.dropboxusercontent.com)|162.125.81.6|:443... connected. HTTP request sent, awaiting response... 200 OK Length: 10463024 (10.0M) [application/binary] Saving to: 'openslr-sample.tgz' openslr-sample.tgz 100%[===================>] 9.98M 7.58MB/s in 1.3s 2019-06-16 10:11:26 (7.58 MB/s) - 'openslr-sample.tgz' saved [10463024/10463024]

We can listen to audio files directly within Jupyter using a display widget.