Created 4 years ago

Introduction to Audio Processing for Machine Learning

Use the 'Clone' button if you want to run this notebook on a local/cloud machine, or use the 'Run' button to run it on BinderHub or Kaggle.

Data dowload & exploration

We'll use a sample audio dataset from the Open Speech & Language Resouces (http://www.openslr.org/resources.php) for our analysis. We begin by downloading our data.

Next, let's download and unzip the data:

# Only on Linux, Mac and Windows WSL
!rm -rf openslr-sample.tgz openslr-sample
!wget https://www.dropbox.com/s/u5lkq2f64ljw7au/openslr-sample.tgz?dl=1 -O openslr-sample.tgz
!tar -zxf openslr-sample.tgz
!rm openslr-sample.tgz

--2019-06-16 10:11:23--  https://www.dropbox.com/s/u5lkq2f64ljw7au/openslr-sample.tgz?dl=1
Resolving www.dropbox.com (www.dropbox.com)... 162.125.81.1
Connecting to www.dropbox.com (www.dropbox.com)|162.125.81.1|:443... connected.
HTTP request sent, awaiting response... 301 Moved Permanently
Location: /s/dl/u5lkq2f64ljw7au/openslr-sample.tgz [following]
--2019-06-16 10:11:23--  https://www.dropbox.com/s/dl/u5lkq2f64ljw7au/openslr-sample.tgz
Reusing existing connection to www.dropbox.com:443.
HTTP request sent, awaiting response... 302 Found
Location: https://ucc39bb4954b693fd7d64c4f1498.dl.dropboxusercontent.com/cd/0/get/Ai6PgSGZtJcxBK-E4eTWFhpmWi3LjzJp8dbGZGrbWT5qwh47tkwxrtfyz3EN_kj2iSC5H1GZao6aSzPWAOy6KehZVGhhUmb5qx-6fUiKmVsZAg/file?dl=1# [following]
--2019-06-16 10:11:24--  https://ucc39bb4954b693fd7d64c4f1498.dl.dropboxusercontent.com/cd/0/get/Ai6PgSGZtJcxBK-E4eTWFhpmWi3LjzJp8dbGZGrbWT5qwh47tkwxrtfyz3EN_kj2iSC5H1GZao6aSzPWAOy6KehZVGhhUmb5qx-6fUiKmVsZAg/file?dl=1
Resolving ucc39bb4954b693fd7d64c4f1498.dl.dropboxusercontent.com (ucc39bb4954b693fd7d64c4f1498.dl.dropboxusercontent.com)... 162.125.81.6
Connecting to ucc39bb4954b693fd7d64c4f1498.dl.dropboxusercontent.com (ucc39bb4954b693fd7d64c4f1498.dl.dropboxusercontent.com)|162.125.81.6|:443... connected.
HTTP request sent, awaiting response... 200 OK
Length: 10463024 (10.0M) [application/binary]
Saving to: 'openslr-sample.tgz'

openslr-sample.tgz  100%[===================>]   9.98M  7.58MB/s    in 1.3s    

2019-06-16 10:11:26 (7.58 MB/s) - 'openslr-sample.tgz' saved [10463024/10463024]

We can listen to audio files directly within Jupyter using a display widget.