Observe Signal Classes

In this notebook, we'll do a very simple thing: Download the various file lists and use Pandas to see how many files we have for each signal classification.

This is just a simple exercise to show you what are in the data

File Lists


In [2]:
import pandas as pd

basic_df=pd.read_csv('public_list_basic_v2_26may_2017.csv')

In [3]:
basic_df.head()


Out[3]:
UUID SIGNAL_CLASSIFICATION
0 b1cc342f-eae4-442b-91de-10c9a444072e narrowband
1 d83f4c12-0f5d-4be0-a268-c27812a126e4 narrowband
2 8add8be2-2fb9-446a-8ff9-e4f4a3931bea narrowband
3 ad580bf4-1568-4f8f-8b58-319fd63c1da3 narrowband
4 4ffe0886-ebe7-4e7a-b209-9f86a1de9896 narrowband

In [5]:
basic_df.groupby(['SIGNAL_CLASSIFICATION']).count()


Out[5]:
UUID
SIGNAL_CLASSIFICATION
narrowband 1000
narrowbanddrd 1000
noise 1000
squiggle 1000

In [6]:
s=requests.get(primary_full).content
basic_df=pd.read_csv(io.StringIO(s.decode('utf-8')))
basic_df.groupby(['SIGNAL_CLASSIFICATION']).count()


Out[6]:
UUID
SIGNAL_CLASSIFICATION
brightpixel 20000
narrowband 20000
narrowbanddrd 20000
noise 20000
squarepulsednarrowband 20000
squiggle 20000
squigglesquarepulsednarrowband 20000

In [7]:
s=requests.get(primary_medium).content
basic_df=pd.read_csv(io.StringIO(s.decode('utf-8')))
basic_df.groupby(['SIGNAL_CLASSIFICATION']).count()


Out[7]:
UUID
SIGNAL_CLASSIFICATION
brightpixel 5000
narrowband 5000
narrowbanddrd 5000
noise 5000
squarepulsednarrowband 5000
squiggle 5000
squigglesquarepulsednarrowband 5000

In [8]:
s=requests.get(primary_small).content
basic_df=pd.read_csv(io.StringIO(s.decode('utf-8')))
basic_df.groupby(['SIGNAL_CLASSIFICATION']).count()


Out[8]:
UUID
SIGNAL_CLASSIFICATION
brightpixel 1000
narrowband 1000
narrowbanddrd 1000
noise 1000
squarepulsednarrowband 1000
squiggle 1000
squigglesquarepulsednarrowband 1000

In [ ]: