In [ ]:
%matplotlib inline
from matplotlib import pyplot as plt
import numpy as np
from sklearn import datasets
from sklearn.decomposition import PCA
In [ ]:
homolog-1.pdb and homolog-2.pdb into separate numpy arrays.
In [ ]:
x vs. y for the two proteins on the same graph.
In [ ]:
sklearn.decomposition.PCA on each individual set of coordinates and then transform them individually onto their PCA axes.
In [ ]:
In [ ]:
In [ ]:
You are studying a mixed population of C. elegans worms and would like to figure out how many classes of worms are present. You measure 10 different features (things like worm length, fecundity, etc.) for 50,000 individual. You have a dataset in pca_dataset.csv, with the parameters in columns the top (numbered 0 to 9) and the individuals in rows.
In [ ]:
In [ ]:
In [ ]: