In today's assignment, you'll be constructing a rudimentary pipeline to analyze cell screening data generated at high throughput. These are actual data (Complements of Sara Cherry and David Schultz) and there are many more things to analyze!
In particular, these are results from a high-throughput screen using all FDA approved small-molecules. Cells in each well of a 384-well plate have been treated with the a unique small molecule (one drug per well), and assayed to determine a phenotype/response to the drug exposure.
You have find several files in your assignment folder:
plate*.csv: these are the results files for the cell screening assay that was performed. sampleids.txt: a list of all samples that were assayed
For this assignment, we will be engaged in R completely, so treat your commands in this markdown like you would if you were working in the R environment.
The samples are arrayed in the following way, where maximum and minimum controls are located in the plate:
Remember: the use of term "maximum" (control) indicates fluorescing cells, whereas 'minimum' (control) reflects background fluorescence in the absence of cells (i.e., no response).
In [0]:
In [0]:
In [0]:
In [0]:
In [0]:
In [0]:
In [0]:
In [0]:
In [0]:
In [0]:
In [0]:
In [0]:
In [0]:
%> string1 <- "foo"
%> string2 <- "bar"
%> mypaste <- paste(string1, string2, sep="_")
%> mypaste
[1] "foo_bar"
In [0]:
In [0]:
In [0]:
In [0]:
table_forheatmap <- as.matrix( 1 * (mytable < thresh) )
In [0]:
Rowv=NA Colv=NA
labRow=myrownames
scale="none"
col=c("dark blue", "gold")
revC=T
In [0]:
sampleid,row,col,plate
87234,C,3,1
7134,C,4,1
...
81672,P,22,7
row,col,plate
C,3,1
D,5,2
P,6,7