National Cancer Institute, U.S. National Institutes of Health, www.cancer.gov
Division of Cancer Prevention

Biometry Research Group

Statistical Software

Comparative evaluation of two serial gene expression experiments

Stuart G. Baker, 2014


Introduction

This program fits biologically relevant response curves for the comparative analysis of two gene expression experiments that involve the same genes under different scenarios, with at least 12 responses per gene. The program outputs gene pairs whose response curves have biologically relevant shapes, including flat, linear, sigmoid, hockey stick, impulse, and step curves.

Reference

Baker, S.G. Comparative Analysis of Biologically Relevant Response Curves in Gene Expression Experiments: Heteromorphy, Heterochrony, and Heterometry. Microarrays 2014, 3, 39-51.
http://www.mdpi.com/2076-3905/3/1/39

Requirement

Mathematica Version 8 or later.

To run the example in the manuscript

copy all files into some folder called "FOLDER"
start a new Mathematica session
type SetDirectory["FOLDER"]
type << mfit.m
type MFit[datasetm1,datasetm2,NewFitQ->True]



To try it on your own data

type MFit[dataset1,dataset2,options]
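
As an illustration of how options might be passed, the call below uses Mathematica's usual option -> value rule syntax with option names taken from the Options list on this page. It is a sketch only: dataset1 and dataset2 are placeholders that must already be defined in the dataset format described here, and the option values shown are arbitrary examples rather than recommendations.

```mathematica
(* Load the package from the folder holding the downloaded files *)
SetDirectory["FOLDER"];
<< mfit.m

(* dataset1 and dataset2 are placeholders for your own data sets *)
MFit[dataset1, dataset2,
  NewFitQ -> True,          (* run a new fit rather than retrieve a stored one *)
  ModelSetQ -> "reduced",   (* flat, line, sigmoid, double sigmoid curves *)
  ShowPlot -> True]         (* plot gene pairs with biologically relevant curves *)
```

Any option left out of the call keeps the default listed under Options.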



Options

Option (default): description

NewFitQ (True): fits new data (True) or retrieves data from a previous fit (False)
NewNameCORE ("mfit"): name prepended to the data set name for a new fit
StoredNameCORE ("mfit"): name prepended to the data set name for retrieving a fit (usually the same as NewNameCORE)
ModelSetQ ("full+poly"): set of models to fit
  "full+poly" fits flat, line, sigmoid, double sigmoid, generalized double sigmoid, and polynomial curves of degree 3, 5, 7 (requires 7 fitted points; recommended with at least 14 time points)
  "full" fits flat, line, sigmoid, double sigmoid, generalized double sigmoid curves (requires 7 fitted points; recommended with at least 14 time points)
  "reduced" fits flat, line, sigmoid, double sigmoid curves (requires 6 fitted points; recommended with at least 12 time points)
SplitType ("alternating"): "alternating", which fits every other point, or "custom"
SplitFitSet ({1,4,7,10,13}): applies only for SplitType->"custom"; ranked points for fitting the model
ShowPlot (False): shows plots for gene pairs with biologically relevant curves
ShowReport (False): shows a report on gene pairs with biologically relevant curves
ShowSummary (True): shows summary results
ShowRPEAIC (False): shows the relationship of RPE to AIC
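
The fitted-point requirements quoted for ModelSetQ line up with the alternating split, since fitting every other point uses about half of the time points. The snippet below is a small arithmetic sketch, not code from the package. It assumes the alternating split starts at the first ranked point, which this page does not state, and the names nTimes and fitSet are hypothetical.

```mathematica
nTimes = 14;                     (* number of time points, as in the frog example *)
fitSet = Range[1, nTimes, 2];    (* assumed alternating split: {1, 3, 5, 7, 9, 11, 13} *)
Length[fitSet]                   (* 7 fitted points *)
Length[Range[1, 12, 2]]          (* 6 fitted points when there are 12 time points *)
```

Under that assumption, 14 time points give the 7 fitted points needed for "full" and "full+poly", and 12 time points give the 6 needed for "reduced".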


dataset={data, geneid, genename, times, xname, yname, plotname, dataname}, where

data: matrix of responses, where rows indicate genes and columns indicate times
geneid: a list of gene identification numbers corresponding to data rows
genename: a list of gene names corresponding to data rows
times: list of observation times
xname: name of horizontal axis on plot
yname: name of vertical axis on plot
plotname: name of data set for plot
dataname: name of data set for storing files
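
To make the dataset format concrete, here is a minimal sketch that assembles the eight-element list in the order given above. All values are toy data invented for illustration (two genes, 12 times); the gene IDs, gene names, axis labels, and the variable name dataset1 are assumptions, not values from the package.

```mathematica
(* Toy values for illustration only; not real expression data *)
times    = Range[12];                      (* 12 observation times *)
data     = {Table[0.5 t, {t, times}],      (* gene 1: linear response *)
            Table[2.0, {t, times}]};       (* gene 2: flat response *)
geneid   = {101, 102};                     (* made-up gene identification numbers *)
genename = {"geneA", "geneB"};             (* made-up gene names *)

(* Order: data, geneid, genename, times, xname, yname, plotname, dataname *)
dataset1 = {data, geneid, genename, times,
            "time", "expression", "Toy data 1", "toy1"};

Dimensions[data]    (* {2, 12}: rows are genes, columns are times *)
```

A second list built the same way for the other experiment, with the same genes in the same row order, would serve as dataset2.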


Downloads

Download All (zip, 7.57MB)

File name (size): description

mfit.m (M File, 14KB): main program; calls all other programs
mfitpair.m (M File, 14KB): computes results for model pairs
mfitsupport.m (M File, 4KB): support functions
mfitform.m (M File, 6KB): computes heterochrony and heterometry
mfitkey.m (M File, 10KB): model selection
mfitplot.m (M File, 3KB): computes plots
mfitplotoverlay.m (M File, 7KB): overlay plot
mfitplotside.m (M File, 14KB): side-by-side plot
mfitreport.m (M File, 7KB): reports lists of genes by model pair
mfitcore.m (M File, 19KB): core fitting program
mfitdiagnostic.m (M File, 6KB): plots RPE versus AIC
mfitdata.m (M File, 5KB): generates data in the correct format from the following files
mfitrawdata1.m (M File, 9.42MB): raw data for frog example 1 (X. laevis)
mfitrawdata2.m (M File, 9.48MB): raw data for frog example 2 (X. tropicalis)
mfitgenename.m (M File, 262KB): gene names for frog examples


Notes on raw data for example

The files mfitrawdata1.m and mfitrawdata2.m are structured as follows. The 11299 rows correspond to genes. The 126 = 14 x 9 columns are ordered from left to right as 14 times for replicate 1 from sample 1, 14 times for replicate 2 from sample 1, 14 times for replicate 3 from sample 1, then the same three replicates for sample 2, and then the same three replicates for sample 3. The 14 times are 1.5, 5, 7, 9, 13.25, 14.75, 16.25, 18.25, 19.75, 21.75, 24.75, 27.5, 35, 44.5.

The file mfitgenename.m is structured as follows. The first 11299 rows correspond to the genes in the raw data (there are extra rows). The 3 columns are row number, gene ID, and gene name (or nothing if there is no gene name).
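
The column ordering described above can be sketched as an index calculation. The code below is illustrative only and does not read the raw data files: the function name columnIndex and the variables row and blocks are hypothetical, and row is a stand-in vector of column numbers rather than real expression values.

```mathematica
times = {1.5, 5, 7, 9, 13.25, 14.75, 16.25, 18.25, 19.75,
         21.75, 24.75, 27.5, 35, 44.5};   (* the 14 observation times *)

(* Column for a given sample (1-3), replicate (1-3), and time index (1-14) *)
columnIndex[sample_, replicate_, t_] :=
  (sample - 1)*42 + (replicate - 1)*14 + t

columnIndex[1, 1, 1]     (* 1 *)
columnIndex[2, 1, 1]     (* 43 *)
columnIndex[3, 3, 14]    (* 126 *)

(* Stand-in for one row of raw data: the column numbers themselves *)
row = Range[126];
blocks = Partition[Partition[row, 14], 3];   (* sample x replicate x time *)
Dimensions[blocks]       (* {3, 3, 14} *)
blocks[[2, 1, 1]]        (* 43 *)
```

Splitting a row this way makes it straightforward to, for example, average the three replicates of a sample at each time, although this page does not say how the package itself combines replicates.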

Disclaimer

This code is provided "as is", without warranty of any kind, express or implied, including but not limited to the warranties of merchantability, fitness for a particular purpose and noninfringement. In no event shall the NCI or the individual developers be liable for any claim, damages or other liability of any kind. Use of this code by recipient is at recipient's own risk. NCI makes no representations that the use of the code will not infringe any patent or proprietary rights of third parties.

Last updated: March 18, 2014
