16.01.2015 Views

Package 'WGCNA' - Laboratory Web Sites - UCLA

Package 'WGCNA' - Laboratory Web Sites - UCLA

Package 'WGCNA' - Laboratory Web Sites - UCLA

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

50 goodSamples<br />

useGenes<br />

minFraction<br />

minNSamples<br />

minNGenes<br />

verbose<br />

indent<br />

optional specifications of genes for which to perform the check. Should be a<br />

logical vector; genes whose entries are FALSE will be ignored. Defaults to<br />

using all genes.<br />

minimum fraction of non-missing samples for a gene to be considered good.<br />

minimum number of good samples for the data set to be considered fit for analysis.<br />

If the actual number of good samples falls below this threshold, an error<br />

will be issued.<br />

minimum number of non-missing samples for a sample to be considered good.<br />

integer level of verbosity. Zero means silent, higher values make the output<br />

progressively more and more verbose.<br />

indentation for diagnostic messages. Zero means no indentation, each unit adds<br />

two spaces.<br />

Details<br />

Value<br />

The constants ..minNSamples and ..minNGenes are both set to the value 4. For most data sets,<br />

the fraction of missing samples criterion will be much more stringent than the absolute number of<br />

missing samples criterion.<br />

A list with one component per input set. Each component is a logical vector with one entry per<br />

sample in the corresponding set, indicating whether the sample passed the missing value criteria.<br />

Author(s)<br />

See Also<br />

Peter Langfelder and Steve Horvath<br />

goodGenes, goodSamples, goodSamplesGenes for cleaning individual sets separately;<br />

goodGenesMS, goodSamplesGenesMS for additional cleaning of multiple data sets together.<br />

goodSamples<br />

Filter samples with too many missing entries<br />

Description<br />

Usage<br />

This function checks data for missing entries and returns a list of samples that pass two criteria on<br />

maximum number of missing values: the fraction of missing values must be below a given threshold<br />

and the total number of missing genes must be below a given threshold.<br />

goodSamples(datExpr,<br />

useSamples = NULL,<br />

useGenes = NULL,<br />

minFraction = 1/2,<br />

minNSamples = ..minNSamples,<br />

minNGenes = ..minNGenes,<br />

verbose = 1, indent = 0)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!