Identify significantly mutated genes.
This document describes gmt music smg version 0.04 (2013-05-14 at 16:03:04)
gmt music smg --gene-mr-file=? --output-file=? [--max-fdr=?] [--skip-low-mr-genes] [--bmr-modifier-file=?] [--processors=?]
... music smg \ --gene-mr-file output_dir/gene_mrs \ --output-file output_dir/smgs
(A \*(L"gene-mr-file\*(R" can be generated using the tool \*(L"music bmr calc-bmr\*(R".)
File with per-gene mutation rates (Created using \*(L"music bmr calc-bmr\*(R")
Output file that will list significantly mutated genes and their p-values
The maximum allowed false discovery rate for a gene to be considered an \s-1SMG\s0 Default value '0.2' if not specified
Skip testing genes with MRs lower than the background \s-1MR\s0 Default value 'true' if not specified
Tab delimited multipliers per gene that modify \s-1BMR\s0 before testing [gene_name bmr_modifier]
Number of processors to use (requires 'foreach' and 'doMC' R packages) Default value '1' if not specified
This script runs R-based statistical tools to identify Significantly Mutated Genes (SMGs), when given per-gene mutation rates categorized by mutation type, and the overall background mutation rates (BMRs) for each of those categories (gene_mr_file, created using \*(L"music bmr calc-bmr\*(R").
P-values and false discovery rates (FDRs) for each gene in gene_mr_file is calculated using three tests: Fisher's Combined P-value test (\s-1FCPT\s0), Likelihood Ratio test (\s-1LRT\s0), and the Convolution test (\s-1CT\s0). For a gene, if its \s-1FDR\s0 for at least 2 of these tests is <= max_fdr, it will be output as an \s-1SMG\s0. Another output file with prefix \*(L"_detailed\*(R" will have p-values and FDRs for all genes.
Qunyuan Zhang, Ph.D. Cyriac Kandoth, Ph.D. Nathan D. Dees, Ph.D.