Create survival plots and p-values for clinical and mutational phenotypes.
This document describes genome music survival version 0.04 (2013-05-14 at 16:03:04)
genome music survival --bam-list=? --output-dir=? [--maf-file=?] [--skip-silent] [--genetic-data-type=?] [--numeric-clinical-data-file=?] [--categorical-clinical-data-file=?] [--glm-clinical-data-file=?] [--phenotypes-to-include=?] [--legend-placement=?] [--skip-non-coding]
... music survival \ --bam-list /path/myBamList.tsv \ --maf-file /path/myMAF.tsv \ --numeric-clinical-data-file /path/myNumericData.tsv \ --categorical-clinical-data-file /path/myClassData.tsv \ --output-dir /path/output_directory ... music survival \ --bam-list /path/myBamList.tsv \ --maf-file /path/myMAF.tsv \ --glm-clinical-data-file /path/myGLMClinicalData.tsv \ --output-dir /path/output_directory ... music survival \ --bam-list /path/myBamList.tsv \ --maf-file /path/myMAF.tsv \ --genetic-data-type 'gene' \ --glm-clinical-data-file /path/myGlmClinicalData.tsv \ --phenotypes-to-include 'Race,Gender,TP53' \ --output-dir /path/output_directory
List of sample names to be included in the analysis. (See Description)
Directory where output files will be written
List of mutations in \s-1MAF\s0 format
Skip silent mutations from the provided \s-1MAF\s0 file Default value 'true' if not specified
Correlate clinical data to \*(L"gene\*(R" or \*(L"variant\*(R" level data Default value 'gene' if not specified
Table of samples (y) vs. numeric clinical data category (x)
Table of samples (y) vs. categorical clinical data category (x)
Clinical traits, mutational profiles, other mixed clinical data (See \s-1DESCRIPTION\s0).
Include only these genes and/or phenotypes in the anlaysis. (COMMA-DELIMITED)
Choose one of 'bottomleft', 'topleft', 'topright', or 'bottomright'. Default value 'bottomleft' if not specified
Skip non-coding mutations from the provided \s-1MAF\s0 file Default value 'true' if not specified
This command performs survival analysis and plots survival curves for mutational data, as well as any clinical traits of interest as specified via the --phenotypes-to-include input parameter. The analyses performed include the Kaplan-Meier estimator followed by the Cox Proportional Hazards model. Outputs for each gene/clinical trait analyzed include survival curves, a hazard ratio (with confidence intervals), and P-values and FDRs describing the significance of the difference between survivors and non-survivors.
All clinical data files are searched for the required (case insensitive) \*(L"vital_status\*(R" and \*(L"days_to_last_followup\*(R" columns which are paired to phenotypes via sample IDs for the survival analysis. The first column of all clinical data files \s-1MUST\s0 contain the sample IDs, same as in other MuSiC tools. By default, analysis is performed on every gene present in the \s-1MAF\s0. Optionally, the analysis may be limited to only specific genes by listing them (comma delimited) after the --phenotypes-to-include input parameter. Survival analysis may also be performed on other columns in the clinical data file by adding the column headers to the list of entries specified after the --phenotypes-to-include input parameter.
Here are some general guildelines for creating clinical data input files:
Headers are required.
The first column of each clinical data file must contain sample IDs which match those in both the --bam-list and the \s-1MAF\s0 variant list (in the \s-1MAF\s0, this is the Tumor_Sample_Barcode column, specifically).
In at least one of the clinical data files input, columns with headers \*(L"vital_status\*(R" and \*(L"days_to_last_followup\*(R" (case insensitive) must exist. \*(L"vital_status\*(R" must be delineated by 1's and 0's, where 0 denotes 'living', and 1 denotes 'deceased'.
Note that all input files must be tab-separated.
Copyright (C) 2010-2011 Washington University in St. Louis.
It is released under the Lesser \s-1GNU\s0 Public License (\s-1LGPL\s0) version 3. See the associated \s-1LICENSE\s0 file in this distribution.
Nathan D. Dees, Ph.D. Qunyuan Zhang, Ph.D.
genome-music(1), genome(1)