Density estimation with density estimation trees
det [-h] [-v] -t string [-f int] [-l string] [-L string] [-M int] [-N int] [-p] [-I] [-T string] [-E string] [-e string] [-r string] [-u string] [-V] [-i string]
This program performs a number of functions related to Density Estimation Trees. The optimal Density Estimation Tree (DET) can be trained on a set of data (specified by --train_file) using cross-validation (with number of folds specified by --folds). In addition, the density of a set of test points (specified by --test_file) can be estimated, and the importance of each dimension can be computed. If class labels are given for the training points (with --labels_file), the class memberships of each leaf in the DET can be calculated.
The created DET can be saved to a file, along with the density estimates for the test set and the variable importances.
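As an illustration of how the options described on this page fit together, here is a minimal Python sketch that assembles a det command line. It uses only option names documented here. The helper name build_det_command and all file names are hypothetical, and the sketch only prints the command rather than running it.

```python
import shlex


def build_det_command(train_file, folds=10, labels_file=None, test_file=None,
                      test_set_estimates_file=None, print_vi=False,
                      vi_file=None, verbose=False):
    """Assemble a det argument list (hypothetical helper, not part of mlpack)."""
    # --train_file is the only required option; --folds defaults to 10.
    cmd = ["det", "--train_file", train_file, "--folds", str(folds)]
    if labels_file:
        cmd += ["--labels_file", labels_file]
    if test_file:
        cmd += ["--test_file", test_file]
    if test_set_estimates_file:
        cmd += ["--test_set_estimates_file", test_set_estimates_file]
    if print_vi:
        cmd.append("--print_vi")
    if vi_file:
        cmd += ["--vi_file", vi_file]
    if verbose:
        cmd.append("--verbose")
    return cmd


# Hypothetical file names, for illustration only.
cmd = build_det_command("train.csv", folds=5, test_file="test.csv",
                        test_set_estimates_file="test_density.csv",
                        print_vi=True, vi_file="importance.txt")
print(shlex.join(cmd))
```

If det is installed and on the PATH, the resulting list could be passed to subprocess.run(cmd, check=True). Keeping the command as a list avoids shell-quoting problems with file names that contain spaces.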
--train_file (-t) [string] The data set on which to build a density estimation tree.
--folds (-f) [int] The number of folds of cross-validation to perform for the estimation (0 means leave-one-out cross-validation, LOOCV). Default value 10.
--help (-h) Default help info.
--info [string] Get help on a specific module or option. Default value ''.
--labels_file (-l) [string] The labels for the given training data, used to generate the class membership of each leaf (as an extra statistic). Default value ''.
--leaf_class_table_file (-L) [string] The file in which to output the leaf class membership table. Default value 'leaf_class_membership.txt'.
--max_leaf_size (-M) [int] The maximum size of a leaf in the unpruned, fully grown DET. Default value 10.
--min_leaf_size (-N) [int] The minimum size of a leaf in the unpruned, fully grown DET. Default value 5.
--print_tree (-p) Print the tree out on the command line (or in the file specified with --tree_file).
--print_vi (-I) Print the variable importance of each feature out on the command line (or in the file specified with --vi_file).
--test_file (-T) [string] A set of test points to estimate the density of. Default value ''.
--test_set_estimates_file (-E) [string] The file in which to output the estimates on the test set from the final optimally pruned tree. Default value ''.
--training_set_estimates_file (-e) [string] The file in which to output the density estimates on the training set from the final optimally pruned tree. Default value ''.
--tree_file (-r) [string] The file in which to print the final optimally pruned tree. Default value ''.
--unpruned_tree_estimates_file (-u) [string] The file in which to output the density estimates on the training set from the large unpruned tree. Default value ''.
--verbose (-v) Display informational messages and the full list of parameters and timers at the end of execution.
--version (-V) Display the version of mlpack.
--vi_file (-i) [string] The file in which to output the variable importance values for each feature. Default value ''.
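To make the meaning of the --folds option concrete, the following Python sketch splits the indices of a training set into cross-validation folds, treating 0 as leave-one-out (one fold per point). The function name fold_indices is hypothetical, and the contiguous, near-equal split is an assumption made for illustration; det may partition the data differently internally.

```python
def fold_indices(n_points, folds=10):
    """Split indices 0..n_points-1 into cross-validation folds.

    Assumption: folds == 0 means leave-one-out, i.e. one fold per point,
    as described for --folds. The contiguous split is illustrative only.
    """
    if n_points <= 0:
        raise ValueError("n_points must be positive")
    k = n_points if folds == 0 else folds
    if not 1 <= k <= n_points:
        raise ValueError("folds must be 0 or between 1 and n_points")

    # Spread the remainder over the first `extra` folds so that
    # fold sizes differ by at most one point.
    base, extra = divmod(n_points, k)
    result = []
    start = 0
    for i in range(k):
        size = base + (1 if i < extra else 0)
        result.append(list(range(start, start + size)))
        start += size
    return result


# With folds=0 every point is held out once, each in its own fold.
print(fold_indices(5, folds=0))
# With 7 points and 3 folds, fold sizes are 3, 2 and 2.
print(fold_indices(7, folds=3))
```

In each round of cross-validation, one fold would be held out for validation and the remaining folds used for training, so leave-one-out requires as many rounds as there are training points.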
For further information, including relevant papers, citations, and theory, consult the documentation found at http://www.mlpack.org or included with your distribution of mlpack.