bdgenomics.mango.genotypes.VariantsPerSampleDistribution

class bdgenomics.mango.genotypes.VariantsPerSampleDistribution(ss, genotypeDataset, sample=1.0)[source]

VariantsPerSampleDistribution class. VariantsPerSampleDistribution computes a distribution of the count of variants per sample.

__init__(ss, genotypeDataset, sample=1.0)[source]

Initializes a VariantsPerSampleDistributionn class. Computes the coverage distribution of a CoverageDataset. This dataset can have data for multiple samples.

Args:
param ss:global SparkSession.
param genotypeDataset:
 bdgenomics.adam.ds.GenotypeDataset
param sample:Fraction to sample GenotypeDataset. Should be between 0 and 1

Methods

__init__(ss, genotypeDataset[, sample]) Initializes a VariantsPerSampleDistributionn class.
plotDistributions([normalize, cumulative, …]) Plots final distribution values and returns the plotted distribution as a Counter object.

Attributes

pre_sampled
rdd
sample
seed
ss