dmr_gene_annotation
Cleans reference file and creates genomic region files (TSS, geneBody, TES, 5dist, and intergenic) from the reference.
Required Arguments
-r fileor--referenceFile file: Reference file.-g fileor--genomeFile file: Genome file, with one column containing chromosome names and another column containing the length of each chromosome.-hu y/m/ror--human y/m/r: Is the sample from a human? Useyes/yfor human,mouse/mfor mouse, orrat/rfor rat. If not, it must be from either the mouse (m) or rat (r) genome.-n y/nor--numericalChr y/n: Will you operate with numerical chromosome names or not? Useyes/yif using numerical chromosome names orno/nif using alphanumeric names. The choice must match the chromosome naming convention in thegenomeFile. The default isno.
Optional Arguments
-rmor--removeMir: Remove genes with names starting with ‘Mir’? Useyes/yorno/n. The default isyes.-X: The number of upstream base pairs (bp) for TSS, TES, and genes. The default is 1000.-Y: The number of downstream base pairs (bp) for TSS, TES, and genes. The default is 1000.-M: The number of base pairs (bp) from the gene start site for the 5dist region. The default is 10000.-N: The number of base pairs (bp) from the gene start site for the 5dist region. The default is 1000000.-ior--intergenicBTGenes: Are intergenic regions between gene body regions (yes/y) or between TSS and TES regions (no/n)? The default isyes.-lor--minIntergenicLen: The minimum intergenic region distance. The default is 2000.-xLor--maxIntergenicLen: The maximum intergenic region distance. The default is 10000000.-remor--removeShort: Should regions be removed from TSS, TES, and 5dist regions if the gene body is removed due to its size being less than 0? Useyes/yorno/n. The default isyes.-For--folderOut: The name of the output folder. The default is “out”.