dmr_map2genome
Map DMR/MRs to predefined genomic regions such as TSS, TES, 5dist, etc.
Required Parameters
in_cluster_out_path
: File path that contains all clustered PWMs and uncertain PWMs. Example: ../data/out/sep8_run1.in_bpb3_ranking_path
: File path for bpb3 ranking results. Example: ../data/out/sep8_run1/test_67snps/foreground/test_seq/rankings/.in_fileString4clustered_pwm
: File string or file folder/name used/contained for clustered PWMs, such as quality_assessed_out_seed3.
Required
-inSDFile IN_SORTEDDMR_FILE
or--in_sortedDMR_file IN_SORTEDDMR_FILE
: An exported file from DMR_analysis with BED format, such as the exported file from “dmr_combine_multChrs4rank,” where the columns are chrom, start_position, end_position, DMR_ID, and probability values. Please note that this input file should be sorted by chromosome position before being intersected with predefined genomic regions such as TSS, gene, or TES, etc.-inGRFile IN_GENEREGION_FILE
or--in_geneRegion_file IN_GENEREGION_FILE
: A list of gene region files that is generated by hmst_seq_analyzer, such as “list_region_files.txt,” where the file name and path of genomic region files are listed. These predefined genomic region files will be intersected with the sorted bed file of MR/DMR. Please refer to the demo of “run_hmsq.sh” for how to generate these predefined genomic region/annotation files.-inOFolder IN_OUTFILE_FOLDER
or--in_outFile_folder IN_OUTFILE_FOLDER
: Path of the output file folder for exporting intersected MR/DMR files with each genomic region.-inRefFile IN_REFFLAT_FILE
or--in_refFlat_file IN_REFFLAT_FILE
: Path of the input refFlat reference file, such as an export file “hg19.refFlat_clean_sorted.bed” from “hmst_seq_Analyzer”.
Optional Parameters
-minOlp
or--in_minimum_overlap4bedtools
: Minimum overlap rate in bedtools intersection. The default is 1e-9 or 1 bp overlap, which is a float number between 0 and 1.