pycoQC CLI Usage
PycoQC CLI can generate a beautiful HTML formatted report containing interactive D3.js plots. On top of it, the CLI can also dump summary information in a JSON formated file allowing easy parsing with third party tools.
The report is dynamically generated depending on the information available in the summary file.
CLI Usage
Activate virtual environment
# Using conda here but can also be done with other virtenv managers
conda activate pycoQC
Getting help
pycoQC -h
Usage examples
Basic usage (quiet mode)
pycoQC \
-f ./data/Albacore-1.2.1_basecall-1D-DNA_sequencing_summary.txt.gz \
-o ./results/Albacore-1.2.1_basecall-1D-DNA.html \
--quiet
JSON data output on top of the html report
A json report can be generated on top (or instead) of the html report
It contains a summarized version of the data collected by pycoQC in a structured and easy to parse format
pycoQC \
-f ./data/Guppy-2.1.3_basecall-1D-RNA_sequencing_summary.txt.gz \
-o ./results/Guppy-2.1.3_basecall-1D_RNA.html \
-j ./results/Guppy-2.1.3_basecall-1D_RNA.json \
--quiet
Including guppy barcoding information
pycoQC \
-f ./data/Guppy-2.1.3_basecall-1D-DNA_sequencing_summary.txt.gz \
-b ./data/Guppy-2.1.3_basecall-1D_DNA_barcoding_summary.txt.gz \
-o ./results/Guppy-2.1.3_basecall-1D_DNA_barcode.html \
--quiet
Matching multiple files with a regex and add a title to report
pycoQC \
-f ./data/Albacore*RNA* \
-o ./results/Albacore_all_RNA.html \
--report_title "All RNA runs" \
--quiet
Tweak filtering parameters
- Define reads with a quality higher than 8 and length higher than 200 bases as "pass"
- Discard reads aligned on the calibration standard
- Unset value of any barcode found in less than 10% of the reads
pycoQC \
-f ./data/Albacore-2.1.10_basecall-1D-DNA_sequencing_summary.txt.gz\
-o ./results/Albacore-2.1.10_basecall-1D-DNA.html \
--min_pass_qual 8 \
--min_pass_len 200 \
--filter_calibration \
--min_barcode_percent 10 \
--quiet
Including Alignments information for a Bam file
pycoQC \
-f ./large_data/sample_1_sequencing_summary.txt \
-a ./large_data/sample_1.bam \
-o ./results/Guppy-2.3_basecall-1D_alignment-DNA.html \
-j ./results/Guppy-2.3_basecall-1D_alignment-DNA.json \
--quiet
Advanced configuration with custon json file
Although we recommend to stick to the default parameters, a json formatted configuration file can be provided to tweak the plots. A default configuration file can be generated using:
pycoQC --default_config
To save and edit it redirect the std output to a file and make your changes using your favorite text editor.
To remove a plot from the report, just remove it (or comment it) from the configuration file
The configuration file accept all the arguments of the target plotting functions. For more information refer to the API documentation
pycoQC --default_config > data/pycoQC_config.json
Run pycoQC with --config
option
pycoQC \
-f ./data/Albacore-1.7.0_basecall-1D-DNA_sequencing_summary.txt.gz\
-o ./results/Albacore-1.7.0_basecall-1D-DNA.html \
--config ./data/pycoQC_config.json \
--quiet