Qiime2 snakemake pipeline
This project is maintained by attayeb
Snaq is a snakemake pipeline for Microbiome data analsysis using QIIME2.
This pipeline works in Linux, Mac and Windows (Ubuntu on Windows). It also can run using Docker container system.
Mohsen A, Chen Y-A, Allendes Osorio RS, Higuchi C and Mizuguchi K (2022) Snaq: A Dynamic Snakemake Pipeline for Microbiome Data Analysis With QIIME2. Front. Bioinform. 2:893933. doi: 10.3389/fbinf.2022.893933
Install Ubuntu for windows 10 following the instructions in this website
In “Ubuntu bash command line” install and test the pipeline following the same steps mentioned in Linux and Mac.
All these steps should be executed in the terminal (linux and Mac) or (Ubuntu bash command line) in windows,
conda install -n base -c conda-forge mamba
conda activate base
mamba create -c conda-forge -c bioconda -n snakemake snakemake
conda activate snakemake
download the latest release file from this repository and extract it in a new folder, or clone this repository:
docker pull snakemake/snakemake
Download the latest release Source code (zip) from github repository, and extract it, or clone this repository.
docker run --rm -it -v "$PWD":/snaq -w /snaq snakemake/snakemake snakemake -lt
Create a new folder inside data folder use only letters in capital to name it, no spaces. Eg: AB, CONTROL, COHORTONE etc.
--cores <number of cores> --use-conda
, These two parameters are essential to run the analysis.results/AB/AB.qza
. Snakemake will understand to import the data set saved in data/AB
folder to results/AB/AB.qza
artifact file; That will be done in two steps, first a manifest file is created, result/AB/AB_manifest.qza
and then the files listed in that manifest files will be imported to results/AB/AB.qza
.docker pull snakemake/snakemake
will download the required image to run sanakemake.docker run -it snakemake/snakemake
is executed, a container will be built which basically means running a small virtual linux PC inside your host system, and whatever command you send after that will run inside that virtual PC.docker run -it snakemake/snakemake snakemake
then the snakemake software will run inside that created container.If you want changes to remain, you can link a folder from host machine to the container, and the container will save, modify, read from that folder, for that we use -v
parameter in docker command. To map the pipeline folder to a folder inside the container you can:
docker run -it -v c:\snaq:/snaq -w /snaq snakemake/snakemake
To run a basic task in docker then the the command should be like this
docker run --rm -it -v [snaq folder in host system]:/snaq -w /snaq snakemake/snakemake snakemake --use-conda --cores 10 results/AB/AB+bb-t16+fp-f17-r21+dd+cls-silva+rrf10000.zip
is available for testing: can be downloaded from here; download it and extract it in data folder.
Possible targets are:
AB_manifest.tsv
AB.qza
AB+bb-t16+fp-f17-r21+dd+cls-gg_asv.biom
AB+bb-t16+fp-f17-r21+dd+cls-gg+phyloseq.RDS
AB+bb-t16+fp-f17-r21+dd+cls-gg+rrf-d10000+beta_braycurtis.tsv
AB+bb-t16+fp-f17-r21+dd+cls-gg+rrf-d10000+beta_jaccard.tsv
AB+bb-t16+fp-f17-r21+dd+cls-gg+rrf-d10000+manta_tax.tsv
AB+bb-t16+fp-f17-r21+dd+cls-gg+rrf-d10000+manta.tsv
AB+bb-t16+fp-f17-r21+dd+cls-gg+rrf-d10000+otu_tax.biom
AB+bb-t16+fp-f17-r21+dd+cls-gg+rrf-d10000+otu_tax_biom.tsv
AB+bb-t16+fp-f17-r21+dd+cls-gg+rrf-d10000+otu_tax.qza
AB+bb-t16+fp-f17-r21+dd+cls-gg+rrf-d10000.zip
AB+bb-t16+fp-f17-r21+dd+cls-gg_taxonomy.csv
AB+bb-t16+fp-f17-r21+dd+cls-gg_taxonomy.qza
AB+bb-t16+fp-f17-r21+dd+fasttree.nwk
AB+bb-t16+fp-f17-r21+dd+fasttree_rooted.qza
AB+bb-t16+fp-f17-r21+dd+rrf-d10000+alphadiversity.tsv
AB+bb-t16+fp-f17-r21+dd+rrf-d10000+beta_unweightedunifrac.csv
AB+bb-t16+fp-f17-r21+dd+rrf-d10000+beta_unweightedunifrac.qza
AB+bb-t16+fp-f17-r21+dd+rrf-d10000+beta_weightedunifrac.csv
AB+bb-t16+fp-f17-r21+dd+rrf-d10000+beta_weightedunifrac.qza
AB+bb-t16+fp-f17-r21+dd_seq.csv
AB+bb-t16+fp-f17-r21+dd_seq.qza
AB+bb-t16+fp-f17-r21+dd_stats.qza
AB+bb-t16+fp-f17-r21+dd_table.qza
AB+bb-t16+fp-f17-r21+dd_table+rrf-d10000.csv
AB+bb-t16+fp-f17-r21+dd_table+rrf-d10000.qza
AB+bb-t16+fp-f17-r21.qza
For more details please check the paper.