STACKS2 using SNAKEMAKE Workflow
RADseq workflow using STACKS2 This was designed to process RADseq data from RESERVEBENEFIT project.
Table of contents
- Introduction
- Installation
- Prerequisite
- Data Files
- Set up
- Reporting bugs
- Running the pipeline
- Filter raw data
- Georeferenced sequences alignments by species
- Species sequence pairwise comparison
- Genetic Diversity calculation
- Statistical analysis
1. Introduction
blablabla
2. Installation
2.1 Prerequisite
You must install the following softwares and packages :
-
SNAKEMAKE 5.3.0
- Check version and if the program is correctly installed by typing :
snakemake --version
## should give you the output
5.3.0
-
STACKS 2.0b
- Check version and if programs are correctly installed by typing :
process_radtags --version
clone_filter --version
gstacks --version
populations --version
## should give you the output
2.0b
-
BWA 0.7.17
- Download
bwa
at: http://sourceforge.net/projects/bio-bwa/files/
- Download
tar -xvf bwa-x.x.x.tar.bz2
cd bwa-x.x.x
./configure --prefix=/where/to/install
make
make install
* Check version and if programs are correctly installed by typing :
bwa
## should give you the output
Program: bwa (alignment via Burrows-Wheeler transformation)
Version: 0.7.17-r1188
...
-
- Download
htslib
andsamtools
at : http://www.htslib.org/download/ - Building each desired package from source is very simple:
cd htslib-1.x ./configure --prefix=/where/to/install make make install cd .. ## and similarly for samtools : cd samtools-1.x ./configure --prefix=/where/to/install make make install
- Check version and if programs are correctly installed by typing :
- Download
samtools --version
## should give you the output
samtools 1.9
Using htslib 1.9
Copyright (C) 2018 Genome Research Ltd.
2.2 Data Files
The included data files are :
2.3 Set Up
clone the project and switch to the main folder, it's your working directory
git clone dzdzdzworldmap_fish_genetic_diversity.git
cd snakemake_stacks2
4. Reporting bugs
If you're sure you've found a bug — e.g. if one of my programs crashes with an obscur error message, or if the resulting file is missing part of the original data, then by all means submit a bug report.
I use GitLab's issue system as my bug database. You can submit your bug reports there. Please be as verbose as possible — e.g. include the command line, etc
4. Running the pipeline
Quickstart
- open a shell
- make a folder, name it yourself, I named it workdir
mkdir workdir
cd workdir
- clone the project and switch to the main folder, it's your working directory
git clone
cd snakemake_stacks2
WORK IN PROGRESS !!!!
that's it ! The pipeline is running and crunching your data. Look for the log folder output folder after the pipeline is finished.