With current sequencing technologies the bottleneck is handling of raw data. A single experiment can easily generate terabytes of sequence data. Efficiently and correctly cleaning, filtering and quality control of these sequence reads is the first step for any successful subsequent downstream analysis.