SNP Analysis

Screening the millions of reads that next-generation sequencing produces presents a major challenge when searching for candidate SNPs. Both the Maq and GSNAP algorithms include SNP screening capabilities.

Maq(1) has a two level screening process which searches initially for differences between the reads and the reference sequence. Next a filtering step takes place which sifts the initial results looking for a minimum number of variants of the same class per column of reads, and variants embedded within high quality regions. The results are presented in a comprehensive report. You can also view your results in a Maqview or Tablet.

With GSNAP(2) the SNP analysis takes a different approach looking at both previously reported SNPs as well as new candidates. The user must supply a list of known SNPs as well as the reads and a reference sequence. GSNAP performs a SNP-tolerant alignment of all major and minor alleles. The algorithm enables the minor alleles to be differentiated from mismatches. The results are presented in a comprehensive report. You can also view your results in Tablet.

 

(1) Heng Li, Jue Ruan and Richard Durbin
Mapping short DNA sequencing reads and calling variants using mapping quality scores
Genome Research 2008 18:1851-1858

(2) Thomas D. Wu and Serban Nacu
Fast and SNP-tolerant detection of complex variants and splicing in short reads
Bioinformatics 2010 26: 873-881