+1-302-831-1356 rjw@udel.edu

Coming Soon: Manuscript in Review

Clustering of Circular Consensus Sequences (C3S) for Long Amplicon Analysis of PacBio Data

Overview: To improve and extend the functionality of the long amplicon analysis (LAA) module from PacBio, we restructured the clustering approach for LAA based on the high quality circular consensus sequence data, grouping these reads based on the primer sequences used to amplify the DNA and the molecular barcodes used to track individuals. This directs error correction and consensus sequence analysis to be performed on sequences that should be the same, from a given sample, leading to improved accuracy of the outputted data. In addition, integration of Minimus (Sommer et al. BMC Bioinformatics, 2007 8:64) for automated assembly of any overlapping amplicon consensus sequences allows for efficient processing of tiled amplicon resequence data.

Availability: ThermoAlign is released under an MIT open source license at Github (https://github.com/drmaize/C3S-LAA).

Developers: Felix Francis, Randy Wisser

Funding: This material is based upon work supported by the U.S. National Science Foundation Plant Genome Research Program IOS-1127076.