
BioSoftwares

Bioinformatics Tools – Solutions – Services

Archive for the ‘Algorithms’ Category

SOAP2: an improved ultrafast tool for short read alignment

Posted by primoto On August - 5 - 2009
SOAP2 is a significantly improved version of the short oligonucleotide alignment program that both reduces computer memory usage and increases alignment speed at an unprecedented rate. We replaced the seed strategy with a Burrows-Wheeler Transform (BWT) compression index for indexing the reference sequence in main memory. Tested on the whole human genome, the new algorithm reduced memory usage from 14.7 GB to 5.4 GB and improved alignment speed by 20–30 times. SOAP2 is compatible with both single- and paired-end reads, and it now supports multiple text and compressed file formats. A consensus builder has also been developed for consensus assembly and SNP detection from the alignment of short reads to a reference genome.
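To make the idea of a BWT index more concrete, here is a minimal Python sketch of exact read matching by backward search over a BWT of the reference. It is a textbook illustration, not SOAP2's implementation: the function names (`build_bwt`, `build_fm_index`, `find_exact`) are made up for this example, the suffix array is built naively, and mismatches, gaps and paired-end handling are left out entirely.

```python
def build_bwt(text):
    """Return the BWT of text and its suffix array (naive construction)."""
    text += "$"  # sentinel; sorts before A, C, G, T
    sa = sorted(range(len(text)), key=lambda i: text[i:])
    bwt = "".join(text[i - 1] for i in sa)  # character preceding each suffix
    return bwt, sa


def build_fm_index(bwt):
    """Build the two lookup tables used by backward search."""
    counts = {}
    for ch in bwt:
        counts[ch] = counts.get(ch, 0) + 1
    # C[c]: number of characters in the text strictly smaller than c
    C, total = {}, 0
    for ch in sorted(counts):
        C[ch] = total
        total += counts[ch]
    # occ[c][i]: occurrences of c in bwt[:i]
    occ = {ch: [0] * (len(bwt) + 1) for ch in counts}
    for i, ch in enumerate(bwt):
        for c in occ:
            occ[c][i + 1] = occ[c][i] + (1 if c == ch else 0)
    return C, occ


def find_exact(read, reference):
    """Return sorted 0-based positions where read occurs exactly in reference."""
    bwt, sa = build_bwt(reference)
    C, occ = build_fm_index(bwt)
    lo, hi = 0, len(bwt)  # half-open range of suffix-array rows
    for ch in reversed(read):  # extend the match one character to the left
        if ch not in C:
            return []
        lo = C[ch] + occ[ch][lo]
        hi = C[ch] + occ[ch][hi]
        if lo >= hi:
            return []
    return sorted(sa[i] for i in range(lo, hi))


print(find_exact("ACG", "ACGTACGTTACG"))  # [0, 4, 9]
```

Each read character costs two table lookups regardless of reference length, which is why this style of index suits very large references. A practical tool would also compress or sample the tables rather than store them in full as this sketch does.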

OS: Linux

Licence: Freeware

Home page

Download page

Homologous protein families share highly conserved sequence and structure regions that are frequent targets for comparative analysis of related proteins and families. Many protein families, such as the curated domain families in the Conserved Domain Database (CDD), exhibit similar structural cores. To improve accuracy in aligning such protein families, we propose a profile–profile method, CORAL, that aligns individual core regions as gap-free units.

CORAL computes optimal local alignment of two profiles with heuristics to preserve continuity within core regions. We benchmarked its performance on curated domains in CDD, which have pre-defined core regions, against COMPASS, HHalign and PSI-BLAST, using structure superpositions and comprehensive curator-optimized alignments as standards of truth. CORAL improves alignment accuracy on core regions over general profile methods, returning a balanced score of 0.57 for over 80% of all domain families in CDD, compared with the highest balanced score of 0.45 from other methods. Further, CORAL provides E-values to aid in detecting homologous protein families and, by respecting block boundaries, produces alignments with improved ‘readability’ that facilitate manual refinement.
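As a rough illustration of what treating a core region as a gap-free unit can mean, the Python sketch below slides one core block along a target profile without opening gaps and keeps the best-scoring offset. Everything here is an assumption made for illustration: the dot-product column score, the helper names (`profile_from_sequence`, `column_score`, `place_block`) and the one-hot toy profiles are not taken from CORAL, whose actual scoring function and heuristics are not described in this post.

```python
def profile_from_sequence(seq):
    """Toy profile: one column per residue, with all weight on that residue."""
    return [{res: 1.0} for res in seq]


def column_score(col_a, col_b):
    """Dot product of two residue-frequency columns (a stand-in score)."""
    return sum(freq * col_b.get(res, 0.0) for res, freq in col_a.items())


def place_block(block, target):
    """Slide a core block over target with no gaps.

    Returns (best_offset, best_score); offset is None if the block is
    longer than the target profile.
    """
    best_offset, best_score = None, float("-inf")
    for offset in range(len(target) - len(block) + 1):
        # The block's columns stay contiguous: column k always pairs
        # with target column offset + k.
        score = sum(
            column_score(col, target[offset + k]) for k, col in enumerate(block)
        )
        if score > best_score:
            best_offset, best_score = offset, score
    return best_offset, best_score


block = profile_from_sequence("GKS")
target = profile_from_sequence("AAGKSTT")
print(place_block(block, target))  # (2, 3.0)
```

A full profile–profile aligner would chain several such blocks in order and permit gaps only between them; this sketch covers the placement of a single block.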

OS: Windows, Mac OS

Licence: Freeware

Home Page

Download Page

About Us

BioSoftwares is entirely dedicated to providing the latest information on bioinformatics tools, solutions and services. BioSoftwares hosts an up-to-date software database containing the latest versions of freely available applications and complete solutions for sequence analysis and annotation, gene expression clustering, advanced biological information retrieval and text mining. BioSoftwares evaluates the best available solutions in each field of interest with in-depth reviews of each application's functionality and performance. Stay tuned to BioSoftwares for the latest news and trends in applied bioinformatics!