Comparative genomics study the evolution of gene organization, which in the past years was by the fast growing number of full genome sequences available in public repositories. Most of the available bioinformatics tools for visualization and comparison of genomes are restricted to two or three genomes at a time, which is a significant limitation in the field of comparative genomics. Recently (this month) a new phylogenetic software was announced – Genomicus. It provides functionality of a synteny browser, which can represent and compare unlimited numbers of genomes in a broad phylogenetic view. The tool provides reconstructed ancestral gene organization, in a way that facilitates the interpretation of the analysis data.
The comparative genomics aims to:
- to identify conserved functional regions
- to document differences among these functional sequences as a first step to understand broader biological differences (metabolic, developmental, etc.) between organisms
- to outline the evolutionary events that have interrupted the gene colinearity between the genomes of two species since their closest common ancestry dna.
Data integrated in Genomicus
Genomicus is based on already integrated and publicly available from the Ensembl (Ensembl Phylogenetic Trees). Genomicus utilize just two main types of information from Ensembl:
- gene positions in the respective genomes
- phylogenetic relationships (orthology, paralogy) between genes.
Genomicus then edits Ensembl phylogenetic trees in the following steps:
- Selection of duplication nodes below a defined threshold, that is optimized to increase the synteny between extant genomes.
- Integration of information from Boreoeutheria, Euarchontoglires and Atlantogenata ancestral nodes in existing trees of placental mammals
- Addition of genomic information from some extant species that are not currently referenced in Ensembl (Branchiostoma floridae, Nematostella vectensis and Oikopleura dioica), together with their respective ancestral nodes.
For each of the integrated genomes, best reciprocal protein BLAST comparisons are performed.
Functionality
Genomicus offers two types of data visualization
- PhyloView – a phylogenetic tree visualization, which shows the chosen reference gene in the centre of the display with 15 neighbouring genes on both side. It displays information about as well as orthologs and paralogs of the query gene in their own respective genomic regions. If these neighbouring genes are orthologs or paralogs of genes in the reference species, they are displayed with matching colours. When an analog of the reference gene is duplicated (shown as a red square), the originating specie may appear twice in the screen.
- AlignView – provides an alignments between the genes contained within the genomic region of the reference gene and all their respective orthologs in other species.
In both views, the phylogenetic tree can be edited (expanding, collapsing, hiding, showing chosen nodes) to clarify the view.
Genomicus also displays orthologous conserved non-coding elements at three levels of conservation. In a ddition a link-out functionality to other browsers like Ensembl, UCSC and NCBI is provided.
System Implementation
Genomicus is a web based application, developed with Perl scripts and modules, executed with mod_perl on an Apache2 server. The data is persisted in a MySQL database. The visualization is based on embed inline-SVG drawings in XHTML, in which the JavaScript usage is limited to an information panel retrieved with AJAX calls. The browsers are required to support the Google Chrome Frame extension.
Availability
Genomicus is freely available for online use at http://www.dyogen.ens.fr/genomicus.
The data can be downloaded at ftp://ftp.biologie.ens.fr/pub/dyogen/genomicus.
BioEdit easy-to-use sequence alignment editor and sequence analysis program designed and written by a graduate student who knows how frustrating and time consuming it can be to rely upon word-processors and command-line programs for sequence manipulation. BioEdit is intended to supply a single program that can handle most simple sequence and alignment editing and manipulation functions that researchers are likely to do on a daily basis, as well as a few basic sequences analyses.
NoteExpress is a perfect assistant and information manager for researchers, scholars, students, and librarians. NoteExpress is designed to help you organize research notes and bibliographic references, generate bibliographies automatically, search and capture bibliographic data from Internet with efficiency and ease. NoteExpress is well integrated with Microsoft Word. It can format bibliographies in many popular styles. NoteExpress works the same as many other bibliograhic softwares but with many additional efficient features like:
The FastPCR software is an integrated tools environment that provides comprehensive facilities for designing any kind of PCR primers for standard, long distance, inverse, real-time PCR (LUX and self-reporting), multiplex PCR, group-specific PCR (common primers for given N target sequences), unique PCR (design of specific (unique) PCR primers for each sequence); single primer PCR (design of PCR primers from close located inverted repeat), automatically detecting SSR loci and direct PCR primer design, amino acid sequence degenerate PCR and more. The software consists of a data editor; build-in commands for probe/primers design and build automation tools.
The Microarray Explorer (MAExplorer) is a Java-based data-mining facility for microarray databases. It may be freely downloaded and run as a stand-alone application on your computer. The exploratory data analysis environment provides tools for the data-mining of quantitative expression profiles across multiple microarrays.
Gathering and managing data from diverse and heterogeneous datasets is central to many systems approaches to biology. Systems biologists need to easily identify and gather the information they require and then integrate and analysis the experimental and reference data sets which can come from myriad databases with a wide variety of formats and access methods.
Avogadro is a new, open source molecular editor designed for cross-platform use in computational chemistry, molecular modeling, bioinformatics, materials science, and related areas. It offers flexible rendering and a powerful plugin architecture.
Bioclipse is a free, open source, workbench for chemo- and bioinformatics with powerful editing and visualization capabilities for molecules, sequences, proteins, spectra etc.
Bioconductor is an open source and open development software project to provide tools for the analysis and comprehension of genomic data.