Core/Pan genome calculation
One main feature of EDGAR is the fast calculation of the genomic subsets of the core and pan genome. The interfaces and results for this features are identical.
The core genome as well as the pan genome calculation require the selection of one reference genome in the left selection list and the selection of comparison genomes or sets in the right selection list. Results are presented in tabular form and the reference genome will appear in the first column of the result table.
The result table show one set of orthologous genes per line. For each gene the locus tag and a functional description is displayed. If no ortholog for a certain gene is found in other genomes a "-" will be shown in the table cell. At the end of each line there are links to generate multiple alignments of the genes within the line either based on DNA or amino acid sequence. Such alignments can be created also for the upstream region, the 400bp prior to the translation start site. The pan genome can be saved as multiple FASTA file (DNA or protein sequence) or as a TAB separated flat file of locus tags and functional descriptions. The locus tag of each gene is a link to the Comparative Viewer for the respective gene set.
The results can be downloaded as TAB separated or as FASTA files.