Parse EMBL: Parse an EMBL file to Darwin database format
Paste your EMBL file into the window below, and ParseEMBL
will convert your sequences to Darwin SGML format for use in the Darwin Server.
Here is an example
This code is known to fail when the CDS tag refers to a separate EMBL file.
This code searches for the following tags:
codon_start
translation
CDS
For the reference tag, it tries "gene", "db_xref", and "protein_id", in that order.
The descriptor is taken from "product".
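
The tag lookup described above can be sketched in Python. This is only an illustration and not the ParseEMBL source: the function names are hypothetical, the text of one CDS feature is assumed to be already joined into a single string with the leading "FT" columns removed, and "in that order" is read as "use the first of the three qualifiers that is present".

```python
import re

# Matches one qualifier such as /gene="abc" or /codon_start=1.
QUALIFIER = re.compile(r'/(\w+)=(?:"([^"]*)"|(\S+))')

def parse_qualifiers(feature_text):
    """Collect /key=value qualifiers from the text of one CDS feature."""
    qualifiers = {}
    for key, quoted, bare in QUALIFIER.findall(feature_text):
        value = quoted if quoted else bare
        # Keep only the first occurrence of each qualifier.
        qualifiers.setdefault(key, value)
    return qualifiers

def pick_reference(qualifiers):
    """Return the first available of gene, db_xref, protein_id (assumed fallback)."""
    for key in ("gene", "db_xref", "protein_id"):
        if key in qualifiers:
            return qualifiers[key]
    return None

def pick_descriptor(qualifiers):
    """Return the product qualifier, or None if the feature has none."""
    return qualifiers.get("product")
```

For a feature that has no "gene" qualifier, pick_reference falls through to "db_xref" and then to "protein_id".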
The code extracts the translation and the DNA from the given CDS tag. The DNA is then translated,
the result is compared to the protein sequence, and any inconsistencies are reported.
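
The consistency check can be illustrated with a short Python sketch. It is not the ParseEMBL implementation: the function names are hypothetical, the standard genetic code is assumed, the DNA is assumed to be the already extracted coding sequence (joins and complements resolved), and a trailing stop codon is dropped before the comparison because the "translation" qualifier normally does not include it.

```python
# Standard genetic code, built from the usual TCAG ordering of codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}

def translate(dna, codon_start=1):
    """Translate DNA starting at the 1-based codon_start offset."""
    dna = dna.upper()
    start = codon_start - 1
    protein = []
    # Step through complete codons only; an incomplete tail is ignored.
    for i in range(start, len(dna) - 2, 3):
        # Codons with ambiguous bases translate to X.
        protein.append(CODON_TABLE.get(dna[i:i + 3], "X"))
    return "".join(protein)

def find_inconsistencies(dna, protein, codon_start=1):
    """Return a list of messages describing where the two sequences differ."""
    translated = translate(dna, codon_start).rstrip("*")
    problems = []
    if len(translated) != len(protein):
        problems.append(
            f"length mismatch: translated {len(translated)}, given {len(protein)}"
        )
    for pos, (t, p) in enumerate(zip(translated, protein), start=1):
        if t != p:
            problems.append(f"position {pos}: translated {t}, given {p}")
    return problems
```

An empty list means the translated DNA agrees with the given protein sequence under these assumptions.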
Please email any EMBL files that do not work to:
darwin.comments@inf.ethz.ch