SelenoDB
Release 2.0
FAQ

What is the FASTA sequence format?

A sequence in FASTA format begins with a single-line description, followed by lines of sequence data. The description line is distinguished from the sequence data by a greater-than (">") symbol in the first column. It is recommended that all lines of text be shorter than 80 characters. Examples of nucleotide and protein sequences in FASTA format are:

        >SPT00000014_1.0 # Selenoprotein H (SelH) # Homo sapiens
        ATGGCTCCCCGCGGGAGGAAGCGTAAGGCTGAGGCCGCGGTGGTCGCCGTAGCCGAGAAG
        CGAGAGAAGCTGGCGAACGGCGGGGAGGGAATGGAGGAGGCGACCGTTGTTATCGAGCAT
        TGCACTAGCTGACGCGTCTATGGGCGCAACGCCGCGGCCCTGAGCCAGGCGCTGCGCCTG
        GAGGCCCCAGAGCTTCCAGTAAAGGTGAACCCGACGAAGCCCCGGAGGGGCAGCTTCGAG
        GTGACGCTGCTGCGCCCGGACGGCAGCAGTGCGGAGCTCTGGACTGGGATTAAGAAGGGG
        CCCCCACGCAAACTCAAATTCCCTGAGCCTCAAGAGGTGGTGGAAGAGTTGAAGAAGTAC
        CTGTCGTAG
        >SPP00000014_1.0 # Selenoprotein H (SelH) # Homo sapiens
        MAPRGRKRKAEAAVVAVAEKREKLANGGEGMEEATVVIEHCTSURVYGRNAAALSQALRL
        EAPELPVKVNPTKPRRGSFEVTLLRPDGSSAELWTGIKKGPPRKLKFPEPQEVVEELKKY
        LS%
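
The layout described above (a ">" description line followed by one or more lines of sequence data) can be read with a few lines of Python. This is only a minimal sketch: the function name parse_fasta and the toy records are made up for illustration, and it is not the parser used by SelenoDB. It tolerates leading whitespace, so an indented listing like the one above is also accepted.

```python
from typing import Dict, Iterable


def parse_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Map each description line (without the leading '>') to its sequence.

    Hypothetical helper for illustration only.
    """
    records: Dict[str, str] = {}
    header = None
    for raw in lines:
        line = raw.strip()  # drop indentation and the trailing newline
        if not line:
            continue  # ignore blank lines
        if line.startswith(">"):
            # A new record starts at every description line.
            header = line[1:].strip()
            records[header] = ""
        elif header is not None:
            # Sequence data may span several lines; join them.
            records[header] += line
    return records


# Toy input (not real SelenoDB entries).
toy = [
    ">nuc_example # toy nucleotide record",
    "ATGTGA",
    "CTGTAG",
    ">prot_example # toy protein record",
    "MUL%",
]
records = parse_fasta(toy)
```

Here records maps each description line to the joined sequence, so multi-line sequence data comes back as a single string per record.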
        
Note the correspondence between the nucleotide and protein codes, in particular between the in-frame TGA codon and U (selenocysteine), and between the termination codon TAG and %. Learn more on the encoding of termination codons in SelenoDB here.
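
The correspondence can be checked by splitting the nucleotide sequence into codons and pairing each codon with the protein symbol at the same position. The sketch below assumes one protein symbol per codon, terminator included, as in the SelH example above. The function name special_codons is hypothetical, and only the two symbols mentioned in this entry (U and %) are looked for.

```python
from typing import List, Tuple

# Sequences copied from the SelH example in this entry.
NUCLEOTIDE = (
    "ATGGCTCCCCGCGGGAGGAAGCGTAAGGCTGAGGCCGCGGTGGTCGCCGTAGCCGAGAAG"
    "CGAGAGAAGCTGGCGAACGGCGGGGAGGGAATGGAGGAGGCGACCGTTGTTATCGAGCAT"
    "TGCACTAGCTGACGCGTCTATGGGCGCAACGCCGCGGCCCTGAGCCAGGCGCTGCGCCTG"
    "GAGGCCCCAGAGCTTCCAGTAAAGGTGAACCCGACGAAGCCCCGGAGGGGCAGCTTCGAG"
    "GTGACGCTGCTGCGCCCGGACGGCAGCAGTGCGGAGCTCTGGACTGGGATTAAGAAGGGG"
    "CCCCCACGCAAACTCAAATTCCCTGAGCCTCAAGAGGTGGTGGAAGAGTTGAAGAAGTAC"
    "CTGTCGTAG"
)
PROTEIN = (
    "MAPRGRKRKAEAAVVAVAEKREKLANGGEGMEEATVVIEHCTSURVYGRNAAALSQALRL"
    "EAPELPVKVNPTKPRRGSFEVTLLRPDGSSAELWTGIKKGPPRKLKFPEPQEVVEELKKY"
    "LS%"
)


def special_codons(nucleotide: str, protein: str) -> List[Tuple[int, str, str]]:
    """Return (1-based position, codon, symbol) wherever the symbol is U or %."""
    codons = [nucleotide[i:i + 3] for i in range(0, len(nucleotide), 3)]
    if len(codons) != len(protein):
        # Assumption: the protein line carries one symbol per codon.
        raise ValueError("expected one protein symbol per codon")
    return [
        (position, codon, symbol)
        for position, (codon, symbol) in enumerate(zip(codons, protein), start=1)
        if symbol in "U%"
    ]


hits = special_codons(NUCLEOTIDE, PROTEIN)
# hits == [(44, 'TGA', 'U'), (123, 'TAG', '%')]
```

For the SelH pair, the in-frame TGA at codon 44 lines up with U, and the final TAG at codon 123 lines up with %.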