FAQ

What is the FASTA sequence format?

A sequence in FASTA format begins with a single-line description, followed by lines of sequence data. The description line is distinguished from the sequence data by a greater-than (">") symbol in the first column. It is recommended that all lines of text be shorter than 80 characters.

Examples of a nucleotide sequence and a protein sequence in FASTA format:

>SPT00000014_1.0 # Selenoprotein H (SelH) # Homo sapiens
ATGGCTCCCCGCGGGAGGAAGCGTAAGGCTGAGGCCGCGGTGGTCGCCGTAGCCGAGAAG
CGAGAGAAGCTGGCGAACGGCGGGGAGGGAATGGAGGAGGCGACCGTTGTTATCGAGCAT
TGCACTAGCTGACGCGTCTATGGGCGCAACGCCGCGGCCCTGAGCCAGGCGCTGCGCCTG
GAGGCCCCAGAGCTTCCAGTAAAGGTGAACCCGACGAAGCCCCGGAGGGGCAGCTTCGAG
GTGACGCTGCTGCGCCCGGACGGCAGCAGTGCGGAGCTCTGGACTGGGATTAAGAAGGGG
CCCCCACGCAAACTCAAATTCCCTGAGCCTCAAGAGGTGGTGGAAGAGTTGAAGAAGTAC
CTGTCGTAG

>SPP00000014_1.0 # Selenoprotein H (SelH) # Homo sapiens
MAPRGRKRKAEAAVVAVAEKREKLANGGEGMEEATVVIEHCTSURVYGRNAAALSQALRL
EAPELPVKVNPTKPRRGSFEVTLLRPDGSSAELWTGIKKGPPRKLKFPEPQEVVEELKKY
LS%

Note the correspondence between the nucleotide and protein codes: the in-frame TGA codon corresponds to U (selenocysteine), and the termination codon TAG corresponds to %. The encoding of termination codons in SelenoDB is described in more detail elsewhere on the site.
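The format rules and the codon correspondence described above can be sketched in a few lines of Python. This is a minimal illustration, not SelenoDB's own tooling: the function names `parse_fasta` and `translate_selenoprotein` are hypothetical, the standard genetic code is assumed, and every non-terminal TGA is assumed to encode selenocysteine (U), which only makes sense for selenoprotein coding sequences. Only the two mappings shown in the example (in-frame TGA to U, terminal TAG to %) are implemented; other stop codons are left as `*` because their SelenoDB symbols are not given here.

```python
from itertools import product


def parse_fasta(text):
    """Return a list of (description, sequence) tuples from FASTA text."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A ">" in the first column starts a new record.
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:].strip(), []
        elif header is not None:
            # Sequence data may span several lines; join them later.
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


# Standard genetic code, built in TCAG order ("*" marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_selenoprotein(nucleotides):
    """Translate a coding sequence using the two mappings from the example.

    Assumption: a TGA before the last codon is read as U, and a TAG in the
    last position is written as %. Everything else follows the standard
    table; unknown codons become "X".
    """
    seq = nucleotides.upper()
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    protein = []
    for index, codon in enumerate(codons):
        is_last = index == len(codons) - 1
        if codon == "TAG" and is_last:
            protein.append("%")
        elif codon == "TGA" and not is_last:
            protein.append("U")
        else:
            protein.append(CODON_TABLE.get(codon, "X"))
    return "".join(protein)


# Hypothetical short record, not a real SelenoDB entry.
example = """>demo_1.0 # hypothetical record
ATGTGCACTAGCTGA
CGCTAG
"""
for description, sequence in parse_fasta(example):
    print(description, translate_selenoprotein(sequence))
```

Applied to the SelH nucleotide record above, the same function should reproduce the protein record codon by codon, including the U at the in-frame TGA and the trailing %.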
© 2007-2025 SelenoDB
- A public source of selenium -