Skip to content

FASTA Files

Tony Boyles edited this page May 16, 2018 · 1 revision

A FASTA file represents one or more DNA sequences. It is a simple text format, composed of a greater than sign (>) followed by an identifier, followed by a newline, followed by a nucleotide string (actg). For example:

> ID_of_person_A
CCTCAGATCACTCTTTGGCAACGACCCCTCGTCACAATAAARATAGGRGGGCA
ACTAAAGGAAGCTCTACTAGATACAGGAGCAGATGATACAGTATTAGAAGAAC
TRAGTTTACCAGGAAGATGGAAACCAAAAATGATAGGGGGAATTGGAGGTTTT
ATCAAAGTAAGACAGTATGATCAGGTAKCCATAGAAATCTGTGGGCATAAAGC
TGTAGGTACAGTATTAGTAGGACCTACACCAGTCAACATAATTGG
> ID_of_person_B
CCTCAGATCACTCTTTGGCAACGACCCCTCGTCACAATAAARATAGGRGGGCA
ACTAAAGGAAGCTCTACTAGATACAGGAGCAGATGATACAGTATTAGAAGAAC
TRAGTTTACCAGGAAGATGGAAACCAAAAATGATAGGGGGAATTGGAGGTTTT
ATCAAAGTAAGACAGTATGATCAGGTAKCCATAGAAATCTGTGGGCATAAAGC
TGTAGGTACAGTATTAGTAGGACCTACACCAGTCAACATAATTGG

The file extension varies (.FASTA, .FAS, .FA). Any file that doesn't have a csv file extension will be parsed as a FASTA file.

Clone this wiki locally