Click on each file name to download it.

File description: The header line is split into several tab-delimited fields. The first field identifies the contig of origin. The second and third fields are the start and stop locations of the conserved non-coding sequence (CNS) within this contig; the coordinates can be determined by adding these values to the first number listed in the header after the chr## information (e.g. 10001142 in the header chr2L-10001142-10196655.alignment.avid.out). The fourth field reports the number of conserved bases, and the fifth the number of CNSSes observed within the CNS. The CNSSes themselves are presented in code in the following lines, each of which begins with [#,#]. The first number in the brackets is the base 4 value of the first five nucleotides of the CNSS, written in decimal, and the second number is the base 4 value of the last nucleotide of the CNSS. The nucleotides have the following values: A=0; C=1; G=2; T=3. So [0,1] converts to aaaaac. Similarly, [16,2] converts to aacaag, because 16 is 00100 in base 4, which reads aacaa. The phylogenetic footprint files MUST be in this format to be processed correctly by PF-Sampler and PF-Searcher.
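The decoding rule described above can be sketched in a few lines of Python. This is only an illustration and is not part of PF-Sampler or PF-Searcher; the function names decode_cnss and contig_offset are hypothetical, and the sketch assumes the first bracketed number is a decimal integer between 0 and 1023 and that the contig name has the form chr##-start-stop followed by a file suffix.

```python
NUCLEOTIDES = "acgt"  # A=0, C=1, G=2, T=3, as listed in the file description


def decode_cnss(prefix_code: int, last_code: int) -> str:
    """Decode a [prefix_code,last_code] pair into a six-nucleotide string.

    Hypothetical helper: prefix_code encodes the first five nucleotides as a
    base 4 number, last_code encodes the sixth nucleotide.
    """
    if not 0 <= prefix_code < 4 ** 5:
        raise ValueError("prefix_code must encode exactly five nucleotides")
    if not 0 <= last_code < 4:
        raise ValueError("last_code must be 0, 1, 2 or 3")
    digits = []
    for _ in range(5):
        # Peel off the least significant base 4 digit on each pass.
        prefix_code, digit = divmod(prefix_code, 4)
        digits.append(NUCLEOTIDES[digit])
    # Digits were collected last-to-first, so reverse them before joining.
    return "".join(reversed(digits)) + NUCLEOTIDES[last_code]


def contig_offset(contig_name: str) -> int:
    """Return the first number after the chr## part of the contig name.

    Assumes a name shaped like chr2L-10001142-10196655.alignment.avid.out.
    """
    return int(contig_name.split("-")[1])


print(decode_cnss(0, 1))   # aaaaac
print(decode_cnss(16, 2))  # aacaag
print(contig_offset("chr2L-10001142-10196655.alignment.avid.out"))  # 10001142
```

Adding the value returned by contig_offset to the start and stop fields of a header line gives the coordinates described above.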