Supported Input Formats

Many different programs use a gff- or gtf-style format, and each has its own inclusion rules and formatting guidelines. A truly universal gff/gff3/gtf file converter that requires no user input may therefore be an impossible task, so gFACs requires the user to specify the application source and version. If you do not find your input format, look into the support script format_diagnosis.pl. If your file is made up of several different formats merged together, I suggest breaking it apart. If you want a format added (especially a well-known one), let me know and I will likely create one!

The format is specified in the gFACs command with the -f [code] flag. The flag is mandatory, and gFACs will fail without it. The codes are listed below with notes and can also be seen in the command line manual. For an example of a command with a proper format flag, see any of the sample runs!

BRAKER:
braker_2.05_gtf
braker_2.05_gff
braker_2.05_gff3
braker_2.0_gff3
braker_2.0_gff
braker_2.0_gtf

MAKER:
maker_2.31.9_gff

Note

MAKER may provide other information, such as blastx and protein2genome results. Currently, only MAKER gene and exon models are considered.

PROKKA:
prokka_1.11_gff

GMAP:
gmap_2017_03_17_gff3

GENOME THREADER:
genomethreader_1.6.6_gff3

GFFREAD:
gffread_0.9.12_gff3

EXONERATE:
exonerate_2.4.0_gff

EVIDENCE MODELER:
EVM_1.1.1_gff3

GFACS:
gFACs_gene_table
gFACs_gtf

Note

You can input a gene table from any version of gFACs. However, the prefix on the input will NOT be retained.

NCBI:
refseq_gff

Note

Only CDS features are taken.

genbank_gbff

Note

UTRs are removed; only CDS sequences are considered.

Warning

The gbff format is undergoing repair! Results may not be accurate!
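
Because the -f flag is mandatory and must carry one of the codes above, it can help to check the code before launching a run. The following is a minimal Python sketch, not part of gFACs: the function name build_gfacs_command is hypothetical, the script name gFACs.pl and the positional input path are assumptions about how the command is laid out, and any other flags your run needs are left out. Codes are compared exactly as written on this page.

```python
import shlex

# Format codes copied from the list on this page.
SUPPORTED_FORMATS = {
    "braker_2.05_gtf", "braker_2.05_gff", "braker_2.05_gff3",
    "braker_2.0_gff3", "braker_2.0_gff", "braker_2.0_gtf",
    "maker_2.31.9_gff",
    "prokka_1.11_gff",
    "gmap_2017_03_17_gff3",
    "genomethreader_1.6.6_gff3",
    "gffread_0.9.12_gff3",
    "exonerate_2.4.0_gff",
    "EVM_1.1.1_gff3",
    "gFACs_gene_table", "gFACs_gtf",
    "refseq_gff", "genbank_gbff",
}


def build_gfacs_command(format_code, input_path, script="gFACs.pl"):
    """Return an argument list for a gFACs run with a validated -f code.

    Hypothetical helper: the script name and the positional input path
    are assumptions, and no other gFACs flags are added here.
    """
    if format_code not in SUPPORTED_FORMATS:
        raise ValueError(
            f"Unknown format code {format_code!r}; "
            "choose one of the codes listed in the documentation."
        )
    return ["perl", script, "-f", format_code, input_path]


if __name__ == "__main__":
    # Example input file name is a placeholder.
    cmd = build_gfacs_command("braker_2.05_gtf", "augustus.gtf")
    print(shlex.join(cmd))
```

Rejecting an unknown code up front gives a clearer error than letting the run fail, and the returned list can be passed to subprocess.run once the remaining flags for your run are appended.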

For those who are curious, each format has its own conversion script that converts the input into the gene table. These scripts are found in the format_scripts folder that comes with gFACs. If you are feeling adventurous, you can write your own conversion script that creates the gene table and simply run gFACs with the gene table format code.

[Image: Format.JPG]