CRAIG Data Sets

Training Set

train_genes.masked.fa.gz <- masked sequences in FASTA format (gzip-compressed)
train_genes.gtf <- gene annotation in GTF format

Development Set

develop_genes.masked.fa.gz <- masked sequences in FASTA format (gzip-compressed)
develop_genes.gtf <- gene annotation in GTF format
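
Loading a Set (sketch)

The following is a minimal Python sketch of how one sequence/annotation pair listed above might be read. It is not part of CRAIG. The function names (read_fasta, read_gtf, load_set) are hypothetical, and it assumes that the first word of each FASTA header matches column 1 (seqname) of the GTF file, which this README does not state. It relies only on the standard 9-column, tab-separated GTF layout and on the .gz suffix indicating gzip compression.

```python
import gzip
from collections import defaultdict


def read_fasta(path):
    """Return {sequence_id: sequence} from a FASTA file (gzip or plain text)."""
    opener = gzip.open if str(path).endswith(".gz") else open
    seqs, name, chunks = {}, None, []
    with opener(path, "rt") as fh:
        for line in fh:
            line = line.strip()
            if not line:
                continue
            if line.startswith(">"):
                # Store the previous record before starting a new one.
                if name is not None:
                    seqs[name] = "".join(chunks)
                fields = line[1:].split()
                name, chunks = (fields[0] if fields else ""), []
            else:
                # Sequence case is kept as-is, so any masking is preserved.
                chunks.append(line)
    if name is not None:
        seqs[name] = "".join(chunks)
    return seqs


def read_gtf(path):
    """Group GTF records by seqname as (feature, start, end, strand) tuples."""
    features = defaultdict(list)
    with open(path) as fh:
        for line in fh:
            if line.startswith("#") or not line.strip():
                continue
            cols = line.rstrip("\n").split("\t")
            if len(cols) < 9:
                continue  # skip lines that are not full GTF records
            # GTF coordinates are 1-based and inclusive.
            features[cols[0]].append((cols[2], int(cols[3]), int(cols[4]), cols[6]))
    return features


def load_set(prefix):
    """Load one set by prefix, e.g. "train" or "develop" (names from this README)."""
    seqs = read_fasta(f"{prefix}_genes.masked.fa.gz")
    feats = read_gtf(f"{prefix}_genes.gtf")
    return seqs, feats
```

With the files in the current directory, load_set("train") would return the training sequences and their annotated features, and load_set("develop") the development ones.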