GENCODE#

class openomics.database.sequence.GENCODE(path='ftp://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_32/', file_resources=None, col_rename=None, blocksize=0, remove_version_num=False)[source][source]#

Bases: openomics.database.sequence.SequenceDatabase

Loads the GENCODE database from https://www.gencodegenes.org/ .

Default path: ftp://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_32/ . Default file_resources: {

“basic.annotation.gtf”: “gencode.v32.basic.annotation.gtf.gz”, “long_noncoding_RNAs.gtf”: “gencode.v32.long_noncoding_RNAs.gtf.gz”, “lncRNA_transcripts.fa”: “gencode.v32.lncRNA_transcripts.fa.gz”, “transcripts.fa”: “gencode.v32.transcripts.fa.gz”,

}

Methods Summary

get_rename_dict([from_index, to_index])

param from_index

get_sequences(index, omic[, agg, biotypes])

param index

load_dataframe(file_resources[, blocksize])

param file_resources

Methods Documentation

get_rename_dict(from_index='gene_id', to_index='gene_name')[source][source]#
Parameters
  • from_index

  • to_index

get_sequences(index, omic, agg='all', biotypes=None)[source][source]#
Parameters
  • index (str) –

  • omic (str) –

  • agg (str) –

  • biotypes (List[str]) –

load_dataframe(file_resources, blocksize=None)[source][source]#
Parameters
  • file_resources

  • blocksize