Читать книгу Data Analytics in Bioinformatics - Группа авторов - Страница 67
3.2 Biological Datasets
ОглавлениеBioinformatics deals with various biological datasets being collected at different levels of omics data such as
Genomic Sequence data
Protein Sequence data
Microarray data
Structure data (Structure of RNA and protein)
Chemical data
Disease data.
Based on the type of data Biological database can be divided in to two categories:
a. Primary DatabaseThese kinds of databases are archival in nature because these databases are created by the experimental results submitted directly by researchers. These databases are populated with protein sequence, nucleotide sequence or macromolecular structure etc. [10].Example: Protein Data Bank (PDB), GenBank, DNA Data Bank of Japan (DDBJ), Gene Expression Omnibus (GEO).
b. Secondary DatabaseThese databases are either manually created or extracted from result analysis of primary database to create more structured records for easy retrieval of data [10]. Example: Swiss-port (it is protein sequence database maintained by Swiss Institute of Bioinformatics, Switzerland and the European Bioinformatics Institute, UniProt Knowledgebase.