序列数据的存储 ·核酸序列数据库 ·国际三大核酸序列数据库: GenBank,EBML,DDBJ RefSeq:The Reference Sequence Database ·dbEST:Expressed Sequences Tags数据库 ·UniGene等
序列数据的存储 核酸序列数据库 国际三大核酸序列数据库: GenBank, EBML, DDBJ RefSeq: The Reference Sequence Database dbEST: Expressed Sequences Tags数据库 UniGene等
核酸数据库数据的增长 Growth of the International Nucleotide Sequence Database Collaboration 90 80 0 40 Base Pairs in Billions 1 0000 0 10-06 70-a7 Base Pairs contributed by GenBark袋-EMBL-DDBJ-■
核酸数据库数据的增长
较早的基因组数据库-GDB ·为人类基因组计划(HGP)保存和处理基因组图谱数据。 GDB的目标是构建关于人类基因组的百科全书,除了构建 基因组图谱之外,还开发了描述序列水平的基因组内容的方 法,包括序列变异和其它对功能和表型的描述。 GenBank 由美国国立卫生研究院NH下属国立生物技术信息中心 NCBI建立。 汇集并注释了所有公开的核酸以及蛋白质序列。每个记录 代表了一个单独的、连续的、带有注释的DNA或RNA片段
较早的基因组数据库- GDB 为人类基因组计划(HGP)保存和处理基因组图谱数据。 GDB的目标是构建关于人类基因组的百科全书,除了构建 基因组图谱之外,还开发了描述序列水平的基因组内容的方 法,包括序列变异和其它对功能和表型的描述。 GenBank • 由美国国立卫生研究院NIH下属国立生物技术信息中心 NCBI建立。 • 汇集并注释了所有公开的核酸以及蛋白质序列。每个记录 代表了一个单独的、连续的、带有注释的DNA或RNA片段
GenBank中测序最多的20个物种 Entries Bases Species 11148092 12700084970 Homo sapiens (人) 7200432 8291244632 Mus musculus (小鼠) 1288005 5766221181 Rattus norvegicus (大鼠) 2026919 3808273911 Bos taurus (牛) 2841072 3564487204 Zea mays (玉米) 1559584 2764828000 Danio rerio (斑马鱼) 2058320 1863733664 Sus scrofa(猪) 1179148 1517127691 0 ryza sativa(水稻 227831 1352463179 Strongylocentrotus purpuratus (海胆)》 1417622 1135336003 Xenopus tropicalis (爪蟾 212172 943046501 Pan troglodytes(黑猩猩) 734569 897464279 Drosophila melanogaster(果蝇) 1949707 879897433 Arabidopsis thaliana(拟南芥) 802461 857512666 Gallusgallus(鸡) 497579 810324848 Vitis vinifera(葡萄) 75850 708598911 Macaca mulatta(恒河猴) 1220300 694494794 Canis lupus familiaris 1006209 657603350 Sorghum bicolor(高梁) 1102504 655077659 Triticum aestivum(小麦) 409757 520072874 Medicago truncatula(蒺藜状苜蓿) 2007
GenBank中测序最多的20个物种 161.0版,2007
EMBL核酸序列数据库 EMBL-EBI(European Bioinformatics Institute)维护; ● http://www.ebi.ac.uk/embl/ EMBL-EBI EB-eye All Databases Enter Text Here Go Reset⑦ Give us Search Advanced Search feedback Databases Tools EBI Groups Training Industry About Us Help Site Index EMBL-Bank Home EBI>Databases EMBL-Bank Access EMBL Nucleotide Sequence Database Documentation News The EMBL Nucleotide Sequence Database(also known as EMBL-Bank)constitutes Europe's primary nucleotide sequence resource.Main sources for DNA and RNA Submission sequences are direct submissions from individual researchers,genome sequencing EMBL Publications projects and patent applications. NUCLEOTIDE SEQUENCE People DATABASE The database is produced in an international collaboration with GenBank(USA)and the Contact DNA Database of Japan(DDBJ).Each of the three groups collects a portion of the total sequence data reported worldwide,and all new and updated database entries are exchanged between the groups on a daily EMBL Fetch basis.The current database release(Release 101.Sept 2009).with according Release notes and user manual are available from the EBI servers.A sample database entry is shown here. Fetch an EMBL record by id Go A publication in Nucleic Acids Research 2009 37:D19-D25.provides further information and details. The EMBL nucleotide sequence database forms part of the European Nucleotide Archive.an EBI project led by Hands-on Training Guy Cochrane as part of the The Protein and Nucleotide Database Group(PANDA)under Ewan Birney. 30th April-1st May 2009: Short Read Bioinformatics hands-on EBI training Link Explanation course...more Access Database queries,Completed genomes webserver,FTP archives (EMBL release,alignments etc),EMBL sequence version archive (SVA).Browse by geography. Collaborations Submission Primary sequence submissions,third party annotation,updates INSDC-International Documentation Release notes user manual,Information for Submitters,FAQ,Release information,Forthcoming Changes
EMBL核酸序列数据库 EMBL-EBI (European Bioinformatics Institute)维护; http://www.ebi.ac.uk/embl/