当前位置：和泉文库 > 生物 > 浏览文档

麻省理工大学：《Foundations of Biology》课程教学资源（英文版）Lecture 2 The Language of genomics

The Language of genomics CDNAS, ESTS. BACS Alus. etc Dideoxy Method Shotgun Sequencing The 'shotgun coverage equation(Poisson) Flavors of blast BLASTIPNXJ, TBLASTINXI Statistics of High Scoring Segments

文件格式：PDF，文件大小：716.04KB，售价：10.32元

共36页，可试读12页，点击往前阅读 ↑↑

文档详细内容（约36页）

DNA Sequence Comparison Alignment Target frequencies and mismatch penalties Eukaryotic gene structure Comparative genomics applications Pipmaker(2 species comparison) Phylogenetic Shadowing(many species) Intro to DNA sequence motifs See Ch. 7 of Mount

DNA Sequence Comparison & Alignment • Target frequencies and mismatch penalties • Eukaryotic gene structure • Comparative genomics applications: • See Ch. 7 of Mount - Pipmaker (2 species comparison) - Phylogenetic Shadowing (many species) Intro to DNA sequenc e motifs

DNA Sequence Alignment V How is n related to the score matrix? n is the unique positive solution to the equation pip: e= p frequency of nt i, Si= score for aligning an i,j pair What kind of an equation is this? What would happen to n if we doubled all the scores? What does this tell us about the nature of n? Karlin Altschul 1990

i DNA Sequence Alignment V How is λ related to the score matrix? λ is the unique positive solution to the equation*: ∑ p pjeλsij = 1 i i,j p = frequency of nt i, sij = score for aligning an i,j pair What kind of an equation is this? What would happen to λ  if we doubled all the scores? What does this tell us about the nature of λ? *Karlin & Altschul, 1990

DNA Sequence Alignment VI What scoring matrix to use for dNA? Usually use simple match-mismatch matrices Gmm CGT mmm mmm m="mismatch penalty(must be negative

DNA Sequence Alignment VI What scoring matrix to use for DNA? Usually use simple match-mismatch matrices: i j: A C G T A 1 m m m C m 1 m m si,j : G T m m m m 1 m m 1 m = “mismatch penalty” (must be negative)

DNA Sequence alignment Vll How to choose the mismatch penalty? Use theory of High Scoring Segment composition High scoring alignments will have composition qi= pp ei where q = frequency of i j pairs(target frequencies") pp - req of i, j bases in sequences being compared What would happen to the target frequencies if we doubled all of the scores? *Karlin Altschul. 1990

DNA Sequence Alignment VII How to choose the mismatch penalty? Use theory of High Scoring Segment composition* High scoring alignments will have composition: qij = pi pj e λ sij where qij = frequency of i,j pairs (“target frequencies”) p , p = freq of i, j bases in sequences being compared i j What would happen to the target frequencies if we doubled all of the scores? *Karlin & Altschul, 1990

DNA Sequence alignment Vlll Still figuring out how to choose the mismatch penalty m Target frequencies: qi=p,p e/ij =In(q;/p:p )/A If you want to find regions with R% identities r=R/100q=r4q=(1n)12() Set s=1 Then m=Si=S/Si=In(q /p pi ))/(In(qi/pip1)/ (]) →m=n4(1)/3)n(4

DNA Sequence Alignment VIII Still figuring out how to choose the mismatch penalty m Target frequencies: qij = pi pj e λ sij sij = l n ( qij / pi pj ⇒ )/ If you want to find regions with R% identities: r = R /100 qii = r/4 qij = (1-r)/12 (i,j) Set sii = 1 Then m = sij = sij/sii = ln(qij / pi pj )/ λ) / (ln(qii / pi pi )/ λ (i ≠j) ⇒ m = ln(4(1-r)/3)/ln(4r) λ

点击进入文档下载页（PDF格式）

共36页，可试读12页，点击继续阅读 ↓↓

您可能感兴趣的文档

麻省理工大学：《Foundations of Biology》课程教学资源（英文版）Lecture 5 Molecular Phylogenetics
麻省理工大学：《Foundations of Biology》课程教学资源（英文版）Lecture 4 Database Searching
麻省理工大学：《Foundations of Biology》课程教学资源（英文版）Lecture 2 More Pairwise Sequence Comparisons
麻省理工大学：《Foundations of Biology》课程教学资源（英文版）Lecture 3 More Multiple Sequence Alignment
麻省理工大学：《Foundations of Biology》课程教学资源（英文版）Lecture 1 Michael Yaffe Introduction to Bioinformatics
《微生物遗传学》第四章基因工程技术在改进微生物
《分子生物学》课程教学资源（练习题）试题详解（含参考答案）
南京军区南京总医院：《组织芯片应用的现状与前景》讲义
《酶学》课程教学资源（讲义）第四章酶的结构和功能
《酶学》课程教学资源（讲义）第十一章酶在医学方面的应用
《酶学》课程教学资源（讲义）第六章多种因素对酶反应速度的影响
《酶学》课程教学资源（讲义）第八章酶的别构效应
麻省理工大学：《Foundations of Biology》课程教学资源（英文版）Lecture 1 Genome Sequencing
麻省理工大学：《Foundations of Biology》课程教学资源（英文版）Lecture 3 Review of DNA Seq
麻省理工大学：《Foundations of Biology》课程教学资源（英文版）Lecture 6 Predicting rna Secondary structure
麻省理工大学：《Foundations of Biology》课程教学资源（英文版）Lecture 4 Organization of topics
麻省理工大学：《Foundations of Biology》课程教学资源（英文版）Lecture 6 Structure Prediction
麻省理工大学：《Foundations of Biology》课程教学资源（英文版）Lecture 5 Markov models
麻省理工大学：《Foundations of Biology》课程教学资源（英文版）Lecture 5 Review -Homology Modeling
麻省理工大学：《Foundations of Biology》课程教学资源（英文版）Lecture 1 Review of protein structure hierarchy
麻省理工大学：《Foundations of Biology》课程教学资源（英文版）Lecture 1 How are X-ray crystal structures
麻省理工大学：《Foundations of Biology》课程教学资源（英文版）Lecture 3 For a molecular simulation or model
麻省理工大学：《Foundations of Biology》课程教学资源（英文版）Lecture 2 Comparing protein Structures
麻省理工大学：《Foundations of Biology》课程教学资源（英文版）Lecture 7 The protein interactome

点击购买下载（PDF）

下载及服务说明

购买前请先查看本文档预览页，确认内容后再进行支付；
如遇文件无法下载、无法访问或其它任何问题，可发送电子邮件反馈，核实后将进行文件补发或退款等其它相关操作；
邮箱：

文档浏览记录