
簡介基因體 註解工具 SUNLab Molecular Genetics Bioinformatics
簡介基因體 註解工具

何謂基因體 疃也稱基因組 疃泛指特定生物體細胞核內的所有基 因組合 Human family Human cell Nuclear genome NLab Mitochondrial genomes 888 enetics Bioinformatics
何謂基因體 也稱基因組 泛指特定生物體細胞核內的所有基 因組合

人類基因體解讀計畫 九十年代初期正式成 uman 立 Genome 由美國衛生研究院及 能源部領軍,結合全 Project 美分子生物學界的資 源 企圖解讀整個人類基 因體序列並為預估的 十萬個基因描繪出完 整的基因地圖 SUNLab Molecular Genetics Bioinformatics
人類基因體解讀計畫 九十年代初期正式成 立 由美國衛生研究院及 能源部領軍,結合全 美分子生物學界的資 源 企圖解讀整個人類基 因體序列並為預估的 十萬個基因描繪出完 整的基因地圖

International Consortium Completes Human Genome Project http://www-genome.wi.mit.edu/media/2003/pr_03_humangenome.html genome center home for the media human genome project completed International Consortium Completes Human Genome Related Links Project International Human All Goals Achieved;New Vision for Genome Research Unveiled Genome Sequencing BETHESDA,Md.,Apri/14,2003--The International Human Genome Sequencing Consortium Consortium,led in the United States by the National Human Genome Research Institute Frequently Asked (NHGRD)and the Department of Energy (DOE),today announced the successful Questions completion of the Human Genome Project more than two years ahead of schedule. Human Genome Project Goals Also today,NHGRI unveiled its bold new vision for the future of genome research, Human Genome Project officially ushering in the era of the genome.The vision will be published in the April 24 Budget issue of the journal Nafure,coinciding with the 50th anniversary of Nafure's publication of What's Next? the landmark paper by Nobel laureates James Watson and Francis Crick that described Comparative Genomics DNA's double helix.Dr.Watson also was the first leader of the Human Genome Project. National Human Genome Research Institute The international effort to sequence the 3 billion DNA letters in the human genome is considered by many to be one of the most ambitious scientific undertakings of all time, even compared to splitting the atom or going to the moon
http://www-genome.wi.mit.edu/media/2003/pr_03_humangenome.html International Consortium Completes Human Genome Project

人類基因體計割的目的與應用 ◆完成人類基因體3*109鹼基之全 部定序工作 ◆發展新的生物科技 ◆生物資部 ◆建立實驗動物模式 ◆基因體定位以及基因鑑定 SUNLab Molecular Genetics Bioinformatics
人類基因體計劃的目的與應用 完成人類基因體 3*109 鹼基之全 部定序工作 發展新的生物科技 生物資訊 建立實驗動物模式 基因體定位以及基因鑑定

為何要做基因體的註解工作? (Genome annotation) 因為若無註解,基因體序列只是一群GATC 的組合,對一般的生物學家根本毫無幫助· 基因體序列被完全定序之後,生物學家非常 急切想要知道的就是,這由四個字母编排出 來的序列到底隱含了什麼樣的意義? 基因體註解→廣義地說,就是把所有在DNA 序列中有意義的資訊全都註解出來。 SUNLab Molecular Genetics Bioinformatics
為何要做基因體的註解工作? (Genome annotation) • 因為若無註解,基因體序列只是一群 GATC 的組合,對一般的生物學家根本毫無幫助。 • 基因體序列被完全定序之後,生物學家非常 急切想要知道的就是,這由四個字母編排出 來的序列到底隱含了什麼樣的意義? • 基因體註解➔廣義地說,就是把所有在DNA 序列中有意義的資訊全都註解出來

Example:SARS 基因體註解 SARS之RNA genome定序後’首要之工作即是 把序列上之基因位置及功能標示出來。這項工作稱為 基因體註解(genome annotation)。其中生物資訊中 的序列比對(sequence alignment)) 技術即可運用於此。 疃此項基因體註解工作,需仰賴資料庫中其他的冠狀 病毒(coronavirus)之基因功能註解,由序列的相似 性及區域來推斷SARS病毒中重要的結構蛋白 (structure protein),spike protein (S), membrame protein(M),small membrane protein (E) nucleocapsid protein’以及聚合脢等非結構蛋白 (NSPs)之基因位置。 SUNLab Molecular Genetics Bioinformatics
SARS 之 RNA genome 定序後,首要之工作即是 把序列上之基因位置及功能標示出來。這項工作稱為 基因體註解 (genome annotation)。其中生物資訊中 的序列比對 (sequence alignment) 技術即可運用於此。 此項基因體註解工作,需仰賴資料庫中其他的冠狀 病毒 (coronavirus) 之基因功能註解,由序列的相似 性及區域來推斷 SARS 病毒中重要的結構蛋白 (structure protein) ,如 spike protein (S), membrame protein(M), small membrane protein (E) nucleocapsid protein ,以及聚合脢等非結構蛋白 (NSPs) 之基因位置。 Example: SARS 基因體註解

Central Dogma of Molecular Biology genome cel DNA ATGGCATGTACTTGGTAG hromosomes ↓Transcription Genes contain instructions for making proteins RNA AUGGCAUGUACUUGGUAG ↓Translation Proteins act alone or in complexes to ProteinMetAlacysThrTrp* perform many cellular functions From Genes to Proteins http://www.doegenomes.org/ SUNLab Molecular Genetics Bioinformatics
Central Dogma of Molecular Biology ATGGCATGTACTTGGTAG MetAlaCysThrTrp* AUGGCAUGUACUUGGUAG DNA RNA Protein Transcription Translation http://www.doegenomes.org/

Structure of an idealized gene Transcription ATG,TGA,or TAG Start site Stop codon CCAAT AATAAA Box ATG Poly(A)signal Initiation codon Enhancer TATA Poly(A)tail Box GT AG GT AG EXON EXON EXON 5 39 Untranslated region Untranslated region Introns ↓Transcription SUNLab Molecular Genetics Bioinformatics
ATG Initiation codon ATG,TGA,or TAG Stop codon GT AG GT AG Transcription Start site 5’ Untranslated region 3’ Untranslated region Introns TATA Box AATAAA Poly(A) signal Poly(A) tail CCAAT Box Structure of an idealized gene Enhancer Transcription EXON EXON EXON

基因體註解 Promoter(啟動子):DNA region involved in and necessary for initiation of transcription,and including the RNA polymerase binding site,the startpoint of transcription and various other sites at which of transcription regulatory proteins may bind. Enhancer(增強子):a type of control site in DNA,present in the control region of many eukaryofic genes,and whose regulation by specific regulatory proteins dramatically increases the levele of transcription. SUNLab Molecular Genetics Bioinformatics
基因體註解 Promoter (啟動子): DNA region involved in and necessary for initiation of transcription, and including the RNA polymerase binding site, the startpoint of transcription and various other sites at which of transcription regulatory proteins may bind. Enhancer (增強子): a type of control site in DNA, present in the control region of many eukaryotic genes, and whose regulation by specific regulatory proteins dramatically increases the levele of transcription