|
Web News and Updates: |
September 25, 2006 The September 25th updates and additions include: - RefSeq gene annotations updated - added 28,112 new EST sequences - 2,561 new mRNA sequences added - 380 new clinical cases of defined phenotype added to database - 1799 new microsatellites added from "Tamiya et al, 2005".
The February 17th updates and additions include: - new SNPs from dbSNP added - RefSeq Release 15 updated - Ensembl gene annotations updated - updated EST data - new spliced EST track available - 137 new clinical cases of defined phenotype added to database
The October 30th updates and additions include: -76,630 new SNPs from dbSNP added -202,525 SNPs from the HapMap Consortium added with current Phase I genotype data -74 Human-Chimp inversions added to structural features track -2 polymorphic human inversions added to structural features track -Large ancient duplication "LAD7" added -COL1A2 repressor added to structural features track -65 clinical cases of defined phenotype added to database Human-Chimpanzee Inversions Publication: Feuk L, MacDonald JR, Tang T, Carson AR, Li M, et al. (2005) Discovery of Human Inversion Polymorphisms by Comparative Analysis of Human and Chimpanzee DNA Sequence Assemblies. PLoS Genet 1(4): e56 Abstract: Chimpanzee is the closest relative to humans having DNA sequences about 98% identical to each other. Small DNA sequence changes and probably more importantly, larger structural changes of chromosomes, led to the divergence of the two species some 6 million years ago. Until recently, there were ten structural differences visible under the microscope between chimpanzee and human, and nine of these were inversions of DNA. Through computational comparisons of genome sequences, the current study identifies another 1,576 putative inversion events. Thirty-three of these were larger than 100,000 base pairs in size and 29 intersect genes, prioritizing them for evolutionary studies. Twenty-three of the inversions have been confirmed experimentally with the largest being 4.3 million base pairs in size on human Chromosome 7. Surprisingly, three of the "inverted" regions were found to be variable in their orientation in the human population (in some cases the inversion was in the ancestral orientation found in chimpanzee). These observations indicate the human genome is still evolving in structure. Moreover, since such variable inversions have been shown to predispose to other (sometimes deleterious) changes in chromosomes, the new data delineate potential disease-associated genes.
HapMap Phase I SNP/Genotype Data Publication: A haplotype map of the human genome, International HapMap Consortium Nature 437, 1299-1320 (27 October 2005) Abstract: Inherited genetic variation has a critical but as yet largely uncharacterized role in human disease. Here we report a public database of common variation in the human genome: more than one million single nucleotide polymorphisms (SNPs) for which accurate and complete genotypes have been obtained in 269 DNA samples from four populations, including ten 500-kilobase regions in which essentially all information about common DNA variation has been extracted. These data document the generality of recombination hotspots, a block-like structure of linkage disequilibrium and low haplotype diversity, leading to substantial correlations of SNPs with many of their neighbours. We show how the HapMap resource can guide the design and analysis of genetic association studies, shed light on structural variation and recombination, and identify loci that may have been subject to natural selection during human evolution.
Large Ancient Duplications Publication: Itoh T, Toyoda A, Taylor TD, Sakaki Y, Hattori M. Identification of large ancient duplications associated with human gene deserts. Nat Genet. 2005 Oct;37(10):1041-3. Epub 2005 Sep 25. PMID: 16186813 Abstract: The authors identified 15 regions of >1 Mb in the human genome composed of large ancient local duplications corresponding to gene deserts. We detected these intrachromosomal duplications in mouse and dog but not in chicken; they present as patches of similarity as low as 60%. These findings suggest that some human gene deserts originated from duplications of segments lacking genes in a mammalian common ancestor
COL1A2 Repressor Publication: Antoniv TT, Tanaka S, Sudan B, De Val S, Liu K, Wang L, Wells DJ, Bou-Gharios G, Ramirez F. Identification of a Repressor in the First Intron of the Human {alpha}2(I) Collagen Gene (COL1A2). J Biol Chem. 2005 Oct 21;280(42):35417-23. Epub 2005 Aug 8. PMID: 16091368 Abstract: The human and mouse genes that code for the 2 chain of collagen I (COL1A2 and Col1a2, respectively) share a common chromatin structure and nearly identical proximal promoter and far upstream enhancer sequences. Despite these homologies, speciesspecific differences have been reported regarding the function of individual cis-acting elements, such as the first intron sequence. In the present study, we have investigated the transcriptional contribution of the unique open chromatin site in the first intron of COL1A2 using a transgenic mouse model. DNase I footprinting identified a cluster of three distinct areas of nuclease protection (FI1-3) that span from nucleotides 647 to 760, relative to the transcription start site, and which contain consensus sequences for GATA and interferon regulatory factor (IRF) transcription factors. Gel mobility shift and chromatin immunoprecipitation assays corroborated this last finding by documenting binding of GATA-4 and IRF-1 and IRF-2 to the first intron sequence. Moreover, a short sequence encompassing the three footprints was found to inhibit expression of transgenic constructs containing the COL1A2 proximal promoter and far upstream enhancer in a position-independent manner. Mutations inserted into each of the footprints restored transgenic expression to different extents. These results therefore indicated that the unique open chromatin site of COL1A2 corresponds to a repressor, the activity of which seems to bemediated by the concerted action ofGATAand IRF proteins. More generally, the study reiterated the existence of species-specific difference in the regulatory networks of the mammalian2(I) collagen coding genes.
The September 18 updates and additions include: -422 new human mRNA sequences added -260 new clinical cases of defined phenotype -RefSeq annotations updated -Ensembl annotaions updated -new BAC end data added
The June 15 updates and additions include: -viral integration sites added -243 new human mRNA sequences added -60 new clinical cases of defined phenotype -RefSeq annotations updated -Ensembl annotaions updated -new BAC end data added
Retroviral DNA integration sites Publication: Mitchell RS, Beitzel BF, Schroder AR, Shinn P, Chen H, Berry CC, Ecker JR, Bushman FD. Retroviral DNA integration: ASLV, HIV, and MLV show distinct target site preferences. PLoS Biol. 2004 Aug;2(8):E234. Epub 2004 Aug 17. PMID: 15314653 Abstract: The completion of the human genome sequence has made possible genome-wide studies of retroviral DNA integration. Here we report an analysis of 3,127 integration site sequences from human cells. We compared retroviral vectors derived from human immunodeficiency virus (HIV), avian sarcoma-leukosis virus (ASLV), and murine leukemia virus (MLV). Effects of gene activity on integration targeting were assessed by transcriptional profiling of infected cells. Integration by HIV vectors, analyzed in two primary cell types and several cell lines, strongly favored active genes. An analysis of the effects of tissue-specific transcription showed that it resulted in tissue-specific integration targeting by HIV, though the effect was quantitatively modest. Chromosomal regions rich in expressed genes were favored for HIV integration, but these regions were found to be interleaved with unfavorable regions at CpG islands. MLV vectors showed a strong bias in favor of integration near transcription start sites, as reported previously. ASLV vectors showed only a weak preference for active genes and no preference for transcription start regions. Thus, each of the three retroviruses studied showed unique integration site preferences, suggesting that virus-specific binding of integration complexes to chromatin features likely guides site selection.
The March 14 updates and additions include: -New TIGR Human Gene Index updated -HAPMAP genotype data updated -New human mRNAs added -576 new clinical cases of defined phenotype -New FISH data added -New BAC end data added -Updated HUGO gene nomenclature
The November 30th updates and additions include: -Putative gene enhancers added -RefSeq and Ensembl gene annotations updated -30,809 new non-human mRNAs added -New dog, mouse, fugu and rat comparative DNA sequence analysis -302 new clinical cases of defined phenotype -New FISH data added -New BAC end data added -Updated gene desert analysis
Enhancer Browser A chromosome 7 specific dataset was extracted from the whole genome enhancer browser and the information has been posted in the structural features track in the chromosome 7 genome browser. For more information on this dataset, please see the summary below or visit the Enhancer Browser project website at http://hazelton.lbl.gov/~sminovit/enhancers/frnt_page.html. A significant challenge exists in identifying and characterizing the complete set of gene enhancers present in the human genome. To date, few distant gene regulatory sequences have been identified despite the wide variety of temporal and spatial patterns exhibited by most genes. Nonetheless, small datasets have suggested that a significant percentage of human-fugu conserved noncoding sequences located a distance from genes have enhancer activity in transgenic mouse assays. To perform large-scale assessment of this preliminary finding, we prioritized human noncoding DNA for in vivo studies through whole human-fugu genome comparisons and identified ~2,500 highly conserved noncoding sequences. In an initial pilot, we have begun to systematically characterize all ~100 highly conserved human-fugu noncoding elements on human chromosome 16 using a high throughput mouse transgenic reporter assay. Indeed, to date, a high proportion of elements tested (48/60) do act as gene enhancers in vivo and ongoing efforts are focused on characterizing all human-fugu noncoding sequences. As a community resource, we have established a database to visualize these expression patterns and the ocalization of the larger set of genome-wide human-fugu conserved elements. These studies support the power of human-fish comparisons to identify a core set of vertebrate gene regulatory elements scattered throughout the human genome and provide the experimental substrate to better understand the vocabulary that governs gene expression.
The October 18th updates and additions include: -2270 new full length mRNA sequences added -Marshfield insertion/deletion polymorphism data added -300 new clinical cases of defined phenotype -Updated HUGO gene nomenclature -New FISH data added -New BAC end data added -Updated structural features
Marshfield Human Insertion/Deletion Polymorphisms Human insertion/deletion polymorphism data was obtained from the Marshfield Centre for Medical Genetics. Information and documentation for the data is included below.
The best current estimate is that 20% of all human polymorphisms are of the insertion/deletion (indel) type. Indels can be broken down into a roughly 50:50 mix of multiallelic and diallelic polymorphisms. Multiallelic indels include the minisatellites (also called VNTRs) and the short tandem repeat polymorphisms (STRPs) (also called microsatellites or simple sequence length polymorphisms (SSLPs)). Minisatellites are relatively rare and typically have repeat lengths of a few tens of nucleotides with tandem repeat copy numbers in the hundreds to thousands. STRPs are abundant and have repeat lengths of 1-6 nucleotides with tandem repeat copy numbers mostly < 30. Diallelic indels are also common, but are only just now beginning to be studied in detail (see below). All of the diallelic indels and most of the STRPs have the nice feature of being able to be analyzed simply by PCR followed by gel electrophoresis. We believe indels are an attractive alternative to SNPs.
They have confirmed and characterized well over 2,000 human diallelic short insertion/deletion (indel) polymorphisms (Weber, et al., AJHG 71: 2002). Supporting data for the Weber et al. 2002 paper are located in the following two spreadsheets: MIDdata and MIDAlleleFreqs. The MIDAlleleFreqs table contains allele frequencies in five populations for 2000 of the Marshfield Insertion/Deletion Polymorphisms. The five populations are Europeans, Japanese, African Pygmies, Native Americans from the Amazon, and the Polymorphism Discovery Resource (PDR). Also included are average values for all populations. The MIDdata table contains in addition to the allele frequency information, other information about the markers such as chromosomal location, inserted sequence, chimp/gorilla results, and data mining sources.
The August 18th updates and additions include: -331 new full length mRNA sequences added -ENCODE (encylopedia of DNA elements) regions added -376 new clinical cases of defined phenotype -Updated HUGO gene nomenclature -New FISH data added -New BAC end data added -Variation data added (large scale genomic, gene expression, and hominoid lineage-specific gene copy number variations) ENCODE project overview In April 2003, the sequence of the human genome was completed. Although this is a significant achievement, much remains to be done. Before the best use of the information contained in the sequence can be made, the identity and precise location of all of the protein-encoding and non-protein-encoding genes will have to be determined. The identity of other functional elements encoded in the DNA sequence, such as promoters and other transcriptional regulatory sequences, along with determinants of chromosome structure and function, such as origins of replication, also remain largely unknown. A comprehensive encyclopedia of all of these features is needed to fully utilize the sequence to better understand human biology, to predict potential disease risks, and to stimulate the development of new therapies to prevent and treat these diseases.
To encourage discussion and comparison of existing computational and experimental approaches, and to stimulate the development of new ones, the NHGRI proposed to create a highly interactive public research consortium to carry out a pilot project for testing and comparing existing and new methods to identify functional sequences in DNA. Working together in a highly cooperative effort to rigorously analyze a defined portion of the human genome sequence, investigators with diverse backgrounds and expertise will be able to evaluate the relative merits of each of a diverse set of techniques, technologies and strategies in identifying all the functional elements in human genomic sequence, to identify gaps in our ability to annotate genomic sequence, and to consider the abilities of such methods to be scaled up for an effort to analyze the entire human genome.
On July 23-24, 2002, the NHGRI organized a workshop, the Comprehensive Extraction of Biological Information from Genomic Sequence, to discuss this proposal. The workshop participants resoundingly supported the concept of a pilot project and made a number of recommendations about the project's goals, organization and implementation, which have now been incorporated into NHGRI's plan. The ultimate goal of this project is to improve access to information, resources, ideas, expertise, and technology beyond the scope of any single group, and to affect the entire community of researchers interested in mining genomic sequence. The hoped-for outcome will be a clear path to determining all of the functional elements in the entire human genome sequence and integrating the information in a manner that will guide future basic and clinical research. For more information, visit the ENCODE project website.
Variation data overview Three new types of DNA variations have been added to the database.
Large-scale copy-number variations (LCVs) involve gains or losses of several kilobases to hundreds of kilobases of genomic DNA among phenotypically normal individuals. These LCVs were investigated using array CGH for 55 unrelated individuals. A total of 255 loci across the human genome that contained genomic imbalances were identified. Twenty four variants were present in greater than 10% of the individuals examined. Half of these regions overlap with genes, and many coincide with segmental duplications or gaps in the human genome assembly. This previously unappreciated heterogeneity may underlie certain human phenotypic variation and susceptibility to disease and argues for a more dynamic human genome structure. For additional information, visit the Database of Genomic Variants.
Reference: Iafrate AJ, Feuk L, Rivera MN, Listewnik ML, Donahoe PK, Qi Y, Scherer SW, Lee C. Detection of large-scale variation in the human genome. Nat Genet. 2004 Aug 1
A genetic analysis of genome-wide variation in human gene expression identified natural variation in gene expression and confirmed a heritable component to the baseline expression level of many genes. Microarrays were used to measure the baseline expression levels of genes in immortalized B cells from members of Centre d'Etude du Polyporphisme Humain (CEPH) Utah pedigrees. For each of the ~8,500 genes on the array, they estimated the variance of expression level among unrelated individuals (94 CEPH grandparents) and the mean of variance of array replicates. The analysishighlighted those genes which had a greater expression variation between individuals than between replicates.
Reference: Morley M, Molony CM, Weber TM, Devlin JL, Ewens KG, Spielman RS, Cheung VG. Genetic analysis of genome-wide variation in human gene expression. Nature. 2004 Aug 12;430(7001):743-7. Epub 2004 Jul 21.
The detection of lineage-specific gene duplication and loss in human and great ape evolution, identified thosegenes which have undergone lineage-specific duplications or contractions among several hominoid lineages. Interspecies cDNA array-based comparative genomic hybridization was used to individually compare copy number variation for 39,711 cDNAs, representing 29,619 human genes, across five hominoid species, including human. A total of 1,005 genes, either as isolated genes or in clusters positionally biased toward rearrangement-prone genomic regions, that produced relative hybridization signals unique to one or more of the hominoid lineages.
Reference: Fortna A, Kim Y, MacLaren E, Marshall K, Hahn G, Meltesen L, Brenton M, Hink R, Burgers S, Hernandez-Boussard T, Karimpour-Fard A, Glueck D, McGavran L, Berry R, Pollack J, Sikela JM. Lineage-specific gene duplication and loss in human and great ape evolution. PLoS Biol. 2004 Jul;2(7):E207. Epub 2004 Jul 13.
The June 30th updates and additions include: -New TCAG gene annotation release 4 -3,389 UniGene clusters added -25,149 HapMap project SNP's added with genotype data -95 new clinical cases of defined phenotype -Updated HUGO gene nomenclature TCAG Annotations TCAG annotations are manually curated entries using all available data sources to obtain the most complete and accurate description of all chromosome 7 transcripts. This includes incorporating all data from GenBank (EST, mRNA), RefSeq, Ensembl, LocusLink, UCSC, TIGR Human Gene Index, HORDE (Human Olfactory Receptor Genes), Cytochrome P450 DB, ncRNA Databases, DBTSS (Database of transcription Start Sites), Celera Genomics, TCAG, RIKEN DIGIT gene predictions, Pseudogene databases, (www.pseudogene.org), H-Invitational database and all relevant publications. Additional information and links to gene expression data, OMIM, GO Annotations etc provide a complete centralized resource for the research community. The TCAG Annotation Release 4 includes an additional 153 gene structures, including new known, novel, putative and predicted gene models as well as a number of new pseudogenes and non-coding RNA's (ncRNA).
UniGene UniGene is an experimental system for automatically partitioning GenBank sequences into a non-redundant set of gene-oriented clusters. Each UniGene cluster contains sequences that represent a unique gene, as well as related information such as the tissue types in which the gene has been expressed and map location.
HapMap Data The goal of the International HapMap Project is to develop a haplotype map of the human genome, the HapMap, which will describe the common patterns of human DNA sequence variation. The HapMap is expected to be a key resource for researchers to use to find genes affecting health, disease, and responses to drugs and environmental factors. The information produced by the Project will be made freely available.
The Project is a collaboration among scientists in Japan, the U.K., Canada, China, Nigeria, and the U.S. [see participating groups]. The Project officially started with a meeting on October 27-29, 2002 and is expected to take about three years.
Genetic variation and use of the HapMap Most common diseases, such as diabetes, cancer, stroke, heart disease, depression, and asthma, are affected by many genes and environmental factors. Although any two unrelated people are the same at about 99.9% of their DNA sequences, the remaining 0.1% is important because it contains the genetic variants that influence how people differ in their risk of disease or their response to drugs. Discovering the DNA sequence variants that contribute to common disease risk offers one of the best opportunities for understanding the complex causes of disease in humans.
Currently only genotypes for one population (90 individuals from 30 CEPH family trios) are available, but samples from two other populations (Africa & Asia) will also be genotyped in the project. See project website http://www.hapmap.org for further information on samples and overall scientific goals.
The May 7th updates and additions include.... -2075 H-Invitational cDNA's added with links to the H-INV db -247 pseudogenes from www.pseudogene.org -17856 non-human mRNA aligned to Chr7 -31 chromosome 7 ultra-conserved elements added to structural features. -400 new clinical cases of defined phenotype H-Invitational cDNA H-Invitational Database (H-InvDB) is a human gene database, with integrative annotation of 41,118 full-length cDNA clones currently available from six high throughput cDNA sequencing projects. This database represents 21,037 cDNA clusters describing their gene structures, functions, novel alternative splicing isoforms, non-coding functional RNAs, functional domains, sub-cellular localizations, mapping of SNPs and microsatellite repeat motifs in relation with orphan diseases, gene expression profiling, and comparative results with mouse full-length cDNAs in the context of molecular evolution.
For more information, visit the H-invitational website at http://www.jbirc.aist.go.jp/hinv/index.jsp and see the relevant publication listed below.
Reference: Integrative Annotation of 21,037 Human Genes Validated by Full-Length cDNA Clones. PLoS Biol. 2004 Apr 20 [Epub ahead of print] PMID: 15103394 [PubMed - as supplied by publisher]
Pseudogene (Gerstein Lab) Pseudogenes are sequences of genomic DNA with such similarity to normal genes that they are regarded as non-functional copies or close relatives of genes. This data track was obtained from the Gerstein Lab at Yale Univeristy. For more information visit the Gerstein lab website at http://www.pseudogene.org and see the relevant publication listed below.
Reference: Zhang Z, Harrison PM, Liu Y, Gerstein M. Millions of years of evolution preserved: a comprehensive catalog of the processed pseudogenes in the human genome. Genome Res. 13:2541-58 (2003).
Non-human mRNA Non-human vertebrate and invertebrate mRNAs were taken from GenBank, and then mapped to chromosome 7 using translated BLAT. 50% identity and 30% coverage were used as cutoff.
Ultra-conserved elements in the human genome There are 481 segments longer than 200 bp that are absolutely conserved (100% identity with no insertions or deletions) between orthologous regions of the human, rat and mouse genomes. Nearly all of these segments are also conserved in the chicken and dog genomes, with an average of 95% and 99% identity, respectively. Many are also significantly conserved in fish. These ultraconserved elements of the human genome are most often located either overlapping exons in genes involved in RNA processing or in introns or nearby genes involved in regulation of transcription and development. Along with more than 5,000 sequences of over 100bp that are absolutely conserved among the three sequenced mammals, these represent a class of genetic elements whose functions and evolutionary origins are yet to be determined, but which are more highly conserved between these species than proteins, and appear to be essential for the ontogeny of mammals and other vertebrates.
Reference: Ultraconserved elements in the human genome Gill Bejerano, Michael Pheasant, Igor Makunin, Stuart Stephen, W. James Kent, John S. Mattick, and David Haussler Published online May 6 2004; 10.1126/science.1098119 (Science Express Reports)
A second version DNA sequence assembly of human chromosome 7 (named CRA_TCAGchr7v2) has been generated and is publicly available. The assembly, which encompasses 158,329,839 nucleotides (nt) of DNA, represents the most complete description of this chromosome available (download). It was generated utilizing Celera whole genome shotgun information, cloned-based sequence from Genbank, and targeted sequence from The Centre for Applied Genomics (TCAG). In comparison to the first version release (Science, 2003) an additional 376,050 nt is present and 23 additional gaps have been filled. The CRA_TCAGchr7v2 assembly, amongst other variations, contains 704,297 nt of verified sequence not found in NCBI's Build 34 of chromosome 7.
To fulfill the objective of having every biological and medically relevant feature annotated along the sequence assembly, all annotation records have been updated and new ones added. For example, the new assembly displays information from the most current RefSeq, Ensembl, Celera published, UCSC, TIGR Human Gene Index, dbEST and mRNA datasets. Unique TCAG annotations have also been added including a new 'experimentally-confirmed' gene track, all known structural features, as well as 619 new clinical cases of defined phenotype. A new proprietary segmental duplication browser has also been developed for the chromosome 7 browser.
The February 5th updates and additions include..... -78 new clinical cases added -Affymetrix 10K Xba 131 SNP data track added -Predicted microRNA targets added (Lewis BP, Shih IH, Jones-Rhoades MW, Bartel DP, Burge CB. Cell. 115(7): 787-98.) -Gene expression data from GNF gene expression atlas added for 381 genes -New SNP data added -Updated HUGO gene nomenclature
The December 19th updates and additions include..... -TCAG gene annotations updated -FOSMID data track added -721 new mRNA sequences to database -110 new clinical cases -Spectral genomics clones added -Updated HUGO gene nomenclature -Kazusa Institute gene expression (FLJ, KIAA) information added in gene summary page -Affymetrix gene chip information added in gene summary page -Chimpanzee genomic clones added to database
The November 6th updates and additions include..... -Fugu comparative sequence conservation -RIKEN mouse imprinted gene data added (PubMed 12819139) -140 new clinical cases -New gene predictions added -New segmental duplication analysis -Updated HUGO gene nomenclature -ABI Gene Expression Assay on Demand information added in gene summary page -Bay Genomics GeneTrap ES cell line information added in gene summary page -New STS markers added
TCAG chromosome 7 sequence and annotations have been made available at NCBI, and can be viewed using Map Viewer.
The September 18th update includes mouse, rat and tetraodon comparative sequence data, updated RefSeq and Ensembl annotations. Promoter data from the Eukaryotic Promoter Database has been added along with new EST and mRNA sequences. The updates and additions include..... -Updated RefSeq annotations -Updated Ensembl annotations -Mouse comparative sequence conservation -Rat comparative sequence conservation -Tetraodon comparative sequence conservation -Promoters from Eukaryotic Promoter Database added -Promoters added from "Identification and functional analysis of human transcriptional promoters" Genome Res. 2003 13: 308-312 (website) -1951 new EST sequences -471 new mRNA sequences -138 new BAC end sequences
The August 21st updates include 243 new clinical cases, and updated gene annotations. The updates and additions include..... -Updated TCAG annotations -243 new clinical cases -2 new mouse imprinted genes (Pon2, Pon3) -SHH regulatory element (ZRS) -New 5' sequence for DLX6
The July 10th updates include the updated TIGR Human Gene Index release 12.0 (June 2003). The latest June dbEST dataset has been added, along with a number of new clinical data entries. New additional 5' sequence for 158 genes has also been added to the current annotations from the Database of Transcription Start Sites (DBTSS). The updates and additions include..... - 4650 new EST seqeunces (June 30th dbEST updates) - TIGR HGI release 12.0 displayed June 2003 (29212 entries) - Additional 5' sequence for 158 genes added from Database of Transcription Start Sites (DBTSS) - 232 clinical cases added
WashU publishes chromosome 7 paper in Nature.
The June 15th updates include new mRNA and BAC end sequences and new STS and genetic markers. DLX5 has been added as an imprinted gene in the structural features track. The updates and additions include..... - 207 new mRNA sequences added - STS and genetic marker datasets updated - BAC end data updated (153 new entries) - New imprinted gene DLX5 added
Multiple tracks in GBrowse and the associated databases have been updated: - 183 new gene structures (272 with variants) - New track: RIKEN DIGIT gene predictions (211 entries) - 66 new breakpoints - 72 new STS markers - 21 new FISH clones - 117 new BAC ends - 4 new genetic markers - 132 sequence variations, 2 mouse imprinted genes were added in the structural features - updated ab initio gene predictions (Genscan and HMMgene)
799 new mRNAs and 1212 new ESTs added to GBrowse.
BLAT alignment tool implemented for sequence search.
The International Human Genome Sequencing Consortium announced the completion of the human DNA sequence.
TCAG publishes chromosome 7 paper in Science. Launch of this website.
|
|