[X] Close
You are about to erase all the values you have customized, search history, page format, etc.
Click here to RESET all values       Click here to GO BACK without resetting any value
Item 1 of about 1
1. Haeussler M, Gerner M, Bergman CM: Annotating genes and genomes with DNA sequences extracted from biomedical articles. Bioinformatics; 2011 Apr 01;27(7):980-6
PDF icon [Fulltext service] Download fulltext PDF of this article and others, as many as you want.

  • [Source] The source of this record is MEDLINE®, a database of the U.S. National Library of Medicine.
  • [Title] Annotating genes and genomes with DNA sequences extracted from biomedical articles.
  • MOTIVATION: Increasing rates of publication and DNA sequencing make the problem of finding relevant articles for a particular gene or genomic region more challenging than ever.
  • Existing text-mining approaches focus on finding gene names or identifiers in English text.
  • These are often not unique and do not identify the exact genomic location of a study.
  • RESULTS: Here, we report the results of a novel text-mining approach that extracts DNA sequences from biomedical articles and automatically maps them to genomic databases.
  • We find that ∼20% of open access articles in PubMed central (PMC) have extractable DNA sequences that can be accurately mapped to the correct gene (91%) and genome (96%).
  • We illustrate the utility of data extracted by text2genome from more than 150 000 PMC articles for the interpretation of ChIP-seq data and the design of quantitative reverse transcriptase (RT)-PCR experiments.
  • CONCLUSION: Our approach links articles to genes and organisms without relying on gene names or identifiers.
  • It also produces genome annotation tracks of the biomedical literature, thereby allowing researchers to use the power of modern genome browsers to access and analyze publications in the context of genomic data.
  • AVAILABILITY AND IMPLEMENTATION: Source code is available under a BSD license from http://sourceforge.net/projects/text2genome/ and results can be browsed and downloaded at http://text2genome.org.
  • [MeSH-major] DNA / chemistry. Data Mining / methods. Genes. Genome. Molecular Sequence Annotation. PubMed
  • [MeSH-minor] Base Sequence. Chromatin Immunoprecipitation. Databases, Nucleic Acid. Reverse Transcriptase Polymerase Chain Reaction. Sequence Analysis, DNA. Software

  • MedlinePlus Health Information. consumer health - Genes and Gene Therapy.
  • COS Scholar Universe. author profiles.
  • [Email] Email this result item
    Email the results to the following email address:   [X] Close
  • [Cites] Proc Natl Acad Sci U S A. 2001 Jan 16;98(2):381-2 [11209037.001]
  • [Cites] Acta Crystallogr D Biol Crystallogr. 2010 Dec;66(Pt 12):1269-70 [21123865.001]
  • [Cites] Genome Biol. 2002 Jun 18;3(7):RESEARCH0034 [12184808.001]
  • [Cites] Nucleic Acids Res. 2003 Jan 1;31(1):172-5 [12519974.001]
  • [Cites] BMC Bioinformatics. 2001;2:7 [11667947.001]
  • [Cites] Nucleic Acids Res. 2004 Jan 1;32(Database issue):D493-6 [14681465.001]
  • [Cites] Nucleic Acids Res. 1987 May 11;15(9):3937 [3588316.001]
  • [Cites] Nucleic Acids Res. 1987 Nov 11;15(21):9083-4 [3684584.001]
  • [Cites] Genome Res. 1996 Oct;6(10):995-1001 [8908519.001]
  • [Cites] Science. 1965 Mar 19;147(3664):1462-5 [14263761.001]
  • [Cites] BMC Bioinformatics. 2005;6 Suppl 1:S12 [15960824.001]
  • [Cites] Bioinformatics. 2005 Nov 1;21(21):4046-53 [16159926.001]
  • [Cites] BMC Bioinformatics. 2006;7:260 [16712726.001]
  • [Cites] Nucleic Acids Res. 2007 Jan;35(Database issue):D26-31 [17148475.001]
  • [Cites] BMC Bioinformatics. 2007;8:280 [17678535.001]
  • [Cites] Genome Biol. 2008;9(2):R31 [18271954.001]
  • [Cites] Bioinformatics. 2008 Aug 15;24(16):i126-132 [18689813.001]
  • [Cites] Genome Biol. 2008;9 Suppl 2:S3 [18834494.001]
  • [Cites] Genome Biol. 2008;9 Suppl 2:S8 [18834499.001]
  • [Cites] Hum Mol Genet. 2008 Dec 1;17(23):3740-60 [18799476.001]
  • [Cites] Nucleic Acids Res. 2009 Jan;37(Database issue):D690-7 [19033362.001]
  • [Cites] Nature. 2009 Feb 12;457(7231):854-8 [19212405.001]
  • [Cites] Bioinformatics. 2009 Jun 1;25(11):1422-3 [19304878.001]
  • [Cites] Nucleic Acids Res. 2009 Jul;37(Web Server issue):W147-52 [19468046.001]
  • [Cites] Nucleic Acids Res. 2010 Jan;38(Database issue):D563-9 [19884133.001]
  • [Cites] Nucleic Acids Res. 2010 Jan;38(Database issue):D613-9 [19906737.001]
  • [Cites] Nucleic Acids Res. 2010 Jan;38(Database issue):D46-51 [19910366.001]
  • [Cites] BMC Bioinformatics. 2010;11:85 [20149233.001]
  • [Cites] BMC Bioinformatics. 2010;11:410 [20682041.001]
  • [Cites] Bioinformatics. 2010 Nov 1;26(21):2801-2 [20829445.001]
  • [Cites] Genome Res. 2002 Apr;12(4):656-64 [11932250.001]
  • (PMID = 21325301.001).
  • [ISSN] 1367-4811
  • [Journal-full-title] Bioinformatics (Oxford, England)
  • [ISO-abbreviation] Bioinformatics
  • [Language] eng
  • [Grant] United Kingdom / Biotechnology and Biological Sciences Research Council / / BB/G000093/1; United Kingdom / Biotechnology and Biological Sciences Research Council / / BB/E012868/1
  • [Publication-type] Journal Article; Research Support, Non-U.S. Gov't
  • [Publication-country] England
  • [Chemical-registry-number] 9007-49-2 / DNA
  • [Other-IDs] NLM/ PMC3065681
  •  go-up   go-down


Advertisement





Advertisement