1. Poulter GL, Rubin DL, Altman RB, Seoighe C: MScanner: a classifier for retrieving Medline citations. BMC Bioinformatics; 2008 Feb 19;9:108

  • [Source] The source of this record is MEDLINE®, a database of the U.S. National Library of Medicine.
  • [Title] MScanner: a classifier for retrieving Medline citations.
  • BACKGROUND: Keyword searching through PubMed and other systems is the standard means of retrieving information from Medline.
  • However, ad-hoc retrieval systems do not meet all of the needs of databases that curate information from literature, or of text miners developing a corpus on a topic that has many terms indicative of relevance.
  • Several databases have developed supervised learning methods that operate on a filtered subset of Medline, to classify Medline records so that fewer articles have to be manually reviewed for relevance.
  • A few studies have considered generalisation of Medline classification to operate on the entire Medline database in a non-domain-specific manner, but existing applications lack speed, available implementations, or a means to measure performance in new domains.
  • RESULTS: MScanner is an implementation of a Bayesian classifier that provides a simple web interface for submitting a corpus of relevant training examples in the form of PubMed IDs and returning results ranked by decreasing probability of relevance.
  • For maximum speed it uses the Medical Subject Headings (MeSH) and journal of publication as a concise document representation, and takes roughly 90 seconds to return results against the 16 million records in Medline.
  • The web interface provides interactive exploration of the results, and cross-validated performance evaluation on the relevant input against a random subset of Medline.
  • We describe the classifier implementation, cross-validate it on three domain-specific topics, and compare its performance to that of an expert PubMed query for a complex topic.
  • In cross-validation on the three sample topics against 100,000 random articles, the classifier achieved excellent separation of relevant and irrelevant article score distributions, ROC areas between 0.97 and 0.99, and averaged precision between 0.69 and 0.92.
  • CONCLUSION: MScanner is an effective non-domain-specific classifier that operates on the entire Medline database, and is suited to retrieving topics for which many features may indicate relevance.
  • Its web interface simplifies the task of classifying Medline citations, compared to building a pre-filter and classifier specific to the topic.
  • The data sets and open source code used to obtain the results in this paper are available on-line and as supplementary material, and the web interface may be accessed at http://mscanner.stanford.edu.
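
The approach summarised in the abstract (a Bayesian classifier over MeSH terms and journal of publication, with results ranked by decreasing relevance and evaluated by ROC area) can be illustrated with a minimal sketch. This is not the MScanner source code: the feature strings, the citation labels "A" to "C", the smoothing constant `alpha`, and the function names `train`, `score`, `rank` and `roc_area` are all hypothetical, and the model is a simplified naive Bayes that sums smoothed log-likelihood ratios over the features present in a citation.

```python
import math
from collections import Counter

def train(relevant, background, alpha=1.0):
    """Return a log-odds weight per feature.

    relevant / background: lists of feature sets (MeSH terms plus journal).
    alpha: assumed Laplace smoothing constant, not taken from the paper.
    """
    rel_counts = Counter(f for doc in relevant for f in set(doc))
    bg_counts = Counter(f for doc in background for f in set(doc))
    n_rel, n_bg = len(relevant), len(background)
    weights = {}
    for f in set(rel_counts) | set(bg_counts):
        p_rel = (rel_counts[f] + alpha) / (n_rel + 2 * alpha)
        p_bg = (bg_counts[f] + alpha) / (n_bg + 2 * alpha)
        weights[f] = math.log(p_rel / p_bg)
    return weights

def score(doc, weights):
    """Sum the weights of the features present; unseen features add 0."""
    return sum(weights.get(f, 0.0) for f in set(doc))

def rank(docs, weights):
    """Return citation IDs ordered by decreasing score."""
    return sorted(docs, key=lambda cid: score(docs[cid], weights), reverse=True)

def roc_area(pos_scores, neg_scores):
    """ROC area as the fraction of (relevant, irrelevant) pairs ranked
    correctly, counting ties as half. Quadratic time; fine for a toy."""
    wins = sum((p > n) + 0.5 * (p == n) for p in pos_scores for n in neg_scores)
    return wins / (len(pos_scores) * len(neg_scores))

# Toy training data (invented for illustration only).
relevant = [
    {"mesh:Pharmacogenetics", "journal:Pharmacogenetics"},
    {"mesh:Pharmacogenetics", "mesh:Polymorphism, Genetic"},
]
background = [
    {"mesh:Radiography", "journal:Radiology"},
    {"mesh:Dental Caries"},
    {"mesh:Polymorphism, Genetic", "journal:Radiology"},
]
weights = train(relevant, background)

# Hypothetical unlabelled citations to rank.
candidates = {
    "A": {"mesh:Pharmacogenetics", "journal:Pharmacogenetics"},
    "B": {"mesh:Radiography", "journal:Radiology"},
    "C": {"mesh:Polymorphism, Genetic"},
}
ranking = rank(candidates, weights)
print(ranking)

# ROC area on held-out scores of relevant vs. random citations.
print(roc_area([2.0, 1.0], [1.5, 0.5, 0.0]))
```

Representing each citation as a small set of MeSH and journal features keeps scoring to one dictionary lookup per feature, which is the kind of compact representation the abstract credits for the classifier's speed.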

  • (PMID = 18284683.001).
  • [ISSN] 1471-2105
  • [Journal-full-title] BMC bioinformatics
  • [ISO-abbreviation] BMC Bioinformatics
  • [Language] ENG
  • [Grant] United States / FIC NIH HHS / TW / D43 TW006993; United States / NIGMS NIH HHS / GM / U01 GM061374; United States / FIC NIH HHS / TW / D43 TW06993; United States / NIGMS NIH HHS / GM / U01GM61374
  • [Publication-type] Comparative Study; Journal Article; Research Support, N.I.H., Extramural
  • [Publication-country] England
  • [Other-IDs] NLM/ PMC2263023