Title from disc label. "LDC2008T21" "Release 1.0 of the oncology corpus of PennBioIE (aka Mining the Bibliome), the Biomedical Information Extraction Project at the University of Pennsylvania ... consists of 1414 PubMed abstracts on cancer, concentrating on molecular genetics, and comprising approximately 327,000 words of biomedical text, tokenized and annotated for paragraph, sentence, part of speech, and 24 types of biomedical named entities in five categories of interest. 318 of the abstracts have also been syntactically annotated."--Index.html file.
This resource is supported by the Institute of Museum and Library Services under the provisions of the Library Services and Technology Act as administered by State Library of Iowa.