Title from "index.html" file; additional information from the pertinent LDC catalog webpage. "LDC2005T28." "This corpus contains source data for the 2004 TREC HARD (High Accuracy Retrieval from Documents) Evaluation. HARD 2004 was a track within the NIST Text REtrieval Conference (TREC), with the objective of achieving high accuracy retrieval from documents by leveraging additional information about the searcher and/or the search context, through techniques like passage retrieval and the use of targeted interaction with the searcher ... The corpus comprises eight English newswire and web text sources from January-December 2003. The sources are: AFE: Agence France Presse (English); APE: Associated Press Newswire; CNE: Central News Agency Taiwan (English); LAT: Los Angeles Times/Washington Post; NYT: New York Times; SLN: Salon.com; UME: Ummah Press (English); XIE: Xinhua News Agency (English)."--index.html.
This resource is supported by the Institute of Museum and Library Services under the provisions of the Library Services and Technology Act as administered by the State Library of Iowa.