The Locator -- [(subject = "Machine translating")]

213 records matched your query       


Record 36 | Previous Record | MARC Display | Next Record | Search Results
Title:
MADCAT phase 3 training set [electronic resource].
Format:
[electronic resource].
Publisher:
Linguistic Data Consortium,
Copyright Date:
c2013
Description:
1 DVD ; 4 3/4 in.
Subject:
Arabic language--Data processing.--Data processing.
Arabic language--Machine translating.
Arabic language--Translating into English.
Machine translating.
Other Authors:
Lee, David.
Linguistic Data Consortium.
Notes:
Title from disc label. Data type: Text. Data sources: Newsgroups, newswire, weblogs. Applications: Handwriting recognition, machine translation. "LDC2013T16". Authors: David Lee, Safa Ismael, Dave Doermann, Stephanie Strassel, Zhiyi Song, Stephen Grimes.
Summary:
"MADCAT (Multilingual Automatic Document Classification Analysis and Translation) Phase 3 Training Set contains all training data created by the Linguistic Data Consortium (LDC) to support Phase 3 of the DARPA MADCAT Program. The data in this release consists of handwritten Arabic documents, scanned at high resolution and annotated for the physical coordinates of each line and token. Digital transcripts and English translations of each document are also provided, with the various content and annotation layers integrated in a single MADCAT XML output. The goal of the MADCAT program is to automatically convert foreign text images into English transcripts." -- LDC online catalogue.
Series:
LDC corpora ; LDC2013T15
ISBN:
9781585636518
1585636517
OCLC:
(OCoLC)863541408
Locations:
OVUX522 -- University of Iowa Libraries (Iowa City)

Initiate Another SILO Locator Search

This resource is supported by the Institute of Museum and Library Services under the provisions of the Library Services and Technology Act as administered by State Library of Iowa.