Title from LDC online catalogue. Data type: Text. Data source: Broadcast conversation. Application: Machine translation. Authors: Lauren Friedman, Hubert Jin, Zhiyi Song, Gary Krug, Stephanie Strassel.
Summary:
"GALE phase 2 Chinese broadcast conversation parallel text part 1 was developed by the Linguistic Data Consortium (LDC). Along with other corpora, the parallel text in this release comprised training data for phase 2 of the DARPA GALE (Global Autonomous Language Exploitation) Program. This corpus contains Chinese source text and corresponding English translations selected from broadcast conversation (BC) data collected by LDC in 2006 and 2007 and transcribed by LDC or under its direction." -- LDC online catalogue.
This resource is supported by the Institute of Museum and Library Services under the provisions of the Library Services and Technology Act as administered by State Library of Iowa.