Please use the following text to cite this item or export to a predefined format:
University of Oxford, 2003, The Emille Corpus (Beta Release Version), CLARIN DSpace, http://hdl.handle.net/20.500.14106/2460.
dc.contributorMcEnery, A.M. Department of Linguistics and Modern English Language Lancaster University Lancaste
dc.contributor.editorMcEnery, A.M.
dc.contributor.editorBaker, Paul
dc.contributor.editorHardie, Andrew
dc.date.accessioned2018-07-27
dc.date.accessioned2022-08-19T15:53:13Z
dc.date.available2022-08-19T15:53:13Z
dc.date.created2003
dc.date.issued2003-05-02
dc.description.abstractThe collection consists of: Thirty million words of monolingual written data (Gujarati, Tamil, Hindi, Punjabi-news website articles) 600,000 words of monolingual spoken data (Hindi, Urdu, Punjabi, Bengali, Gujarati-radio broadcasts) 120,000 words of parallel data in each of English, Hindi, Urdu, Punjabi, Bengali and Gujarati (U.K. government leaflets). Further information available at: http://www.emille.lancs.ac.uk/home.htm
dc.description.sponsorshipEngineering and Physical Science Research Council (EPSRC)
dc.format.extentText data 6551 files : ca. 482 MB
dc.format.mediumDigital bitstream
dc.identifierota:2460
dc.identifier.urihttp://hdl.handle.net/20.500.14106/2460
dc.languageEnglish
dc.languageGujarati
dc.languageTamil
dc.languageHindi
dc.languagePanjabi
dc.languageUrdu
dc.languageBengali
dc.language.isoeng
dc.language.isoguj
dc.language.isotam
dc.language.isohin
dc.language.isopan
dc.language.isourd
dc.language.isoben
dc.publisherUniversity of Oxford
dc.relation.ispartofOxford Text Archive Core Collection
dc.rightsDistributed by the University of Oxford under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.
dc.rights.labelPUB
dc.rights.urihttp://creativecommons.org/licenses/by-nc-sa/3.0/
dc.subject.lcshSouth Asia--Languages
dc.subject.lcshIndo-Aryan languages, Modern
dc.subject.lcshLinguistic analysis (Linguistics)
dc.subject.otherLinguistic corpora
dc.titleThe Emille Corpus (Beta Release Version)
dc.typeCorpus
local.brandingOxford Text Archive
local.brandingOxford Text Archive
local.files.count9
local.files.size113513930
local.has.filesyes
local.language.nameEnglish
local.language.nameGujarati
local.language.nameTamil
local.language.nameHindi
local.language.namePanjabi
local.language.nameUrdu
local.language.nameBengali
otaterms.date.range2000-present
 Files in this item
This item contains no files.