Please use the following text to cite this item:
De Smet, Hendrik; Flach, Susanne; Diller, Hans-Jürgen and Tyrkkö, Jukka, 2015,
The Corpus of Late Modern English Texts, version 3.1, CLARIN DSpace,
http://hdl.handle.net/20.500.14106/2574.
| dc.contributor.author | De Smet, Hendrik |
| dc.contributor.author | Flach, Susanne |
| dc.contributor.author | Diller, Hans-Jürgen |
| dc.contributor.author | Tyrkkö, Jukka |
| dc.date.accessioned | 2024-11-25T15:04:52Z |
| dc.date.available | 2024-11-25T15:04:52Z |
| dc.date.issued | 2015-10 |
| dc.description | The Corpus of Late Modern English Texts (CLMET) is a corpus of roughly 35 million words of British English from 1710–1920, grouped into three 70-year periods. The history, versions, and details of corpus composition are documented on the CLMET3.0 website. CLMET3.0 is currently distributed in three formats: (i) plain text, (ii) plain text with one sentence per line, and (iii) a tagged version (one sentence per line). Version CLMET3.1 makes CLMET available in CQP format for use in CWB and CQPweb-based corpus environments. The selection of texts is unchanged, but CLMET3.1 includes additions and changes in linguistic annotation. These changes are of three general types: (a) retokenization and retagging, (b) fixes for some systematic issues that come with historical data, and (c) enhanced annotation through added lemmas and simplified part-of-speech class tags. |
| dc.identifier | 2574 |
| dc.identifier.uri | http://hdl.handle.net/20.500.14106/2574 |
| dc.language | English |
| dc.language.iso | eng |
| dc.publisher | KU Leuven |
| dc.relation.ispartof | Oxford Text Archive Core Collection |
| dc.relation.isreferencedby | https://essenglish.org/messenger/wp-content/uploads/sites/2/2016/01/192-29-35.pdf |
| dc.rights | Creative Commons - Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) |
| dc.rights.label | PUB |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/4.0/ |
| dc.source.uri | https://fedora.clarin-d.uni-saarland.de/clmet/clmet.html |
| dc.source.uri | https://perswww.kuleuven.be/%7Eu0044428/clmet3_0.htm |
| dc.subject | Linguistic corpora |
| dc.subject.lcsh | Linguistic analysis (Linguistics) |
| dc.subject.lcsh | Linguistics |
| dc.title | The Corpus of Late Modern English Texts, version 3.1 |
| dc.title.alternative | CLMET3.1 |
| dc.type | corpus |
| local.branding | Literary and Linguistic Data Service |
| local.contact.person | Hendrik De Smet hendrik.desmet@kuleuven.be KU Leuven |
| local.files.count | 5 |
| local.files.size | 723314347 |
| local.has.files | yes |
| local.hasCMDI | false |
| local.hidden | false |
| local.language.name | English |
| local.size.info | 34386225 tokens |
| local.size.info | 333 texts |
| local.size.info | 212 other |
| local.size.info | 687 mb |
| metashare.ResourceInfo#ContentInfo.mediaType | text |
| otaterms.date.range | 1700-1799 |
| otaterms.date.range | 1800-1899 |
| otaterms.date.range | 1900-1999 |
This item is publicly available and licensed under Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

