Please use the following text to cite this item or export to a predefined format:
Nesi, Hilary; Gardner, Sheena; Thompson, Paul and Wickens, Paul, 2008,
British Academic Written English Corpus, CLARIN DSpace,
http://hdl.handle.net/20.500.14106/2539.
| dc.contributor | Spencer-Oatley, Helen |
| dc.contributor | Durrant, Philip |
| dc.contributor.author | Nesi, Hilary |
| dc.contributor.author | Gardner, Sheena |
| dc.contributor.author | Thompson, Paul |
| dc.contributor.author | Wickens, Paul |
| dc.date.accessioned | 2018-07-27 |
| dc.date.accessioned | 2022-09-30T11:26:08Z |
| dc.date.available | 2022-09-30T11:26:08Z |
| dc.date.created | 2004 |
| dc.date.issued | 2008-10-01 |
| dc.description.abstract | The BAWE corpus contains 2761 pieces of proficient assessed student writing, ranging in length from about 500 words to about 5000 words. Holdings are fairly evenly distributed across four broad disciplinary areas (Arts and Humanities, Social Sciences, Life Sciences and Physical Sciences) adn across four levels of study (undergraduate and taught masters level). Thirty-five disciplines are represented. The assignments have been annotated using a system devised in accordance with the TEI guidelines. There is a dtd file which must be kept in the same folder as the corpus files, named tei_bawe.dtd and the holdings are described in an Excel spreadsheet 'BAWE.xls'. The transcription and mark-up conventions are described in the BAWE manual document, which is in PDF format. In 2022 tagged versions of the files, with part-of-speech and constituency annotations, were added by Philip Durrant of the University of Exeter. The annotated corpus is available in three versions, one (in directory 'conll') is the original output from the Stanford Core NLP parser one ('csv') is a slightly edited version in .csv format a third ('csv_categorized') is the .csv files organized by discipline and year group. |
| dc.description.sponsorship | Economic and Social Research Council (ESRC) |
| dc.format.extent | CollectionText 22,188 files: ca. 1.1 GB |
| dc.format.medium | Digital bitstream |
| dc.identifier | ota:2539 |
| dc.identifier.uri | http://hdl.handle.net/20.500.14106/2539 |
| dc.language | English |
| dc.language.iso | eng |
| dc.publisher | University of Oxford |
| dc.relation.ispartof | Oxford Text Archive Core Collection |
| dc.rights | Distributed by the University of Oxford under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. |
| dc.rights.label | PUB |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/3.0/ |
| dc.subject.lcsh | Linguistics |
| dc.subject.lcsh | Linguistics analysis (Linguistics) |
| dc.subject.other | Linguistic corpora |
| dc.title | British Academic Written English Corpus |
| dc.title.alternative | BAWE |
| dc.type | Corpus |
| local.branding | Oxford Text Archive |
| local.branding | Oxford Text Archive |
| local.files.count | 2 |
| local.files.size | 295038463 |
| local.has.files | yes |
| local.language.name | English |
Collections
This item isPublicly Available
and licensed under:
Files in this item
This item contains no files.

