Please use the following text to cite this item:
Nawar Halabi, 2016,
Arabic Speech Corpus, CLARIN DSpace,
http://hdl.handle.net/20.500.14106/2561.
| dc.contributor | Nawar Halabi, University of Southampton |
| dc.contributor.author | Nawar Halabi |
| dc.date.accessioned | 2018-07-27 |
| dc.date.accessioned | 2022-08-19T15:57:01Z |
| dc.date.available | 2022-08-19T15:57:01Z |
| dc.date.created | 2015 |
| dc.date.issued | 2016-06-09 |
| dc.description.abstract | The resource is a speech corpus comprising digital audio files, text transcripts, and files containing time stamps of the phoneme boundaries. There are 1813 .wav files containing spoken utterances, 1813 .lab files containing text utterances, and 1813 .TextGrid files containing the phoneme labels with time stamps of the boundaries where these occur in the .wav files. These files can be opened using Praat software. The file phonetic-transcript.txt has the form "[wav_filename]" "[Phoneme Sequence]" on every line. The file orthographic-transcript.txt has the form "[wav_filename]" "[Orthographic Transcript]" on every line. The orthography is in Buckwalter format, which is easier to handle in software that does not read Arabic script, and it can easily be converted back to Arabic. |
| dc.format.extent | CollectionSound 5,444 files: ca. 1.3 GB |
| dc.format.medium | Digital bitstream |
| dc.identifier | ota:2561 |
| dc.identifier.uri | http://hdl.handle.net/20.500.14106/2561 |
| dc.language | Arabic |
| dc.language.iso | ara |
| dc.publisher | University of Oxford |
| dc.relation.ispartof | Oxford Text Archive Core Collection |
| dc.rights | Distributed by the University of Oxford under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License. |
| dc.rights.label | PUB |
| dc.rights.uri | http://creativecommons.org/licenses/by-nc-sa/3.0/ |
| dc.subject.lcsh | Linguistics |
| dc.subject.lcsh | Linguistic analysis (Linguistics) |
| dc.subject.lcsh | Speech--Synthesis |
| dc.subject.other | Linguistic corpora |
| dc.subject.other | Speech--Research |
| dc.title | Arabic Speech Corpus |
| dc.type | CollectionSound |
| local.branding | Oxford Text Archive |
| local.files.count | 3 |
| local.files.size | 2064444 |
| local.has.files | yes |
| local.language.name | Arabic |
| local.relation.uri | https://downloads.it.ox.ac.uk/ota-public/audio/2561.zip |
| otaterms.date.range | 2000-present |
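
The transcript line format and the Buckwalter conversion described in the abstract can be sketched in Python as follows. This is a minimal illustration, not code shipped with the corpus: the function names `parse_transcript_line` and `buckwalter_to_arabic` are hypothetical, the line pattern assumes both fields are literally wrapped in double quotes as the abstract suggests (the actual files may differ), and the mapping is the standard Buckwalter table written as Unicode escapes.

```python
import re

# Standard Buckwalter-to-Arabic table (Unicode escapes keep the source ASCII-only).
BUCKWALTER_TO_ARABIC = {
    "'": "\u0621", "|": "\u0622", ">": "\u0623", "&": "\u0624",
    "<": "\u0625", "}": "\u0626", "A": "\u0627", "b": "\u0628",
    "p": "\u0629", "t": "\u062a", "v": "\u062b", "j": "\u062c",
    "H": "\u062d", "x": "\u062e", "d": "\u062f", "*": "\u0630",
    "r": "\u0631", "z": "\u0632", "s": "\u0633", "$": "\u0634",
    "S": "\u0635", "D": "\u0636", "T": "\u0637", "Z": "\u0638",
    "E": "\u0639", "g": "\u063a", "_": "\u0640", "f": "\u0641",
    "q": "\u0642", "k": "\u0643", "l": "\u0644", "m": "\u0645",
    "n": "\u0646", "h": "\u0647", "w": "\u0648", "Y": "\u0649",
    "y": "\u064a", "F": "\u064b", "N": "\u064c", "K": "\u064d",
    "a": "\u064e", "u": "\u064f", "i": "\u0650", "~": "\u0651",
    "o": "\u0652", "`": "\u0670", "{": "\u0671",
}

# Assumed line shape: "<wav_filename>" "<transcript>" (both fields double-quoted).
LINE_RE = re.compile(r'^"([^"]*)"\s+"(.*)"\s*$')


def parse_transcript_line(line):
    """Split one transcript line into (wav_filename, transcript)."""
    match = LINE_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unexpected line format: {line!r}")
    return match.group(1), match.group(2)


def buckwalter_to_arabic(text):
    """Map each Buckwalter symbol to Arabic script; other characters pass through."""
    return "".join(BUCKWALTER_TO_ARABIC.get(ch, ch) for ch in text)
```

For example, `parse_transcript_line('"sample.wav" "kitAb"')` would yield the pair `("sample.wav", "kitAb")`, where `sample.wav` is a made-up file name, and passing the second element to `buckwalter_to_arabic` converts it to Arabic script. Characters outside the table, such as spaces, are left unchanged.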

