Please use this identifier to cite or link to this item: http://hdl.handle.net/1959.14/148256
- Title
- From archive to corpus : transcription and annotation in the creation of signed language corpora
- Related
- International journal of corpus linguistics, Vol. 15, No. 1 (2010), p. 106-131
- DOI
- 10.1075/ijcl.15.1.05joh
- Publisher
- John Benjamins Publishing
- Date
- 2010
- FoR/RFCD Code(s)
- 200400 Linguistics
- 170200 Cognitive Sciences
- Author/Creator
- Johnston, Trevor
- Description
- Annotations are an important resource in corpus-based linguistic research. In fact, the most important feature of a modern signed language corpus should be that it has been annotated rather than simply transcribed. Digital multi-media annotation software can now transform language recordings into machine-readable texts using gloss-based annotations without it first being necessary to transcribe these utterances, provided that sign tokens are identified and discriminated according to type. Further annotations can subsequently be appended to these units. However, unique identifiers of sign types (or ‘ID-glosses’) can only be used if a comprehensive reference lexical database of the language already exists. In order to create a basic multi-purpose reference signed language corpus, therefore, linguists should prioritize annotation using ID-glosses above transcription. The effort expended in creating a transcription that does not facilitate the unique identification of sign types will not result in a machine-readable corpus in any meaningful sense, contrary to expectations.
- Description
- 26 page(s)
- Subject Keyword
- 200400 Linguistics
- Subject Keyword
- 170200 Cognitive Sciences
- Subject Keyword
- corpus linguistics
- Subject Keyword
- transcription, annotation
- Subject Keyword
- language documentation
- Subject Keyword
- sign language
- Subject Keyword
- Auslan (Australian Sign Language)
- Resource Type
- journal article
- Organisation
- Macquarie University. Dept. of Linguistics
- Identifier
- http://hdl.handle.net/1959.14/148256
- Identifier
- ISSN:1384-6655
- Identifier
- mq-rm-2010004084
- Language
- eng
- Reviewed
