EafToCha
Converts ELAN .eaf files to CLAN CHAT transcripts
All tiers will be interpreted as transcription of participant speech. If some tiers contain other annotations, use the –ignoreTiers command line switch to exclude them from the conversion using a regular expression, e.g.: –ignoreTiers=Noise|Topic
Deserializing from “ELAN EAF Transcript” text/x-eaf+xml
Command-line configuration parameters for deserialization:
--commentLayer= Layer |
Commentary |
--noiseLayer= Layer |
Noise annotations |
--lexicalLayer= Layer |
Lexical tags |
--pronounceLayer= Layer |
Manual pronunciation tags |
--authorLayer= Layer |
Name of transcriber |
--dateLayer= Layer |
Document date |
--languageLayer= Layer |
The language of the whole transcript |
--phraseLanguageLayer= Layer |
For tagging individual phrases with a language |
--useConventions= Boolean |
Whether to use text conventions for comment, noise, lexical, and pronounce annotations |
--ignoreBlankAnnotations= Boolean |
Whether to skip annotations with no label, or process them |
--minimumTurnPauseLength= Double |
Minimum amount of time between two turns by the same speaker, with no intervening speaker, for which the inter-turn pause counts as a turn change boundary. If the pause is shorter than this, the turns are merged into one. |
Serializing to “CLAN CHAT transcript” text/x-chat
Command-line configuration parameters for serialization:
--cUnitLayer= Layer |
Layer for marking c-units |
--tokenLayer= Layer |
Output word tokens come from this layer |
--disfluencyLayer= Layer |
Layer for disfluency annotations |
--nonWordLayer= Layer |
Layer for non-word noises |
--expansionLayer= Layer |
Layer for expansion annotations |
--errorsLayer= Layer |
Layer for error annotations |
--linkageLayer= Layer |
Layer for linkage annotations |
--repetitionsLayer= Layer |
Layer for repetition annotations |
--retracingLayer= Layer |
Layer for retracing annotations |
--pauseLayer= Layer |
Layer for marking unfilled pauses |
--completionLayer= Layer |
Layer for completion annotations |
--morLayer= Layer |
Layer for morphosyntactic tags |
--morPrefixLayer= Layer |
Layer for prefixes in MOR tags |
--morPartOfSpeechLayer= Layer |
Layer for parts of speech in MOR tags |
--morPartOfSpeechSubcategoryLayer= Layer |
Layer for subcategories of parts of speech in MOR tags |
--morStemLayer= Layer |
Layer for stems in MOR tags |
--morFusionalSuffixLayer= Layer |
Layer for fusional suffixes in MOR tags |
--morSuffixLayer= Layer |
Layer for (non-fusional) suffixes in MOR tags |
--morGlossLayer= Layer |
Layer for English glosses in MOR tags |
--gemLayer= Layer |
Layer for gems |
--transcriberLayer= Layer |
Layer for transcriber name |
--languagesLayer= Layer |
Layer for transcriber language |
--dateLayer= Layer |
Layer for date of the interaction |
--locationLayer= Layer |
Layer for location of the interaction |
--recordingQualityLayer= Layer |
Layer for recording quality |
--roomLayoutLayer= Layer |
Layer for room layout |
--tapeLocationLayer= Layer |
Layer for tape and location on the tape covered by the transcription |
--targetParticipantLayer= Layer |
Layer for identifying target participants |
--SESLayer= Layer |
Layer for SES |
--roleLayer= Layer |
Layer for role |
--educationLayer= Layer |
Layer for education |
--sexLayer= Layer |
Layer for sex |
--customLayer= Layer |
Layer for custom |
--corpusLayer= Layer |
Layer for corpus |
--languageLayer= Layer |
Layer for language |
--ageLayer= Layer |
Layer for age |
--groupLayer= Layer |
Layer for group |
--includeTimeCodes= Boolean |
Include utterance sychronization information when exporting transcripts |
--splitMorTagGroups= Boolean |
Split alternative MOR taggings into separate annotations |
--splitMorWordGroups= Boolean |
Split MOR word morphemes (clitics, components of compounds ) into separate annotations. This is only supported when Split MOR Tag Groups is also enabled. |