EafToCha

Converts ELAN .eaf files to CLAN CHAT transcripts

All tiers are interpreted as transcription of participant speech. If some tiers contain other annotations, use the `--ignoreTiers` command-line switch with a regular expression to exclude them from the conversion, e.g. `--ignoreTiers=Noise|Topic` (quote the value in shells that treat `|` as a pipe character).
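As a rough illustration of how such a pattern selects tiers, the following Python sketch filters a list of tier names with the example expression. The function name and the tier names are hypothetical, and the sketch assumes the pattern must match the whole tier name; the converter itself may match differently (for example, anywhere within the name).

```python
import re

def tiers_to_convert(tier_names, ignore_pattern):
    """Return the tier names that are NOT excluded by ignore_pattern.

    Assumption: the pattern is matched against the whole tier name.
    """
    ignore = re.compile(ignore_pattern)
    return [name for name in tier_names if not ignore.fullmatch(name)]

# Hypothetical tier names, for illustration only
tiers = ["Interviewer", "Participant", "Noise", "Topic"]
print(tiers_to_convert(tiers, "Noise|Topic"))
# prints ['Interviewer', 'Participant']
```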

Deserializing from “ELAN EAF Transcript” text/x-eaf+xml

Command-line configuration parameters for deserialization:


`--commentLayer=`Layer	Commentary
`--noiseLayer=`Layer	Noise annotations
`--lexicalLayer=`Layer	Lexical tags
`--pronounceLayer=`Layer	Manual pronunciation tags
`--authorLayer=`Layer	Name of transcriber
`--dateLayer=`Layer	Document date
`--languageLayer=`Layer	The language of the whole transcript
`--phraseLanguageLayer=`Layer	For tagging individual phrases with a language
`--useConventions=`Boolean	Whether to use text conventions for comment, noise, lexical, and pronounce annotations
`--ignoreBlankAnnotations=`Boolean	Whether to skip annotations with no label, or process them
`--minimumTurnPauseLength=`Double	Minimum amount of time between two turns by the same speaker, with no intervening speaker, for which the inter-turn pause counts as a turn change boundary. If the pause is shorter than this, the turns are merged into one.
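The merging rule described for `--minimumTurnPauseLength` can be sketched as follows. This is not the converter's implementation; the `Turn` class and `merge_turns` function are hypothetical names, times are assumed to be in seconds, and overlapping speech is handled only in the simplest way (turns are processed in order of start time).

```python
from dataclasses import dataclass

@dataclass
class Turn:
    speaker: str
    start: float  # assumed to be seconds
    end: float

def merge_turns(turns, minimum_turn_pause_length):
    """Merge consecutive turns by the same speaker when the pause
    between them is shorter than minimum_turn_pause_length."""
    merged = []
    for turn in sorted(turns, key=lambda t: t.start):
        previous = merged[-1] if merged else None
        # Only the immediately preceding turn is considered, so a turn by
        # another speaker in between prevents merging.
        if (previous is not None
                and previous.speaker == turn.speaker
                and turn.start - previous.end < minimum_turn_pause_length):
            previous.end = max(previous.end, turn.end)
        else:
            merged.append(Turn(turn.speaker, turn.start, turn.end))
    return merged

turns = [
    Turn("A", 0.0, 1.0),
    Turn("A", 1.2, 2.0),   # 0.2 s pause: merged with the previous turn
    Turn("A", 3.0, 4.0),   # 1.0 s pause: counts as a turn boundary
    Turn("B", 4.1, 5.0),
    Turn("A", 5.05, 6.0),  # short gap, but speaker B intervened
]
for turn in merge_turns(turns, 0.5):
    print(turn.speaker, turn.start, turn.end)
```

With a threshold of 0.5, the first two turns by speaker A collapse into one, while the remaining turns stay separate.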

Serializing to “CLAN CHAT transcript” text/x-chat

Command-line configuration parameters for serialization:


`--cUnitLayer=`Layer	Layer for marking c-units
`--tokenLayer=`Layer	Output word tokens come from this layer
`--disfluencyLayer=`Layer	Layer for disfluency annotations
`--nonWordLayer=`Layer	Layer for non-word noises
`--expansionLayer=`Layer	Layer for expansion annotations
`--errorsLayer=`Layer	Layer for error annotations
`--linkageLayer=`Layer	Layer for linkage annotations
`--repetitionsLayer=`Layer	Layer for repetition annotations
`--retracingLayer=`Layer	Layer for retracing annotations
`--pauseLayer=`Layer	Layer for marking unfilled pauses
`--completionLayer=`Layer	Layer for completion annotations
`--morLayer=`Layer	Layer for morphosyntactic tags
`--morPrefixLayer=`Layer	Layer for prefixes in MOR tags
`--morPartOfSpeechLayer=`Layer	Layer for parts of speech in MOR tags
`--morPartOfSpeechSubcategoryLayer=`Layer	Layer for subcategories of parts of speech in MOR tags
`--morStemLayer=`Layer	Layer for stems in MOR tags
`--morFusionalSuffixLayer=`Layer	Layer for fusional suffixes in MOR tags
`--morSuffixLayer=`Layer	Layer for (non-fusional) suffixes in MOR tags
`--morGlossLayer=`Layer	Layer for English glosses in MOR tags
`--gemLayer=`Layer	Layer for gems
`--transcriberLayer=`Layer	Layer for transcriber name
`--languagesLayer=`Layer	Layer for transcriber language
`--dateLayer=`Layer	Layer for date of the interaction
`--locationLayer=`Layer	Layer for location of the interaction
`--recordingQualityLayer=`Layer	Layer for recording quality
`--roomLayoutLayer=`Layer	Layer for room layout
`--tapeLocationLayer=`Layer	Layer for tape and location on the tape covered by the transcription
`--targetParticipantLayer=`Layer	Layer for identifying target participants
`--SESLayer=`Layer	Layer for SES
`--roleLayer=`Layer	Layer for role
`--educationLayer=`Layer	Layer for education
`--sexLayer=`Layer	Layer for sex
`--customLayer=`Layer	Layer for custom
`--corpusLayer=`Layer	Layer for corpus
`--languageLayer=`Layer	Layer for language
`--ageLayer=`Layer	Layer for age
`--groupLayer=`Layer	Layer for group
`--includeTimeCodes=`Boolean	Include utterance synchronization information when exporting transcripts
`--splitMorTagGroups=`Boolean	Split alternative MOR taggings into separate annotations
`--splitMorWordGroups=`Boolean	Split MOR word morphemes (clitics, components of compounds) into separate annotations. This is only supported when `--splitMorTagGroups` is also enabled.
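To illustrate what the two MOR splitting options refer to, here is a minimal Python sketch. It assumes the usual CHAT %mor delimiters (`^` between alternative taggings, `~` before a post-clitic, `$` after a pre-clitic) and deliberately leaves compound components (`+`) unsplit; the function names are hypothetical and the converter's actual behavior may differ.

```python
import re

def split_mor_tag_groups(mor_group):
    """Split alternative taggings, assumed to be separated by '^'."""
    return mor_group.split("^")

def split_mor_word_groups(mor_tag):
    """Split clitics off a single tagging, assuming '~' and '$' delimiters.

    Compound components ('+') are not handled in this sketch.
    """
    return [part for part in re.split(r"[~$]", mor_tag) if part]

# Alternatives are split first, then each alternative is split into words,
# mirroring the stated dependency between the two options.
for alternative in split_mor_tag_groups("pro|it~aux|be&3S"):
    print(split_mor_word_groups(alternative))
# prints ['pro|it', 'aux|be&3S']

print(split_mor_tag_groups("v|run^n|run"))
# prints ['v|run', 'n|run']
```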