WhisperToEaf
Converts JSON-formatted Whisper ASR files to ELAN files
Deserializing from “Whisper ASR transcript” text/whisper+plain
Command-line configuration parameters for deserialization:
--languageLayer=Layer |
The language of the whole transcript |
--minShortPauseLength=0.35 |
The minimum inter-word pause length, in seconds, before a pause counts as a ‘short pause’. |
--shortPauseLabel=(.) |
If an inter-word pause has a duration between minShortPauseLength and minMediumPauseLength, then the word before the pause will have this string appended to its label (after a space). |
--minMediumPauseLength=0.7 |
The minimum inter-word pause length, in seconds, before a pause counts as a ‘medium pause’ |
--mediumPauseLabel=(..) |
If an inter-word pause has a duration between minMediumPauseLength and minLongPauseLength, then the word before the pause will have this string appended to its label (after a space) |
--minLongPauseLength=1.4 |
The minimum inter-word pause length, in seconds, before a pause counts as a ‘long pause’. |
--longPauseLabel=(...) |
If an inter-word pause has a duration more than minLongPauseLength, then the word before the pause will have this string appended to its label (after a space) e.g. for the the length of the pause in parentheses, use: ({0.000}) |
--maxUtteranceDuration=15.0 |
Utterances longer than this will be split on longer inter-word pauses, where possible. |
--utterancePadding=0.5 |
Maximum number of seconds to subtract from the start time and add to the end time of each utterance, to allow for alignment errors of first/last word in each segment. |
Serializing to “ELAN EAF Transcript” text/x-eaf+xml
Command-line configuration parameters for serialization:
--commentLayer=Layer |
Commentary |
--noiseLayer=Layer |
Noise annotations |
--lexicalLayer=Layer |
Lexical tags |
--pronounceLayer=Layer |
Manual pronunciation tags |
--authorLayer=Layer |
Name of transcriber |
--dateLayer=Layer |
Document date |
--languageLayer=Layer |
The language of the whole transcript |
--phraseLanguageLayer=Layer |
For tagging individual phrases with a language |
--useConventions=true |
Whether to use text conventions for comment, noise, lexical, and pronounce annotations |
--ignoreBlankAnnotations=true |
Whether to skip annotations with no label, or process them |
--minimumTurnPauseLength=0.0 |
Minimum amount of time between two turns by the same speaker, with no intervening speaker, for which the inter-turn pause counts as a turn change boundary. If the pause is shorter than this, the turns are merged into one. |
--wordTierPattern=.*word.* |
A regular expression that matches the tier that contains word tokens, e.g. .word. |
nzilbb.convernt.whispertoeaf