TrsToVtt

Converts Transcriber .trs files to Web VTT subtitles

As WebVTT is a subtitle format, not related to a linguistic annotation tool, all annotations that Transcriber supports are lost during this conversion, including:

  • topics
  • comments
  • noises
  • phrase language annotations
  • named entity annotations
  • lexical tags
  • pronounce tags

Although WebVTT does not support meta-data, the meta-data that is present in the Transcriber transcript is included in the .vtt file as NOTEs, including:

  • version
  • version date
  • air date
  • scribe
  • language
  • participant gender
  • participant dialect
  • participant accent
  • participant scope

Deserializing from “Transcriber transcript” text/xml-transcriber

Command-line configuration parameters for deserialization:

--topicLayer=Layer Topic tags
--commentLayer=Layer Commentary
--noiseLayer=Layer Noise annotations
--languageLayer=Layer Inline language tags
--lexicalLayer=Layer Lexical tags
--pronounceLayer=Layer Manual pronunciation tags
--entityLayer=Layer Named entities
--scribeLayer=Layer Name of transcriber
--versionLayer=Layer Version of transcriber
--versionDateLayer=Layer Version date of transcriber
--programLayer=Layer Name of the program recorded
--airDateLayer=Layer Date the program aired
--transcriptLanguageLayer=Layer The language of the whole transcript
--participantCheckLayer=Layer Participant checked
--genderLayer=Layer Gender - participant ‘type’
--dialectLayer=Layer Participant's dialect
--accentLayer=Layer Participant's accent
--scopeLayer=Layer Participant's ‘scope’

Serializing to “WebVTT subtitles” text/vtt

There are no configuration parameters for serialization.