SltToTextGrid

Converts SALT .slt transcripts to Praat .TextGrid files

Praat doesn't support meta-data like Dob, Doe, Ethnicity, etc. so almost all SALT header meta-data is lost when converting to .eaf.

By default, inline annotations (mazes, codes, bound morphemes, etc.) are not interpreted and appear as-is in the TextGrid. If you want them to be parse (i.e. annotations removed to yield more standard orthography), use –parseInlineConventions

Deserializing from “SALT transcript” text/x-salt

Command-line configuration parameters for deserialization:


`--cUnitLayer=`Layer	Layer for marking c-units
`--targetParticipantLayer=`Layer	Layer for marking the target participant
`--commentLayer=`Layer	Layer for comments
`--parentheticalLayer=`Layer	Layer for marking parenthetical remarks by the speaker
`--properNameLayer=`Layer	Layer for tagging proper names
`--repetitionsLayer=`Layer	Layer for annotating repetitions
`--rootLayer=`Layer	Layer for tagging words with their root form
`--errorLayer=`Layer	Layer for marking errors
`--soundEffectLayer=`Layer	Layer for marking non-word verbal sound effects
`--pauseLayer=`Layer	Layer for marking pauses in speech
`--boundMorphemeLayer=`Layer	Layer for marking bound morpheme annotations
`--mazeLayer=`Layer	Layer for marking false starts, repetitions, and reformulations
`--partialWordLayer=`Layer	Layer for marking stuttered or interrupted words
`--omissionLayer=`Layer	Layer for marking missing words
`--codeLayer=`Layer	Layer for non-error codes
`--languageLayer=`Layer	Layer for recording the language of the speech
`--participantIdLayer=`Layer	Layer for recording the target participant's ID
`--genderLayer=`Layer	Layer for recording the gender of the target participant
`--dobLayer=`Layer	Layer for recording the birth date of the target participant
`--doeLayer=`Layer	Layer for recording the date the recording was elicited
`--caLayer=`Layer	Layer for recording the target participant's age when recorded
`--ethnicityLayer=`Layer	Layer for recording the ethnicity of the target participant
`--contextLayer=`Layer	Layer for recording the sampling context
`--subgroupLayer=`Layer	Layer for recording the sub-group/story
`--collectLayer=`Layer	Layer for recording the collection point of the elicitation
`--locationLayer=`Layer	Layer for recording the location of the elicitation
`--dateFormat=`String	Format used in SALT files for dates (e.g. Dob, Doe) - either M/d/yyyy or d/M/yyyy. NB: the default date format is inferred from your locale settings
`--parseInlineConventions=`Boolean	Whether to use SALT in-line conventions when deserializing. If false, then only meta-data headers, comment lines, and time stamps are parsed; all in-line annotation conventions are left as-is

Serializing to “Praat TextGrid” text/praat-textgrid

Command-line configuration parameters for serialization:


`--commentLayer=`Layer	Commentary
`--noiseLayer=`Layer	Noise annotations
`--lexicalLayer=`Layer	Lexical tags
`--pronounceLayer=`Layer	Manual pronunciation tags
`--renameShortNumericSpeakers=`Boolean	Short speaker names like ‘S1’ should be prefixed with the transcript name during import
`--allowPeerOverlap=`Boolean	Allows TextGrids with, for example, multiple segment tiers, if the underlying annotations are invalid and have overlapping segments.
`--utteranceThreshold=`Double	Minimum inter-word pause to trigger an utterance boundary, when no utterance layer is mapped. 0 means ‘do not infer utterance boundaries’.
`--useConventions=`Boolean	Whether to use text conventions for comment, noise, lexical, and pronounce annotations
`--ignoreLabels=`String	Regular expression for annotation to ignore, e.g. <p:> to ignore MAUS pauses