About nzilbb.annotator

Annotators are modules that perform specialized automatic annotation tasks on annotation graphs.

Project Modules

This project has declared the following modules:

Name	Description
nzilbb.annotator.orthography	This anotator creates 'cleaned up' word labels - e.g. downcase, with extraneous punctuation removed - which should be optimal for looking up lexicons, frequency computation, etc.
nzilbb.annotator.cmudict	Annotator that tags words with their pronunciations according to the CMU Pronouncing Dictionary.
nzilbb.annotator.flatlexicon	Looks up lexicons derived from a flat file (e.g. in CSV format)
nzilbb.annotator.javascript	The Javascript Annotator allows annotations to be generated by executing a program or expression written in JavaScript.
nzilbb.annotator.patterntagger	The Pattern Tagger generates new annotations by matching a list of regular expressions against annotations on a selected layer; the first pattern that matches is used to tag the annotation.
nzilbb.annotator.porterstemmer	The Porter Stemmer annotator uses the Porter Algorithm to compute the stems of English words from their orthography.
nzilbb.annotator.jython	The Jython Layer Manager allows annotations to be generated by executing a script written in Python.
nzilbb.annotator.spanishphonology	This annotator tags Spanish words their phonemic transcription, based on rules that map orthography to phonology.
nzilbb.annotator.labelmapper	Maps annotations on pairs of layers by finding the minimum edit path between them.
nzilbb.annotator.phonemetranscoder	Translates phonemic transcriptions from one encoding to another.
nzilbb.annotator.basservices	Uses CLARIN-D's BAS web services for various annotation tasks.
nzilbb.annotator.reaper	Annotator that executes Reaper on the transcript's sound file to analyse frequency.
nzilbb.annotator.unisyn	Looks up accent-specific versions of the Unisyn English lexicon to annotate English words with their pronunciations.
nzilbb.annotator.mor	Annotator that annotates words tags from the the Talk Bank MOR morphosyntactic tagger.
nzilbb.annotator.partitioner	Partitions transcripts by a number of tokens or a number of seconds.
nzilbb.annotator.stanfordner	Integration with the Stanford Named Entity Recognizer (NER)