About nzilbb.annotator
Annotators are modules that perform specialized automatic annotation tasks on annotation graphs.
Project Modules
This project has declared the following modules:
| Name | Description |
|---|---|
| nzilbb.annotator.orthography | This anotator creates 'cleaned up' word labels - e.g. downcase, with extraneous punctuation removed - which should be optimal for looking up lexicons, frequency computation, etc. |
| nzilbb.annotator.cmudict | Annotator that tags words with their pronunciations according to the CMU Pronouncing Dictionary. |
| nzilbb.annotator.flatlexicon | Looks up lexicons derived from a flat file (e.g. in CSV format) |
| nzilbb.annotator.javascript | The Javascript Annotator allows annotations to be generated by executing a program or expression written in JavaScript. |
| nzilbb.annotator.patterntagger | The Pattern Tagger generates new annotations by matching a list of regular expressions against annotations on a selected layer; the first pattern that matches is used to tag the annotation. |
| nzilbb.annotator.porterstemmer | The Porter Stemmer annotator uses the Porter Algorithm to compute the stems of English words from their orthography. |
| nzilbb.annotator.jython | The Jython Layer Manager allows annotations to be generated by executing a script written in Python. |
| nzilbb.annotator.spanishphonology | This annotator tags Spanish words their phonemic transcription, based on rules that map orthography to phonology. |
| nzilbb.annotator.labelmapper | Maps annotations on pairs of layers by finding the minimum edit path between them. |
| nzilbb.annotator.phonemetranscoder | Translates phonemic transcriptions from one encoding to another. |
| nzilbb.annotator.basservices | Uses CLARIN-D's BAS web services for various annotation tasks. |
| nzilbb.annotator.reaper | Annotator that executes Reaper on the transcript's sound file to analyse frequency. |
| nzilbb.annotator.unisyn | Looks up accent-specific versions of the Unisyn English lexicon to annotate English words with their pronunciations. |
| nzilbb.annotator.mor | Annotator that annotates words tags from the the Talk Bank MOR morphosyntactic tagger. |
| nzilbb.annotator.partitioner | Partitions transcripts by a number of tokens or a number of seconds. |
| nzilbb.annotator.stanfordner | Integration with the Stanford Named Entity Recognizer (NER) |
nzilbb.annotator