All Classes
| Class |
Description |
| AbstractBottomUpParser |
Abstract class which contains code to tag and chunk parses for bottom up parsing and
leaves implementation of advancing parses and completing parses to extending classes.
|
| AbstractContextGenerator |
Abstract class containing many of the methods used to generate contexts for parsing.
|
| AbstractDataIndexer |
Abstract class for collecting event and context counts used in training.
|
| AbstractEventModelSequenceTrainer |
|
| AbstractEventStream<T> |
|
| AbstractEventTrainer |
|
| AbstractModel |
|
| AbstractModel.ModelType |
|
| AbstractModelReader |
|
| AbstractModelWriter |
|
| AbstractObjectStream<T> |
|
| AbstractParserEventStream |
Abstract class extended by parser event streams which perform tagging and chunking.
|
| AbstractSampleStreamFactory<T> |
Base class for sample stream factories.
|
| AbstractSequenceTrainer |
|
| AbstractToSentenceSampleStream<T> |
|
| AbstractTrainer |
|
| AdaptiveFeatureGenerator |
An interface for generating features for named entity identification and for
updating document level contexts.
|
| ADChunkSampleStream |
Parser for the Floresta Sintá(c)tica Arvores Deitadas corpus, with output for
Portuguese Chunker training.
|
| ADChunkSampleStreamFactory |
A Factory to create an Arvores Deitadas ChunkStream from the command line
utility.
|
| AdditionalContextFeatureGenerator |
|
| ADNameSampleStream |
Parser for the Floresta Sintá(c)tica Arvores Deitadas corpus, with output for
Portuguese NER training.
|
| ADNameSampleStreamFactory |
A Factory to create an Arvores Deitadas NameSampleDataStream from the command line
utility.
|
| ADPOSSampleStream |
Note: Do not use this class, internal use only!
|
| ADPOSSampleStreamFactory |
Note: Do not use this class, internal use only!
|
| ADSentenceSampleStream |
Note: Do not use this class, internal use only!
|
| ADSentenceSampleStreamFactory |
Note: Do not use this class, internal use only!
|
| ADSentenceStream |
Stream filter which merges text lines into sentences, following the Arvores
Deitadas syntax.
|
| ADSentenceStream.Sentence |
|
| ADSentenceStream.SentenceParser |
Parses a sample of AD corpus.
|
| ADTokenSampleStreamFactory |
Note: Do not use this class, internal use only!
|
| AggregateCharSequenceNormalizer |
|
| AggregatedFeatureGenerator |
|
| AggregatedFeatureGeneratorFactory |
|
| AncoraSpanishHeadRules |
Class for storing the Ancora Spanish head rules associated with parsing.
|
| AncoraSpanishHeadRules.HeadRulesSerializer |
|
| AnnotationConfiguration |
|
| AnnotatorNoteAnnotation |
|
| arabicStemmer |
This class was automatically generated by a Snowball to Java compiler.
It implements the stemming algorithm defined by a snowball script.
|
| ArrayMath |
Utility class for simple vector arithmetic.
|
| ArrayMath |
Deprecated. |
| ArtifactProvider |
Provides access to model persisted artifacts.
|
| ArtifactSerializer<T> |
|
| ArtifactToSerializerMapper |
Deprecated. |
| AttachContextGenerator |
|
| AttributeAnnotation |
|
| Attributes |
|
| BagOfWordsFeatureGenerator |
Generates a feature for each word in a document.
|
| BaseLink |
Stores a minimal tuple of information.
|
| BaseModel |
This model is a common base which can be used by the components'
model classes.
|
| BaseToolFactory |
Base class for all tool factories.
|
| BasicContextGenerator |
Generate contexts for maxent decisions, assuming that the input
given to the getContext() method is a String containing contextual
predicates separated by spaces.
|
| BasicFormatParams |
Common format parameters.
|
| BasicTrainingParams |
Common training parameters.
|
| BeamSearch<T> |
Performs k-best search over sequence.
|
| BeamSearchContextGenerator<T> |
Interface for context generators used with a sequence beam search.
|
| BigramNameFeatureGenerator |
|
| BigramNameFeatureGeneratorFactory |
|
| BilouCodec |
|
| BilouNameFinderSequenceValidator |
|
| BinaryFileDataReader |
|
| BinaryGISModelReader |
A reader for GIS models stored in binary format.
|
| BinaryGISModelWriter |
Model writer that saves models in binary format.
|
| BinaryNaiveBayesModelReader |
|
| BinaryNaiveBayesModelWriter |
Model writer that saves models in binary format.
|
| BinaryPerceptronModelReader |
|
| BinaryPerceptronModelWriter |
Model writer that saves models in binary format.
|
| BinaryQNModelReader |
A reader for quasi-newton models stored in binary format.
|
| BinaryQNModelWriter |
|
| BioCodec |
|
| BioNLP2004NameSampleStream |
Parser for the training files of the BioNLP/NLPBA 2004 shared task.
|
| BioNLP2004NameSampleStreamFactory |
|
| BratAnnotation |
|
| BratAnnotationStream |
Reads the annotations from the brat .ann annotation file.
|
| BratDocument |
|
| BratDocumentParser |
|
| BratDocumentStream |
|
| BratNameSampleStream |
Generates Name Sample objects for a Brat Document object.
|
| BratNameSampleStreamFactory |
|
| BrownBigramFeatureGenerator |
Generates Brown cluster features for token bigrams.
|
| BrownCluster |
Class to load a Brown cluster document: word\tword_class\tprob
http://metaoptimize.com/projects/wordreprs/
The file containing the clustering lexicon has to be passed as the
value of the dict attribute of each BrownCluster feature generator.
|
| BrownCluster.BrownClusterSerializer |
|
| BrownClusterBigramFeatureGeneratorFactory |
Generates Brown clustering features for token bigrams.
|
| BrownClusterTokenClassFeatureGeneratorFactory |
Generates Brown clustering features for token classes.
|
| BrownClusterTokenFeatureGeneratorFactory |
Generates Brown clustering features for current token.
|
| BrownTokenClasses |
Obtain the paths listed in the pathLengths array from the Brown class.
|
| BrownTokenClassFeatureGenerator |
Generates Brown cluster features for current token and token class.
|
| BrownTokenFeatureGenerator |
Generates Brown cluster features for current token.
|
| BuildContextGenerator |
Class to generate predictive contexts for deciding how constituents should be combined together.
|
| BuildContextGenerator |
Creates the features or contexts for the building phase of parsing.
|
| BuildModelUpdaterTool |
|
| ByteArraySerializer |
|
| Cache<K,V> |
Provides fixed size, pre-allocated, least recently used replacement cache.
|
| CachedFeatureGenerator |
|
| CachedFeatureGeneratorFactory |
|
| catalanStemmer |
This class was automatically generated by a Snowball to Java compiler.
It implements the stemming algorithm defined by a snowball script.
|
| CensusDictionaryCreatorTool |
This tool helps create a loadable dictionary for the NameFinder,
from data collected from US Census data.
|
| CharacterNgramFeatureGenerator |
|
| CharacterNgramFeatureGeneratorFactory |
|
| CharSequenceNormalizer |
A char sequence normalizer, used to adjust (prune, substitute, add, etc.)
characters in order to remove noise from text.
|
| CheckContextGenerator |
Class for generating predictive context for deciding when a constituent is complete.
|
| CheckContextGenerator |
|
| CheckModelUpdaterTool |
|
| ChunkContextGenerator |
Creates predictive context for the pre-chunking phases of parsing.
|
| Chunker |
The interface for chunkers which provide chunk tags for a sequence of tokens.
|
| ChunkerContextGenerator |
Interface for the context generator used in syntactic chunking.
|
| ChunkerConverterTool |
Tool to convert multiple data formats into native OpenNLP chunker training
format.
|
| ChunkerCrossValidator |
|
| ChunkerCrossValidatorTool |
|
| ChunkerDetailedFMeasureListener |
|
| ChunkerEvaluationMonitor |
|
| ChunkerEvaluator |
|
| ChunkerEvaluatorTool |
|
| ChunkerEventStream |
Class for creating an event stream out of data files for training a chunker.
|
| ChunkerFactory |
|
| ChunkerME |
The class represents a maximum-entropy-based chunker.
|
| ChunkerMETool |
|
| ChunkerModel |
|
| ChunkerModelLoader |
Loads a Chunker Model for the command line tools.
|
| ChunkerModelSerializer |
|
| ChunkerSampleStreamFactory |
|
| ChunkerTrainerTool |
|
| ChunkEvaluationErrorListener |
|
| ChunkSample |
Class for holding chunks for a single unit of text.
|
| ChunkSampleSequenceStream |
|
| ChunkSampleStream |
Parses the CoNLL 2000 shared task shallow parser training data.
|
| ChunkSampleStream |
|
| CollectionObjectStream<E> |
|
| ComparableEvent |
A maxent event representation which we can use to sort based on the
predicates indexes contained in the events.
|
| ComparablePredicate |
A maxent predicate representation which we can use to sort based on the
outcomes.
|
| ConfigurablePOSContextGenerator |
A context generator for the POS Tagger.
|
| Conll02NameSampleStream |
Parser for the Dutch and Spanish NER training files of the CoNLL 2002 shared task.
|
| Conll02NameSampleStream.LANGUAGE |
|
| Conll02NameSampleStreamFactory |
Note: Do not use this class, internal use only!
|
| Conll03NameSampleStream |
An import stream which can parse the CONLL03 data.
|
| Conll03NameSampleStream.LANGUAGE |
|
| Conll03NameSampleStreamFactory |
|
| ConlluLemmaSampleStream |
|
| ConlluLemmaSampleStreamFactory |
Note: Do not use this class, internal use only!
|
| ConlluPOSSampleStream |
|
| ConlluPOSSampleStreamFactory |
Note: Do not use this class, internal use only!
|
| ConlluSentence |
|
| ConlluSentenceSampleStream |
|
| ConlluSentenceSampleStreamFactory |
|
| ConlluStream |
The CoNLL-U format is specified here:
http://universaldependencies.org/format.html
|
| ConlluTagset |
|
| ConlluTokenSampleStream |
|
| ConlluTokenSampleStreamFactory |
|
| ConlluWordLine |
|
| ConllXPOSSampleStream |
Parses the data from the CONLL 06 shared task into POS Samples.
|
| ConllXPOSSampleStreamFactory |
Note: Do not use this class, internal use only!
|
| ConllXSentenceSampleStreamFactory |
Note: Do not use this class, internal use only!
|
| ConllXTokenSampleStreamFactory |
Note: Do not use this class, internal use only!
|
| Cons |
Class to hold feature information about a specific parse node.
|
| ConstitParseSampleStream |
|
| ConstitParseSampleStreamFactory |
|
| Constituent |
Class used to hold constituents when reading parses.
|
| Context |
Class which associates a real valued parameter or expected value with a particular contextual
predicate or feature.
|
| ContextGenerator<T> |
Generate contexts for maxent decisions.
|
| CrossValidationPartitioner<E> |
Provides access to training and test partitions for n-fold cross validation.
|
| CrossValidationPartitioner.TrainingSampleStream<E> |
The TrainingSampleStream which iterates over
all training elements.
|
| CustomFeatureGenerator |
Deprecated. |
| CVParams |
Common cross validator parameters.
|
| DataIndexer |
Object which compresses events in memory and performs feature selection.
|
| DataIndexerFactory |
|
| DataReader |
|
| DataStream |
An interface for objects which can deliver a stream of training data to be
supplied to an EventStream.
|
| DefaultChunkerContextGenerator |
Features based on chunking model described in Fei Sha and Fernando Pereira.
|
| DefaultChunkerSequenceValidator |
|
| DefaultEndOfSentenceScanner |
|
| DefaultLanguageDetectorContextGenerator |
A context generator for language detector.
|
| DefaultLemmatizerContextGenerator |
Simple feature generator for learning statistical lemmatizers.
|
| DefaultLemmatizerSequenceValidator |
|
| DefaultNameContextGenerator |
Class for determining contextual features for a tag/chunk style
named-entity recognizer.
|
| DefaultPOSContextGenerator |
A context generator for the POS Tagger.
|
| DefaultPOSSequenceValidator |
|
| DefaultSDContextGenerator |
Generate event contexts for maxent decisions for sentence detection.
|
| DefaultTokenContextGenerator |
Generate events for maxent decisions for tokenization.
|
| DefinitionFeatureGeneratorFactory |
|
| DetailedFMeasureEvaluatorParams |
EvaluatorParams for Chunker.
|
| DetokenEvaluationErrorListener |
|
| DetokenizationDictionary |
|
| DetokenizationDictionary.Operation |
|
| Detokenizer |
A Detokenizer merges tokens back to their untokenized representation.
|
| Detokenizer.DetokenizationOperation |
This enum contains an operation for every token to merge the
tokens together to their detokenized form.
|
| DetokenizerEvaluator |
|
| DetokenizerParameter |
|
| DetokenizerSampleStreamFactory<T> |
Base class for factories which need detokenizer.
|
| DetokenizeSentenceSampleStream |
|
| Dictionary |
This class is a dictionary.
|
| DictionaryBuilderTool |
|
| DictionaryDetokenizer |
A rule based detokenizer.
|
| DictionaryDetokenizerTool |
|
| DictionaryEntryPersistor |
This class is used for reading and writing dictionaries of all kinds.
|
| DictionaryFeatureGenerator |
|
| DictionaryFeatureGeneratorFactory |
|
| DictionaryLemmatizer |
Lemmatizes by simple dictionary lookup in a hashmap built from a file
containing, on each line, word\tpostag\tlemma (tab-separated).
|
| DictionaryNameFinder |
This is a dictionary-based name finder that scans text
for names contained in a dictionary.
|
| DictionarySerializer |
|
| DirectorySampleStream |
The directory sample stream allows for creating a stream
from a directory listing of files.
|
| DoccatConverterTool |
|
| DoccatCrossValidator |
Cross validator for document categorization
|
| DoccatCrossValidatorTool |
|
| DoccatEvaluationErrorListener |
|
| DoccatEvaluationMonitor |
|
| DoccatEvaluatorTool |
|
| DoccatFactory |
The factory that provides Doccat default implementations and resources
|
| DoccatFineGrainedReportListener |
Generates a detailed report for the Document Categorizer.
|
| DoccatModel |
A model for document categorization
|
| DoccatModelLoader |
Loads a Document Categorizer Model for the command line tools.
|
| DoccatTool |
|
| DoccatTrainerTool |
|
| DocumentBeginFeatureGenerator |
|
| DocumentBeginFeatureGeneratorFactory |
|
| DocumentCategorizer |
Interface for classes which categorize documents.
|
| DocumentCategorizerEvaluator |
|
| DocumentCategorizerEventStream |
Iterator-like class for modeling document classification events.
|
| DocumentCategorizerME |
|
| DocumentNameFinder |
Name finding interface which processes an entire document allowing the name finder to use context
from the entire document.
|
| DocumentSample |
Class which holds a classified document and its category.
|
| DocumentSampleStream |
This class reads in string encoded training samples, parses them and
outputs DocumentSample objects.
|
| DocumentSampleStreamFactory |
|
| DocumentToLineStream |
Reads a plain text file and returns each line as a String object.
|
| DownloadUtil |
This class facilitates the downloading of pretrained OpenNLP models.
|
| DownloadUtil.ModelType |
The type of model.
|
| DynamicEvalParameters |
|
| EmojiCharSequenceNormalizer |
Normalizer for emojis.
|
| EmptyLinePreprocessorStream |
Stream to clean up empty lines for empty line separated document streams.
- Skips empty line at training data start
- Transforms multiple empty lines in a row into one
- Replaces white space lines with empty lines
- TODO: Terminates last document with empty line if it is missing
This stream should be used by the components that mark empty lines to mark document boundaries.
|
| EncodingParameter |
Encoding parameter.
|
| EndOfSentenceScanner |
Scans Strings, StringBuffers, and char[] arrays for the offsets of
sentence ending characters.
|
| EntityLinker<T extends Span> |
EntityLinkers establish connections to external data to enrich extracted
entities.
|
| EntityLinkerFactory |
Generates an EntityLinker implementation via properties file configuration
|
| EntityLinkerProperties |
Properties wrapper for the EntityLinker framework
|
| EntityLinkerTool |
|
| Entry |
|
| EntryInserter |
|
| EvalitaNameSampleStream |
Parser for the Italian NER training files of the Evalita 2007 and 2009 NER shared tasks.
|
| EvalitaNameSampleStream.LANGUAGE |
|
| EvalitaNameSampleStreamFactory |
Note: Do not use this class, internal use only!
|
| EvalParameters |
This class encapsulates the variables used in producing probabilities from a model
and facilitates passing these variables to the eval method.
|
| EvaluationMonitor<T> |
|
| Evaluator<T> |
The Evaluator is an abstract base class for evaluators.
|
| EvaluatorParams |
Common evaluation parameters.
|
| Event |
The context of a decision point during training.
|
| EventAnnotation |
|
| EventModelSequenceTrainer |
|
| EventTraceStream |
|
| EventTrainer |
|
| Experimental |
Indicates that the API is not stable.
|
| ExtensionLoader |
The ExtensionLoader is responsible for loading extensions to the OpenNLP library.
|
| ExtensionNotLoadedException |
Exception indicates that an OpenNLP extension could not be loaded.
|
| ExtensionServiceKeys |
|
| Factory |
|
| Factory |
|
| FeatureGenerator |
Interface for generating features for document categorization.
|
| FeatureGeneratorResourceProvider |
|
| FeatureGeneratorUtil |
This class provides common utilities for feature generation.
|
| FileEventStream |
Class for using a file of events as an event stream.
|
| FileToByteArraySampleStream |
|
| FileToStringSampleStream |
Provides the ability to read the contents of files
contained in an object stream of files.
|
| FilterObjectStream<S,T> |
|
| FineGrainedEvaluatorParams |
Common evaluation parameters.
|
| FMeasure |
The FMeasure is a utility class for evaluators
which measure precision, recall and the resulting f-measure.
|
| Function |
Interface for a function
|
| GapLabeler |
Interface for labeling nodes which contain traces so that these traces can be predicted
by the parser.
|
| GeneratorFactory |
Creates a set of feature generators based on a provided XML descriptor.
|
| GeneratorFactory.AbstractXmlFeatureGeneratorFactory |
|
| GenericModelReader |
|
| GenericModelSerializer |
|
| GenericModelWriter |
|
| GISModel |
A maximum entropy model which has been trained using the Generalized
Iterative Scaling procedure (implemented in GIS.java).
|
| GISModelReader |
Abstract parent class for readers of GISModels.
|
| GISModelWriter |
Abstract parent class for GISModel writers.
|
| GISTrainer |
An implementation of Generalized Iterative Scaling.
|
| Glove |
Warning: Experimental new feature, see OPENNLP-1144 for details, the API might be changed anytime.
|
| greekStemmer |
This class was automatically generated by a Snowball to Java compiler.
It implements the stemming algorithm defined by a snowball script.
|
| HashSumEventStream |
|
| HeadRules |
Interface for encoding the head rules associated with parsing.
|
| HeadRules |
Class for storing the English head rules associated with parsing.
|
| HeadRules.HeadRulesSerializer |
|
| Index |
|
| indonesianStemmer |
This class implements the stemming algorithm defined by a snowball script.
|
| InputStreamFactory |
Allows repeated reads through a stream for certain types of model building.
|
| InSpanGenerator |
Generates features if the tokens are recognized by the provided
TokenNameFinder.
|
| InsufficientTrainingDataException |
This exception indicates that the provided training data is
insufficient to train the desired model.
|
| InvalidFormatException |
This exception indicates that a resource violates the expected data format.
|
| IrishSentenceBankDocument |
A structure to hold an Irish Sentence Bank document, which is a collection
of tokenized sentences.
|
| IrishSentenceBankDocument.IrishSentenceBankFlex |
|
| IrishSentenceBankDocument.IrishSentenceBankSentence |
|
| IrishSentenceBankSentenceStreamFactory |
|
| IrishSentenceBankTokenSampleStreamFactory |
|
| irishStemmer |
This class was automatically generated by a Snowball to Java compiler.
It implements the stemming algorithm defined by a snowball script.
|
| Language |
Class for holding the document language and its confidence
|
| LanguageDetector |
The interface for language detectors, which provide the Language according to the context.
|
| LanguageDetectorConfig |
|
| LanguageDetectorContextGenerator |
A context generator interface for language detector.
|
| LanguageDetectorConverterTool |
|
| LanguageDetectorCrossValidator |
Cross validator for language detector
|
| LanguageDetectorCrossValidatorTool |
|
| LanguageDetectorEvaluationErrorListener |
|
| LanguageDetectorEvaluationMonitor |
|
| LanguageDetectorEvaluator |
|
| LanguageDetectorEvaluatorTool |
|
| LanguageDetectorEventStream |
Iterator-like class for modeling language detector events.
|
| LanguageDetectorFactory |
Default factory used by Language Detector.
|
| LanguageDetectorFineGrainedReportListener |
Generates a detailed report for the Language Detector.
|
| LanguageDetectorME |
Implements learnable Language Detector
|
| LanguageDetectorModel |
A model for language detection
|
| LanguageDetectorModelLoader |
Loads a Language Detector Model for the command line tools.
|
| LanguageDetectorSampleStream |
This class reads in string encoded training samples, parses them and
outputs LanguageSample objects.
|
| LanguageDetectorSampleStreamFactory |
|
| LanguageDetectorTool |
|
| LanguageDetectorTrainerTool |
|
| LanguageModel |
A language model can calculate the probability p (between 0 and 1) of a
certain sequence of tokens, given its underlying vocabulary.
|
| LanguageParams |
|
| LanguageSample |
Class which holds a classified document and its Language.
|
| LanguageSampleStreamFactory<T> |
Stream factory for those streams which carry language.
|
| LeipzigLanguageSampleStream |
|
| LeipzigLanguageSampleStreamFactory |
Note: Do not use this class, internal use only!
|
| LemmaEvaluationErrorListener |
|
| LemmaSample |
Represents a lemmatized sentence.
|
| LemmaSampleEventStream |
Class for creating an event stream out of data files for training a probabilistic lemmatizer.
|
| LemmaSampleSequenceStream |
|
| LemmaSampleStream |
Reads data for training and testing the lemmatizer.
|
| Lemmatizer |
The interface for lemmatizers.
|
| LemmatizerContextGenerator |
Interface for the context generator used for probabilistic lemmatizer.
|
| LemmatizerEvaluationMonitor |
Interface for the lemmatizer evaluator.
|
| LemmatizerEvaluator |
|
| LemmatizerEvaluatorTool |
|
| LemmatizerFactory |
|
| LemmatizerFineGrainedReportListener |
Generates a detailed report for the Lemmatizer.
|
| LemmatizerME |
A probabilistic lemmatizer.
|
| LemmatizerMETool |
|
| LemmatizerModel |
|
| LemmatizerModelLoader |
Loads a Lemmatizer Model for the command line tools.
|
| LemmatizerSampleStreamFactory |
|
| LemmatizerTrainerTool |
|
| LetsmtDocument |
A structure to hold the letsmt document.
|
| LetsmtDocument.LetsmtDocumentHandler |
|
| LetsmtDocument.LetsmtSentence |
|
| LetsmtSentenceStreamFactory |
|
| LineSearch |
Class that performs line search to find minimum
|
| LineSearch.LineSearchResult |
Class to store lineSearch result
|
| LinkedSpan<T extends BaseLink> |
A "default" extended span that holds additional information about the Span
|
| LogProbabilities<T> |
Class implementing the probability distribution over labels returned by
a classifier as a log of probabilities.
|
| LogProbability<T> |
Class implementing the probability for a label.
|
| MarkableFileInputStreamFactory |
A factory that creates MarkableFileInputStream from a File
|
| MascDocument |
|
| MascDocumentStream |
|
| MascNamedEntityParser |
A class to process the MASC Named entity stand-off annotation file
|
| MascNamedEntitySampleStream |
|
| MascNamedEntitySampleStreamFactory |
|
| MascPennTagParser |
A class for parsing MASC's Penn tagging/tokenization stand-off annotation
|
| MascPOSSampleStream |
|
| MascPOSSampleStreamFactory |
|
| MascSentence |
|
| MascSentenceSampleStream |
|
| MascSentenceSampleStreamFactory |
|
| MascToken |
|
| MascTokenSampleStream |
|
| MascTokenSampleStreamFactory |
|
| MascWord |
|
| MaxentModel |
Interface for maximum entropy models.
|
| Mean |
Calculates the arithmetic mean of values
added with the Mean.add(double) method.
|
| ModelType |
|
| ModelUtil |
|
| MosesSentenceSampleStream |
|
| MosesSentenceSampleStreamFactory |
|
| Muc6NameSampleStreamFactory |
|
| MucNameContentHandler |
|
| MucNameSampleStream |
|
| MutableContext |
Class used to store parameters or expected values associated with this context which
can be updated or assigned.
|
| MutableInt |
This is a non-thread safe mutable int.
|
| MutableTagDictionary |
Interface that allows TagDictionary entries to be added and removed.
|
| NaiveBayesEvalParameters |
Parameters for the evaluation of a naive Bayes classifier
|
| NaiveBayesModel |
Class implementing the multinomial Naive Bayes classifier model.
|
| NaiveBayesModelReader |
Abstract parent class for readers of NaiveBayes.
|
| NaiveBayesModelWriter |
Abstract parent class for NaiveBayes writers.
|
| NaiveBayesTrainer |
Trains models using the combination of EM algorithm and Naive Bayes classifier
which is described in:
Text Classification from Labeled and Unlabeled Documents using EM
Nigam, McCallum, et al paper of 2000
|
| NameContextGenerator |
Interface for generating the context for a name finder by specifying a set of feature generators.
|
| NameEvaluationErrorListener |
|
| NameFinderCensus90NameStream |
This class helps to read the US Census data from the files to build a
StringList for each dictionary entry in the name-finder dictionary.
|
| NameFinderEventStream |
Class for creating an event stream out of data files for training a name
finder.
|
| NameFinderME |
Class for creating a maximum-entropy-based name finder.
|
| NameFinderSequenceValidator |
|
| NameSample |
Class for holding names for a single unit of text.
|
| NameSampleCountersStream |
Counts tokens, sentences and names by type
|
| NameSampleDataStream |
|
| NameSampleDataStreamFactory |
|
| NameSampleDataStreamFactory.Parameters |
|
| NameSampleSequenceStream |
|
| NameSampleTypeFilter |
A stream which removes Name Samples which do not have a certain type.
|
| NameToSentenceSampleStream |
Note: Do not use this class, internal use only!
|
| NameToSentenceSampleStreamFactory |
Note: Do not use this class, internal use only!
|
| NameToTokenSampleStream |
Note: Do not use this class, internal use only!
|
| NameToTokenSampleStreamFactory |
Note: Do not use this class, internal use only!
|
| NegLogLikelihood |
Evaluate negative log-likelihood and its gradient from DataIndexer.
|
| NewlineSentenceDetector |
The Newline Sentence Detector assumes that sentences are line delimited and
recognizes one sentence per non-empty line.
|
| NGramCharModel |
|
| NGramFeatureGenerator |
Generates ngram features for a document.
|
| NGramGenerator |
Generates an nGram, with optional separator, and returns the grams as a list
of strings
|
| NGramLanguageModel |
|
| NGramLanguageModelTool |
|
| NGramModel |
The NGramModel can be used to create ngrams and character ngrams.
|
| NGramUtils |
Utility class for ngrams.
|
| NKJPSegmentationDocument |
|
| NKJPSegmentationDocument.Pointer |
|
| NKJPSentenceSampleStream |
|
| NKJPSentenceSampleStreamFactory |
|
| NKJPTextDocument |
|
| NumberCharSequenceNormalizer |
Normalizer for numbers
|
| ObjectDataReader |
|
| ObjectStream<T> |
Reads Objects from a stream.
|
| ObjectStreamUtils |
|
| OnePassDataIndexer |
An indexer for maxent model data which handles cutoffs for uncommon
contextual predicates and provides a unique integer index for each of the
predicates.
|
| OnePassRealValueDataIndexer |
An indexer for maxent model data which handles cutoffs for uncommon
contextual predicates and provides a unique integer index for each of the
predicates and maintains event values.
|
| OntoNotesFormatParameters |
|
| OntoNotesNameSampleStream |
Name Sample Stream parser for the OntoNotes 4.0 corpus.
|
| OntoNotesNameSampleStreamFactory |
|
| OntoNotesParseSampleStream |
|
| OntoNotesParseSampleStreamFactory |
|
| OntoNotesPOSSampleStreamFactory |
|
| OSGiExtensionLoader |
OSGi bundle activator which can use an OSGi service as
an OpenNLP extension.
|
| OutcomePriorFeatureGenerator |
The definition feature maps the underlying distribution of outcomes.
|
| ParagraphStream |
Stream filter which merges text lines into paragraphs.
|
| ParallelNegLogLikelihood |
Evaluate negative log-likelihood and its gradient in parallel
|
| Parse |
Data structure for holding parse constituents.
|
| Parser |
Class for a shift reduce style parser based on Adwait Ratnaparkhi's 1998 thesis.
|
| Parser |
Interface for full-syntactic parsers.
|
| Parser |
Build/attach parser.
|
| ParserChunkerFactory |
|
| ParserChunkerSequenceValidator |
|
| ParserConverterTool |
|
| ParserCrossValidator |
|
| ParserEvaluationMonitor |
|
| ParserEvaluator |
Class for ParserEvaluator.
|
| ParserEvaluatorTool |
|
| ParserEventStream |
Wrapper class for one of four parser event streams.
|
| ParserEventStream |
|
| ParserEventTypeEnum |
Enumerated type of event types for the parser.
|
| ParserFactory |
|
| ParserModel |
This is an abstract base class for ParserModel implementations.
|
| ParserModelLoader |
Loads a Parser Model for the command line tools.
|
| ParserTool |
|
| ParserTrainerTool |
|
| ParserType |
|
| ParseSampleStream |
|
| ParseSampleStreamFactory |
|
| ParseSampleStreamFactory.Parameters |
|
| ParseToPOSSampleStream |
Note: Do not use this class, internal use only!
|
| ParseToPOSSampleStreamFactory |
Note: Do not use this class, internal use only!
|
| ParseToSentenceSampleStreamFactory |
Note: Do not use this class, internal use only!
|
| ParseToTokenSampleStreamFactory |
Note: Do not use this class, internal use only!
|
| PerceptronModel |
|
| PerceptronModelReader |
Abstract parent class for readers of Perceptron.
|
| PerceptronModelWriter |
Abstract parent class for Perceptron writers.
|
| PerceptronTrainer |
Trains models using the perceptron algorithm.
|
| PlainTextByLineStream |
Reads a plain text file and returns each line as a String object.
|
| PlainTextFileDataReader |
|
| PlainTextNaiveBayesModelReader |
|
| PlainTextNaiveBayesModelWriter |
Model writer that saves models in plain text format.
|
| PorterStemmer |
Stemmer, implementing the Porter Stemming Algorithm
The Stemmer class transforms a word into its root form.
|
| PortugueseContractionUtility |
Utility class to handle Portuguese contractions.
|
| POSContextGenerator |
The interface for a context generator for the POS Tagger.
|
| POSDictionary |
Provides a means of determining which tags are valid for a particular word
based on a tag dictionary read from a file.
|
| POSEvaluationErrorListener |
|
| POSEvaluator |
|
| POSModel |
|
| POSModelLoader |
Loads a POS Tagger Model for the command line tools.
|
| POSModelSerializer |
|
| POSSample |
Represents a POS-tagged sentence.
|
| POSSampleEventStream |
|
| POSSampleSequenceStream |
|
| PosSampleStream |
|
| POSTagger |
The interface for part of speech taggers.
|
| POSTaggerConverterTool |
|
| POSTaggerCrossValidator |
|
| POSTaggerCrossValidatorTool |
|
| POSTaggerEvaluationMonitor |
|
| POSTaggerEvaluatorTool |
|
| POSTaggerFactory |
The factory that provides POS Tagger default implementations and resources
|
| POSTaggerFactory.POSDictionarySerializer |
|
| PosTaggerFeatureGenerator |
|
| PosTaggerFeatureGeneratorFactory |
|
| POSTaggerFineGrainedReportListener |
Generates a detailed report for the POS Tagger.
|
| POSTaggerME |
A part-of-speech tagger that uses maximum entropy.
|
| POSTaggerNameFeatureGenerator |
Adds the token POS Tag as feature.
|
| POSTaggerNameFeatureGeneratorFactory |
|
| POSTaggerTool |
|
| POSTaggerTrainerTool |
|
| POSToSentenceSampleStream |
Note: Do not use this class, internal use only!
|
| POSToSentenceSampleStreamFactory |
Note: Do not use this class, internal use only!
|
| POSToTokenSampleStream |
Note: Do not use this class, internal use only!
|
| POSToTokenSampleStreamFactory |
Note: Do not use this class, internal use only!
|
| PrefixFeatureGenerator |
|
| PrefixFeatureGeneratorFactory |
|
| PreviousMapFeatureGenerator |
|
| PreviousMapFeatureGeneratorFactory |
|
| PreviousTwoMapFeatureGenerator |
|
| Prior |
This interface allows one to implement a prior distribution for use in
maximum entropy model training.
|
| Probabilities<T> |
Class implementing the probability distribution over labels returned by a classifier.
|
| Probability<T> |
Class implementing the probability for a label.
|
| ProbingLanguageDetectionResult |
|
| QNMinimizer |
Implementation of L-BFGS which supports L1-, L2-regularization
and Elastic Net for solving convex optimization problems.
|
| QNMinimizer.Evaluator |
Evaluate quality of training parameters.
|
| QNMinimizer.L2RegFunction |
L2-regularized objective function
|
| QNModel |
|
| QNModelReader |
|
| QNModelWriter |
|
| QNTrainer |
Maxent model trainer using L-BFGS algorithm.
|
| RealBasicEventStream |
|
| RealValueFileEventStream |
|
| RegexNameFinder |
Name finder based on a series of regular expressions.
|
| RegexNameFinderFactory |
Returns a RegexNameFinder based on a selection of
defaults, or on a configuration and a selection of defaults.
|
| RegexNameFinderFactory.DEFAULT_REGEX_NAME_FINDER |
|
| RegexNameFinderFactory.RegexAble |
|
| RelationAnnotation |
|
| ResetableIterator<E> |
This interface makes an Iterator resettable.
|
| ReverseListIterator<T> |
An iterator for a list which returns values in the opposite order from the typical list iterator.
|
| SDContextGenerator |
|
| SDCrossValidator |
A cross validator for the sentence detector.
|
| SDEventStream |
|
| SegmenterObjectStream<S,T> |
|
| SentenceContextGenerator |
Creates contexts/features for end-of-sentence detection in Thai text.
|
| SentenceDetector |
The interface for sentence detectors, which find the sentence boundaries in
a text.
|
| SentenceDetectorConverterTool |
|
| SentenceDetectorCrossValidatorTool |
|
| SentenceDetectorEvaluationMonitor |
|
| SentenceDetectorEvaluator |
|
| SentenceDetectorEvaluatorTool |
|
| SentenceDetectorFactory |
The factory that provides SentenceDetector default implementations and
resources.
|
| SentenceDetectorME |
A sentence detector for splitting up raw text into sentences.
|
| SentenceDetectorTool |
A sentence detector which uses a maxent model to predict the sentences.
|
| SentenceDetectorTrainerTool |
|
| SentenceEvaluationErrorListener |
|
| SentenceFeatureGenerator |
This feature generator creates sentence begin and end features.
|
| SentenceFeatureGeneratorFactory |
|
| SentenceModel |
|
| SentenceSample |
A SentenceSample contains a document with
begin indexes of the individual sentences.
|
| SentenceSampleStream |
This class is a stream filter which reads sentence-per-line samples from
a Reader and converts them into SentenceSample objects.
|
| SentenceSampleStreamFactory |
|
| Sequence<T> |
Class which models a sequence.
|
| Sequence |
Represents a weighted sequence of outcomes.
|
| SequenceClassificationModel<T> |
A classification model that can label an input sequence.
|
| SequenceCodec<T> |
|
| SequenceStream |
Interface for streams of sequences used to train sequence models.
|
| SequenceStreamEventStream |
Class which turns a sequence stream into an event stream.
|
| SequenceTrainer |
|
| SequenceValidator<T> |
|
| SerializableArtifact |
|
| SgmlParser |
SAX style SGML parser.
|
| SgmlParser.ContentHandler |
|
| ShrinkCharSequenceNormalizer |
Normalizer to shrink repeated spaces and characters.
|
| SimplePerceptronSequenceTrainer |
Trains models for sequences using the perceptron algorithm.
|
| SimpleTokenizer |
Performs tokenization using character classes.
|
| SimpleTokenizerTool |
|
| SnowballStemmer |
|
| SnowballStemmer.ALGORITHM |
|
| Span |
Class for storing start and end integer offsets.
|
| SpanAnnotation |
|
| Stemmer |
The stemmer reduces a word to its stem.
|
| StringList |
|
| StringPattern |
Recognizes predefined patterns in strings.
|
| StringUtil |
|
| SuffixFeatureGenerator |
|
| SuffixFeatureGeneratorFactory |
|
| TagDictionary |
Interface to determine which tags are valid for a particular word
based on a tag dictionary.
|
| TaggerModelReplacerTool |
|
| TokenClassFeatureGenerator |
Generates features for the class of the token.
|
| TokenClassFeatureGeneratorFactory |
|
| TokenContextGenerator |
|
| TokenEvaluationErrorListener |
|
| TokenFeatureGenerator |
Generates a feature which contains the token itself.
|
| TokenFeatureGeneratorFactory |
|
| Tokenizer |
The interface for tokenizers, which segment a string into its tokens.
|
| TokenizerConverterTool |
|
| TokenizerCrossValidator |
|
| TokenizerCrossValidatorTool |
|
| TokenizerEvaluationMonitor |
|
| TokenizerEvaluator |
|
| TokenizerFactory |
The factory that provides Tokenizer default implementations and
resources.
|
| TokenizerME |
A Tokenizer for converting raw text into separated tokens.
|
| TokenizerMEEvaluatorTool |
|
| TokenizerMETool |
|
| TokenizerModel |
|
| TokenizerModelLoader |
Loads a Tokenizer Model for the command line tools.
|
| TokenizerStream |
|
| TokenizerTrainerTool |
|
| TokenNameFinder |
The interface for name finders which provide name tags for a sequence of tokens.
|
| TokenNameFinderConverterTool |
Tool to convert multiple data formats into native OpenNLP name finder training
format.
|
| TokenNameFinderCrossValidator |
|
| TokenNameFinderCrossValidatorTool |
|
| TokenNameFinderDetailedFMeasureListener |
|
| TokenNameFinderEvaluationMonitor |
|
| TokenNameFinderEvaluator |
|
| TokenNameFinderEvaluatorTool |
|
| TokenNameFinderFactory |
|
| TokenNameFinderFineGrainedReportListener |
Generates a detailed report for the NameFinder.
|
| TokenNameFinderModel |
|
| TokenNameFinderModel.FeatureGeneratorCreationError |
|
| TokenNameFinderModelLoader |
Loads a Token Name Finder Model for the command line tools.
|
| TokenNameFinderTool |
|
| TokenNameFinderTrainerTool |
|
| TokenPatternFeatureGenerator |
Partitions tokens into sub-tokens based on character classes and generates
class features for each of the sub-tokens and combinations of those sub-tokens.
|
| TokenPatternFeatureGeneratorFactory |
|
| TokenSample |
|
| TokenSampleStream |
Class which produces an Iterator<TokenSample> from a file of space-delimited tokens.
|
| TokenSampleStream |
This class is a stream filter which reads in string encoded samples and creates
TokenSamples out of them.
|
| TokenSampleStreamFactory |
|
| TokenTag |
|
| TokSpanEventStream |
|
| TrainerFactory |
|
| TrainerFactory.TrainerType |
|
| TrainingParameters |
|
| TrainingToolParams |
Common training parameters.
|
| TrigramNameFeatureGenerator |
Adds trigram features based on tokens and token classes.
|
| TrigramNameFeatureGeneratorFactory |
|
| TwentyNewsgroupSampleStream |
|
| TwentyNewsgroupSampleStreamFactory |
|
| TwitterCharSequenceNormalizer |
Normalizer for Twitter character sequences
|
| TwoPassDataIndexer |
Collects event and context counts by making two passes over the events.
|
| UncloseableInputStream |
|
| UniformPrior |
Provide a maximum entropy model with a uniform prior.
|
| UrlCharSequenceNormalizer |
Normalizer that removes URLs and email addresses.
|
| Version |
The Version class represents the OpenNLP Tools library version.
|
| WhitespaceTokenizer |
This tokenizer uses white spaces to tokenize the input text.
|
| WhitespaceTokenStream |
This stream formats TokenSample objects into whitespace-separated
token strings.
|
| WindowFeatureGenerator |
|
| WindowFeatureGeneratorFactory |
|
| WordClusterDictionary |
|
| WordClusterDictionary.WordClusterDictionarySerializer |
|
| WordClusterFeatureGenerator |
|
| WordClusterFeatureGeneratorFactory |
Defines a word cluster generator factory; it reads an element containing
'w2vwordcluster' as a tag name; these clusters are typically produced by
word2vec or Clark POS induction systems.
|
| WordpieceTokenizer |
A WordPiece tokenizer.
|
| WordTagSampleStream |
A stream filter which reads a sentence per line which contains
words and tags in word_tag format and outputs POSSample objects.
|
| WordTagSampleStreamFactory |
Note: Do not use this class, internal use only!
|
| WordTagSampleStreamFactory.Parameters |
|
| WordVector |
A word vector.
|
| WordVectorTable |
A table that maps tokens to word vectors.
|
| WordVectorType |
|
| XmlUtil |
|