Package org.apache.mahout.vectorizer.encoders

Class Summary
AdaptiveWordValueEncoder Encodes words into vectors in the same way as WordValueEncoder, while maintaining an adaptive dictionary of the values seen so far.
CachingContinuousValueEncoder  
CachingStaticWordValueEncoder  
CachingTextValueEncoder  
CachingValueEncoder Provides basic hashing semantics for encoders where the probe locations depend only on the name of the variable.
ConstantValueEncoder Encodes a constant value, as is conventionally done for a virtual bias term.
ContinuousValueEncoder Stores continuous values at a fixed, randomized location in the feature vector.
Dictionary Assigns integer codes to strings as they appear.
FeatureVectorEncoder General interface for objects that record features into a feature vector.
InteractionValueEncoder  
LuceneTextValueEncoder Encodes text using a Lucene-style tokenizer.
StaticWordValueEncoder Encodes categorical values with an unbounded vocabulary.
TextValueEncoder Encodes text that is tokenized on non-alphanumeric separators.
WordValueEncoder Encodes words as sparse vector updates to a Vector.
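
The encoders summarized above share one idea: a feature is hashed to one or more probe locations in a fixed-size vector, and its weight is added there. The following is a minimal, self-contained Java sketch of that idea, not Mahout's implementation. The class HashedEncodingSketch, its methods, the FNV-1a-style hash, and SimpleDictionary are all hypothetical names chosen for illustration, and a plain double[] stands in for the Vector that the real encoders update.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.HashMap;
import java.util.Map;

/** Illustrative sketch only; names and hash function are assumptions, not Mahout's API. */
final class HashedEncodingSketch {

    /** Hypothetical dictionary: assigns integer codes to strings as they appear. */
    static final class SimpleDictionary {
        private final Map<String, Integer> codes = new HashMap<>();

        int intern(String s) {
            Integer code = codes.get(s);
            if (code == null) {
                code = codes.size();   // next unused code
                codes.put(s, code);
            }
            return code;
        }

        int size() {
            return codes.size();
        }
    }

    /** Maps (variable name, term, probe index) to a slot in [0, numFeatures). */
    static int probeLocation(String name, String term, int probe, int numFeatures) {
        byte[] bytes = (name + ":" + term).getBytes(StandardCharsets.UTF_8);
        int h = 0x811C9DC5 ^ (probe * 0x9E3779B9);  // FNV-1a-style hash, seeded per probe
        for (byte b : bytes) {
            h ^= (b & 0xff);
            h *= 0x01000193;
        }
        return Math.floorMod(h, numFeatures);
    }

    /** Categorical word: add the weight at each probe location for (name, word). */
    static void addWord(String name, String word, double weight, int probes, double[] vector) {
        for (int p = 0; p < probes; p++) {
            vector[probeLocation(name, word, p, vector.length)] += weight;
        }
    }

    /** Continuous value: the location depends only on the variable name; the value is the weight. */
    static void addContinuous(String name, double value, int probes, double[] vector) {
        for (int p = 0; p < probes; p++) {
            vector[probeLocation(name, "", p, vector.length)] += value;
        }
    }

    /** Text: split on non-alphanumeric separators and add each token as a word. */
    static void addText(String name, String text, int probes, double[] vector) {
        for (String token : text.split("[^A-Za-z0-9]+")) {
            if (!token.isEmpty()) {
                addWord(name, token, 1.0, probes, vector);
            }
        }
    }

    public static void main(String[] args) {
        double[] v = new double[32];
        addWord("color", "red", 1.0, 2, v);           // two probes for one word
        addContinuous("price", 3.5, 1, v);            // value stored at a name-derived slot
        addText("title", "hello, hashed world", 1, v);
        System.out.println(Arrays.toString(v));
    }
}
```

Because every update is additive, two features that collide on a slot change that slot's value but not the total weight added to the vector. Using more than one probe per feature spreads each feature over several slots, which reduces the impact of any single collision.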
 



Copyright © 2008–2014 The Apache Software Foundation. All rights reserved.