Package org.apache.mahout.vectorizer.collocations.llr

Interface Summary
LLRReducer.LLCallback provide interface so the input to the llr calculation can be captured for validation in unit testing
 

Class Summary
CollocCombiner Combiner for pass1 of the CollocationDriver.
CollocDriver Driver for LLR Collocation discovery mapreduce job
CollocMapper Pass 1 of the Collocation discovery job which generated ngrams and emits ngrams an their component n-1grams.
CollocReducer Reducer for Pass 1 of the collocation identification job.
Gram Writable for holding data generated from the collocation discovery jobs.
GramKey A GramKey, based on the identity fields of Gram (type, string) plus a byte[] used for secondary ordering
GramKeyPartitioner Partition GramKeys based on their Gram, ignoring the secondary sort key so that all GramKeys with the same gram are sent to the same partition.
LLRReducer Reducer for pass 2 of the collocation discovery job.
LLRReducer.ConcreteLLCallback concrete implementation delegates to LogLikelihood class
 

Enum Summary
CollocMapper.Count  
CollocReducer.Skipped  
Gram.Type  
LLRReducer.Skipped Counter to track why a particlar entry was skipped
 



Copyright © 2008–2014 The Apache Software Foundation. All rights reserved.