|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Objectorg.apache.mahout.text.LuceneStorageConfiguration
public class LuceneStorageConfiguration
Holds all the configuration for SequenceFilesFromLuceneStorage
, which generates a sequence file
with id as the key and a content field as value.
Constructor Summary | |
---|---|
LuceneStorageConfiguration()
|
|
LuceneStorageConfiguration(org.apache.hadoop.conf.Configuration conf)
Deserializes a LuceneStorageConfiguration from a Configuration . |
|
LuceneStorageConfiguration(org.apache.hadoop.conf.Configuration configuration,
List<org.apache.hadoop.fs.Path> indexPaths,
org.apache.hadoop.fs.Path sequenceFilesOutputPath,
String idField,
List<String> fields)
Create a configuration bean with all mandatory parameters. |
Method Summary | |
---|---|
boolean |
equals(Object o)
|
org.apache.hadoop.conf.Configuration |
getConfiguration()
|
List<String> |
getFields()
|
String |
getIdField()
|
List<org.apache.hadoop.fs.Path> |
getIndexPaths()
|
int |
getMaxHits()
|
org.apache.lucene.search.Query |
getQuery()
|
Iterator<Pair<org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>> |
getSequenceFileIterator()
Returns an Iterator which returns (Text, Text) Pair s of the produced sequence files. |
org.apache.hadoop.fs.Path |
getSequenceFilesOutputPath()
|
org.apache.lucene.document.DocumentStoredFieldVisitor |
getStoredFieldVisitor()
|
int |
hashCode()
|
void |
readFields(DataInput in)
|
org.apache.hadoop.conf.Configuration |
serialize()
Serializes this object in a Hadoop Configuration |
void |
setMaxHits(int maxHits)
|
void |
setQuery(org.apache.lucene.search.Query query)
|
void |
write(DataOutput out)
|
Methods inherited from class java.lang.Object |
---|
clone, finalize, getClass, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public LuceneStorageConfiguration(org.apache.hadoop.conf.Configuration configuration, List<org.apache.hadoop.fs.Path> indexPaths, org.apache.hadoop.fs.Path sequenceFilesOutputPath, String idField, List<String> fields)
configuration
- Hadoop configuration for writing sequencefilesindexPaths
- paths to the indexsequenceFilesOutputPath
- path to output the sequence fileidField
- field used for the key of the sequence filefields
- field(s) used for the value of the sequence filepublic LuceneStorageConfiguration()
public LuceneStorageConfiguration(org.apache.hadoop.conf.Configuration conf) throws IOException
LuceneStorageConfiguration
from a Configuration
.
conf
- the Configuration
object with a serialized LuceneStorageConfiguration
IOException
- if deserialization failsMethod Detail |
---|
public org.apache.hadoop.conf.Configuration serialize() throws IOException
Configuration
Configuration
object with a String serialization
IOException
- if serialization failspublic Iterator<Pair<org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>> getSequenceFileIterator()
Iterator
which returns (Text, Text) Pair
s of the produced sequence files.
public org.apache.hadoop.conf.Configuration getConfiguration()
public org.apache.hadoop.fs.Path getSequenceFilesOutputPath()
public List<org.apache.hadoop.fs.Path> getIndexPaths()
public String getIdField()
public List<String> getFields()
public void setQuery(org.apache.lucene.search.Query query)
public org.apache.lucene.search.Query getQuery()
public void setMaxHits(int maxHits)
public int getMaxHits()
public org.apache.lucene.document.DocumentStoredFieldVisitor getStoredFieldVisitor()
public void write(DataOutput out) throws IOException
write
in interface org.apache.hadoop.io.Writable
IOException
public void readFields(DataInput in) throws IOException
readFields
in interface org.apache.hadoop.io.Writable
IOException
public boolean equals(Object o)
equals
in class Object
public int hashCode()
hashCode
in class Object
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |