org.apache.mahout.cf.taste.hadoop.als
Class ParallelALSFactorizationJob

java.lang.Object
  extended by org.apache.hadoop.conf.Configured
      extended by org.apache.mahout.common.AbstractJob
          extended by org.apache.mahout.cf.taste.hadoop.als.ParallelALSFactorizationJob
All Implemented Interfaces:
org.apache.hadoop.conf.Configurable, org.apache.hadoop.util.Tool

public class ParallelALSFactorizationJob
extends AbstractJob

MapReduce implementation of the two factorization algorithms described in

"Large-scale Parallel Collaborative Filtering for the Netflix Prize" available at http://www.hpl.hp.com/personal/Robert_Schreiber/papers/2008%20AAIM%20Netflix/netflix_aaim08(submitted).pdf.

"

Collaborative Filtering for Implicit Feedback Datasets" available at http://research.yahoo.com/pub/2433

Command line arguments specific to this class are:

  1. --input (path): Directory containing one or more text files with the dataset
  2. --output (path): path where output should go
  3. --lambda (double): regularization parameter to avoid overfitting
  4. --userFeatures (path): path to the user feature matrix
  5. --itemFeatures (path): path to the item feature matrix
  6. --numThreadsPerSolver (int): threads to use per solver mapper, (default: 1)


Field Summary
 
Fields inherited from class org.apache.mahout.common.AbstractJob
argMap, inputFile, inputPath, outputFile, outputPath, tempPath
 
Constructor Summary
ParallelALSFactorizationJob()
           
 
Method Summary
static void main(String[] args)
           
 int run(String[] args)
           
 
Methods inherited from class org.apache.mahout.common.AbstractJob
addFlag, addInputOption, addOption, addOption, addOption, addOption, addOutputOption, buildOption, buildOption, getAnalyzerClassFromOption, getCLIOption, getConf, getDimensions, getFloat, getFloat, getGroup, getInputFile, getInputPath, getInt, getInt, getOption, getOption, getOption, getOptions, getOutputFile, getOutputPath, getOutputPath, getTempPath, getTempPath, hasOption, keyFor, maybePut, parseArguments, parseArguments, parseDirectories, prepareJob, prepareJob, prepareJob, prepareJob, setConf, setS3SafeCombinedInputPath, shouldRunNextPhase
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

ParallelALSFactorizationJob

public ParallelALSFactorizationJob()
Method Detail

main

public static void main(String[] args)
                 throws Exception
Throws:
Exception

run

public int run(String[] args)
        throws Exception
Throws:
Exception


Copyright © 2008–2014 The Apache Software Foundation. All rights reserved.