Save This Page
Home » nutch-1.0 » org.apache.nutch » crawl » [javadoc | source]
org.apache.nutch.crawl
public static class: Generator.Selector [javadoc | source]
java.lang.Object
   org.apache.nutch.crawl.Generator$Selector

All Implemented Interfaces:
    org.apache.hadoop.mapred.Mapper, org.apache.hadoop.mapred.Reducer, Partitioner

Selects entries due for fetch.
Method from org.apache.nutch.crawl.Generator$Selector Summary:
close,   configure,   getPartition,   map,   reduce
Methods from java.lang.Object:
equals,   getClass,   hashCode,   notify,   notifyAll,   toString,   wait,   wait,   wait
Method from org.apache.nutch.crawl.Generator$Selector Detail:
 public  void close() 
 public  void configure(JobConf job) 
 public int getPartition(FloatWritable key,
    Writable value,
    int numReduceTasks) 
    Partition by host.
 public  void map(Text key,
    CrawlDatum value,
    OutputCollector output,
    Reporter reporter) throws IOException 
    Select & invert subset due for fetch.
 public  void reduce(FloatWritable key,
    Iterator values,
    OutputCollector output,
    Reporter reporter) throws IOException 
    Collect until limit is reached.