org.apache.nutch.crawl
public class LinkDb
java.lang.Object
   org.apache.hadoop.conf.Configured
      org.apache.nutch.crawl.LinkDb

All Implemented Interfaces:
    org.apache.hadoop.mapred.Mapper, org.apache.hadoop.util.Tool

Maintains an inverted link map, listing the incoming links for each URL.
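
To make "inverted link map" concrete, here is a minimal in-memory sketch of the idea: a map from each page to its outgoing links is turned into a map from each target URL to the pages that link to it. This is not the LinkDb implementation. LinkDb performs the inversion as a Hadoop job over ParseData records, while this sketch uses only java.util collections. The class name InvertLinksSketch, the helper invert(Map) and the example URLs are hypothetical and exist only for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; not part of Nutch.
public class InvertLinksSketch {

    /** Turns "page -> outgoing links" into "target URL -> pages linking to it". */
    static Map<String, List<String>> invert(Map<String, List<String>> outlinks) {
        Map<String, List<String>> inlinks = new LinkedHashMap<>();
        for (Map.Entry<String, List<String>> page : outlinks.entrySet()) {
            String fromUrl = page.getKey();
            for (String toUrl : page.getValue()) {
                // Record fromUrl as an incoming link of toUrl.
                inlinks.computeIfAbsent(toUrl, k -> new ArrayList<>()).add(fromUrl);
            }
        }
        return inlinks;
    }

    public static void main(String[] args) {
        // Assumed sample data: two pages and their outgoing links.
        Map<String, List<String>> outlinks = new LinkedHashMap<>();
        outlinks.put("http://a.example/",
                Arrays.asList("http://b.example/", "http://c.example/"));
        outlinks.put("http://b.example/",
                Arrays.asList("http://c.example/"));

        // Print each URL followed by the pages that link to it.
        for (Map.Entry<String, List<String>> e : invert(outlinks).entrySet()) {
            System.out.println(e.getKey() + " <- " + e.getValue());
        }
    }
}
```

In this sample, http://c.example/ ends up with two incoming links (from a.example and b.example), and http://a.example/ does not appear in the result because nothing links to it.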
Field Summary
public static final  Log LOG     
public static final  String CURRENT_NAME     
public static final  String LOCK_NAME     
Constructors:
 public LinkDb() 
 public LinkDb(Configuration conf) 
Methods from org.apache.nutch.crawl.LinkDb (summary):
close,   configure,   install,   invert,   invert,   main,   map,   run
Methods from java.lang.Object:
equals,   getClass,   hashCode,   notify,   notifyAll,   toString,   wait,   wait,   wait
Methods from org.apache.nutch.crawl.LinkDb (detail):
 public  void close() 
 public  void configure(JobConf job) 
 public static  void install(JobConf job,
    Path linkDb) throws IOException 
 public  void invert(Path linkDb,
    Path segmentsDir,
    boolean normalize,
    boolean filter,
    boolean force) throws IOException 
 public  void invert(Path linkDb,
    Path[] segments,
    boolean normalize,
    boolean filter,
    boolean force) throws IOException 
 public static  void main(String[] args) throws Exception 
 public  void map(Text key,
    ParseData parseData,
    OutputCollector output,
    Reporter reporter) throws IOException 
 public int run(String[] args) throws Exception