Save This Page
Home » nutch-1.0 » org.apache.nutch » crawl » [javadoc | source]
org.apache.nutch.crawl
public class: Injector [javadoc | source]
java.lang.Object
   org.apache.hadoop.conf.Configured
      org.apache.nutch.crawl.Injector

All Implemented Interfaces:
    org.apache.hadoop.util.Tool

This class takes a flat file of URLs and adds them to the of pages to be crawled. Useful for bootstrapping the system.
Nested Class Summary:
public static class  Injector.InjectMapper  Normalize and filter injected urls. 
public static class  Injector.InjectReducer  Combine multiple new entries for a url. 
Field Summary
public static final  Log LOG     
Constructor:
 public Injector() 
 public Injector(Configuration conf) 
Method from org.apache.nutch.crawl.Injector Summary:
inject,   main,   run
Methods from java.lang.Object:
equals,   getClass,   hashCode,   notify,   notifyAll,   toString,   wait,   wait,   wait
Method from org.apache.nutch.crawl.Injector Detail:
 public  void inject(Path crawlDb,
    Path urlDir) throws IOException 
 public static  void main(String[] args) throws Exception 
 public int run(String[] args) throws Exception