nutch 2.2.1 运行报错InjectorJob: java.lang.RuntimeException: job failed: name=inject

问题描述

在windows下的eclipse中运行报错,如下,求各位大神帮帮忙,本人小白一个,急急急急InjectorJob:startingat2014-06-2615:45:36InjectorJob:InjectingurlDir:urlsInjectorJob:Usingclassorg.apache.gora.memory.store.MemStoreastheGorastorageclass.****file:/D:/workspace/nutchTest/urlsInjectorJob:java.lang.RuntimeException:jobfailed:name=injecturls,jobid=job_local_0001atorg.apache.nutch.util.NutchJob.waitForCompletion(NutchJob.java:54)atorg.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:233)atorg.apache.nutch.crawl.InjectorJob.inject(InjectorJob.java:251)atorg.apache.nutch.crawl.InjectorJob.run(InjectorJob.java:273)atorg.apache.hadoop.util.ToolRunner.run(ToolRunner.java:65)atorg.apache.nutch.crawl.InjectorJob.main(InjectorJob.java:282)

解决方案

解决方案二:
看下logs下的hadoop.log日志信息
解决方案三:
下面是日志内容2014-06-3010:07:25,234INFOcrawl.InjectorJob-InjectorJob:Usingclassorg.apache.gora.sql.store.SqlStoreastheGorastorageclass.2014-06-3010:07:25,875WARNutil.NativeCodeLoader-Unabletoloadnative-hadooplibraryforyourplatform...usingbuiltin-javaclasseswhereapplicable2014-06-3010:07:26,046WARNmapred.JobClient-Nojobjarfileset.Userclassesmaynotbefound.SeeJobConf(Class)orJobConf#setJar(String).2014-06-3010:07:26,203WARNsnappy.LoadSnappy-Snappynativelibrarynotloaded2014-06-3010:07:27,078INFOmapreduce.GoraRecordWriter-gora.buffer.write.limit=100002014-06-3010:07:27,375WARNplugin.PluginRepository-java.io.FileNotFoundException:D:workspacenutchTest.srcplugin.svnplugin.xml(系统找不到指定的文件。)2014-06-3010:07:29,968WARNregex.RegexURLNormalizer-Can'tloadthedefaultrules!2014-06-3010:07:30,343INFOregex.RegexURLNormalizer-can'tfindrulesforscope'inject',usingdefault2014-06-3010:07:32,953WARNmapred.FileOutputCommitter-Outputpathisnullincleanup2014-06-3010:07:33,703INFOcrawl.InjectorJob-InjectorJob:totalnumberofurlsrejectedbyfilters:02014-06-3010:07:33,703INFOcrawl.InjectorJob-InjectorJob:totalnumberofurlsinjectedafternormalizationandfiltering:22014-06-3010:07:33,875INFOcrawl.FetchScheduleFactory-UsingFetchScheduleimpl:org.apache.nutch.crawl.DefaultFetchSchedule2014-06-3010:07:33,875INFOcrawl.AbstractFetchSchedule-defaultInterval=25920002014-06-3010:07:33,875INFOcrawl.AbstractFetchSchedule-maxInterval=77760002014-06-3010:07:34,281WARNmapred.JobClient-Nojobjarfileset.Userclassesmaynotbefound.SeeJobConf(Class)orJobConf#setJar(String).2014-06-3010:07:34,703INFOmapreduce.GoraRecordReader-gora.buffer.read.limit=100002014-06-3010:07:35,281WARNregex.RegexURLNormalizer-Can'tloadthedefaultrules!2014-06-3010:07:35,281WARNregex.RegexURLNormalizer-Can'tloadthedefaultrules!2014-06-3010:07:35,281INFOcrawl.FetchScheduleFactory-UsingFetchScheduleimpl:org.apache.nutch.crawl.DefaultFetchSchedule2014-06-3010:07:35,281INFOcrawl.AbstractFetchSchedule-defaultInterval=25920002014-06-3010:07:35,281INFOcrawl.AbstractFetchSchedule-maxInterval=77760002014-06-3010:07:35,359INFOregex.RegexURLNormalizer-can'tfindrulesforscope'generate_host_count',usingdefault2014-06-3010:07:37,703INFOmapreduce.GoraRecordWriter-gora.buffer.write.limit=100002014-06-3010:07:37,750WARNmapred.FileOutputCommitter-Outputpathisnullincleanup2014-06-3010:07:37,750WARNmapred.LocalJobRunner-job_local_0002java.lang.NullPointerExceptionatorg.apache.avro.util.Utf8.<init>(Utf8.java:37)atorg.apache.nutch.crawl.GeneratorReducer.setup(GeneratorReducer.java:100)atorg.apache.hadoop.mapreduce.Reducer.run(Reducer.java:174)atorg.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:649)atorg.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:417)atorg.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:260)
解决方案四:
请我贴住解决了吗?我也出了这个问题
解决方案五:
原因是没有找到插件地址,将nutch-site.xml中的<property><name>plugin.folders</name><value>src/plugin</value></property>改为:<property><name>plugin.folders</name><value>plugins</value></property>
解决方案六:
给个地址吧:http://my.oschina.net/DLow/blog/294951,上面这个说法有问题,第一种写法是在IDE中,第二种是用命令行方式,

时间: 2016-08-15

nutch 2.2.1 运行报错InjectorJob: java.lang.RuntimeException: job failed: name=inject的相关文章

集成EaseUI后运行报错:java.lang.NoClassDefFoundError: android.support.v7.appcompat.R$layout

问题描述 具体报错信息06-23 09:26:58.154 7443-7443/com.*****.***** E/AndroidRuntime: FATAL EXCEPTION: main                                                                            Process: com.*****.*****, PID: 7443                                           

我的Android进阶之旅------&amp;gt;Android中MediaRecorder.stop()报错 java.lang.RuntimeException: stop failed.

今天在调用MediaRecorder.stop(),报错了,java.lang.RuntimeException: stop failed. E/AndroidRuntime(7698): Cause by: java.lang.RuntimeException: stop failed. E/AndroidRuntime(7698): at android.media.MediaRecorder.stop(Native Method) E/AndroidRuntime(7698): at co

JUnit报错:java.lang.ClassNotFoundException: com.mogodb.test.test

最近在使用JUnit做单元测试时,发现新写好的测试类运行总是出错,报找不到类异常. Class not found com.mogodb.test.test java.lang.ClassNotFoundException: com.mogodb.test.test at java.net.URLClassLoader$1.run(URLClassLoader.java:202) at java.security.AccessController.doPrivileged(Native Meth

jni-JAVA MVC项目报错:java.lang.NoClassDefFoundError:

问题描述 JAVA MVC项目报错:java.lang.NoClassDefFoundError: 错误信息如下: java.lang.NoClassDefFoundError: Could not initialize class SketchYourHouse.MediumInCJSketchYourHouse.SketchBeans.savePic(SketchBeans.java:56)SketchYourHouse.SketchServlet.doPost(SketchServlet.

Tomcat启动报错:java.lang.IllegalArgumentException: Can&amp;#39;t convert argument:null

一.发现问题为了避免导入的项目重名,我先修改了前一个项目的名称. 重新启动该项目至tomcat,报错:java.lang.IllegalArgumentException: Can't convert argument: null   二.发现问题因为MyEclipse在修改项目名时候,自动更新部署了web.xml文件 并且重新生成了xml文件的头部声明. 新增了java的命名把这个javaee去掉就可以了,一般情况下他会修改servlet的配置. <filter>      <filt

Struts2配置报错:java.lang.ClassNotFoundException

问题描述 Struts2配置报错:java.lang.ClassNotFoundException 严重: Exception starting filter struts2 java.lang.ClassNotFoundException: org.apache.struts2.dispatcher.ng.filter .StrutsPrepareAndExecuteFilter at org.apache.catalina.loader.WebappClassLoader.loadClass

【JUnit 报错】java.lang.NoClassDefFoundError: org/apache/logging/log4j/message/Message

  使用JUnit的时候,报错:java.lang.NoClassDefFoundError: org/apache/logging/log4j/message/Message 原因是因为项目中导入的架包有冲突: 只留下中间的log4j-1.2.17.jar,上下的两个都不要了. 然后 好了  

nutch Exception in thread &amp;amp;quot;main&amp;amp;quot; java.lang.RuntimeException: job failed: name=ge

问题描述 [email protected]:~/apache-nutch-2.2.1/runtime/local$bin/nutchcrawlurls/seed.txt-dircrawl-depth3-topN5InjectorJob:Usingclassorg.apache.gora.memory.store.MemStoreastheGorastorageclass.InjectorJob:totalnumberofurlsrejectedbyfilters:0Inje

Ensemble.Tofino运行报错Unexpected java bridge exception的解决

作为Visual Studio插件的Flex开发工具,Ensemble公司的Tofino确实很吸引人,包含编译环境的安装文件总共才122M,比起Flex Builder要不少.而且,对于熟悉了VS开发环境的.Net程序员,用这样的插件来做Flex开发方便了很多. 从官网下载安装文件,选择122M的包含开发编译环境的安装包下载.关闭Visual Studio,安装成功后开启Visual Studio,正常情况下Flex项目会出现在新建项目的页面上.新建一个Flex工程,如果你机器的java环境配置