You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by webdev1977 <we...@gmail.com> on 2010/09/23 19:44:30 UTC

Stack Trace from Crawling filesystem - OutOfMemoryError: PermGen Space

I would appreciate any help anyone could lend.  

A very deep crawl of a file system using release canidate 1.2 #4   produces
an OutOfMemory error after about two hours of running.  I am parsing
html/text/tika/pdf/zip.  

Any ideas?

FetcherThread" daemon prio=6 tid=0x0559dc00 nid=0xd38 runnable [0x042be000]
  java.lang.Thread.State: RUNNABLE
       at sun.misc.Unsafe.defineClass(Native Method)
       at sun.reflect.ClassDefiner.defineClass(ClassDefiner.java:45)
       at
sun.reflect.MethodAccessorGenerator$1.run(MethodAccessorGenerator.java:381)
       at java.security.AccessController.doPrivileged(Native Method)
       at
sun.reflect.MethodAccessorGenerator.generate(MethodAccessorGenerator.java:377)
       at
sun.reflect.MethodAccessorGenerator.generateConstructor(MethodAccessorGenerator.java:76)
       at
sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:30)
       at
sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:2
       at java.lang.reflect.Constructor.newInstance(Constructor.java:513)
       at java.lang.Class.newInstance0(Class.java:355)
       at java.lang.Class.newInstance(Class.java:308)
       at
java.util.ResourceBundle$Control.newBundle(ResourceBundle.java:2295)
       at java.util.ResourceBundle.loadBundle(ResourceBundle.java:1364)
       at java.util.ResourceBundle.findBundle(ResourceBundle.java:1328)
       at java.util.ResourceBundle.findBundle(ResourceBundle.java:1282)
       at java.util.ResourceBundle.getBundleImpl(ResourceBundle.java:1224)
       at java.util.ResourceBundle.getBundle(ResourceBundle.java:822)
       at sun.util.resources.LocaleData$1.run(LocaleData.java:127)
       at java.security.AccessController.doPrivileged(Native Method)
       at sun.util.resources.LocaleData.getBundle(LocaleData.java:125)
       at sun.util.resources.LocaleData.getTimeZoneNames(LocaleData.java:97)
       at
sun.util.TimeZoneNameUtility.getBundle(TimeZoneNameUtility.java:115)
       at
sun.util.TimeZoneNameUtility.retrieveDisplayNames(TimeZoneNameUtility.java:80)
       at java.util.TimeZone.getDisplayNames(TimeZone.java:399)
       at java.util.TimeZone.getDisplayName(TimeZone.java:350)
       at java.text.SimpleDateFormat.subFormat(SimpleDateFormat.java:1110)
       at java.text.SimpleDateFormat.format(SimpleDateFormat.java:899)
       at java.text.SimpleDateFormat.format(SimpleDateFormat.java:869)
       at java.text.DateFormat.format(DateFormat.java:316)
       at
org.apache.nutch.net.protocols.HttpDateFormat.toString(HttpDateFormat.java:92)
       - locked <0x24ef2e88> (a java.text.SimpleDateFormat)
       at
org.apache.nutch.protocol.file.FileResponse.getFileAsHttpResponse(FileResponse.java:209)
       at
org.apache.nutch.protocol.file.FileResponse.<init>(FileResponse.java:160)
       at
org.apache.nutch.protocol.file.File.getProtocolOutput(File.java:77)
       at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:628)
-- 
View this message in context: http://lucene.472066.n3.nabble.com/Stack-Trace-from-Crawling-filesystem-OutOfMemoryError-PermGen-Space-tp1569423p1569423.html
Sent from the Nutch - User mailing list archive at Nabble.com.