You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@nutch.apache.org by webdev1977 <we...@gmail.com> on 2010/09/23 19:44:30 UTC
Stack Trace from Crawling filesystem - OutOfMemoryError: PermGen
Space
I would appreciate any help anyone could lend.
A very deep crawl of a file system using release canidate 1.2 #4 produces
an OutOfMemory error after about two hours of running. I am parsing
html/text/tika/pdf/zip.
Any ideas?
FetcherThread" daemon prio=6 tid=0x0559dc00 nid=0xd38 runnable [0x042be000]
java.lang.Thread.State: RUNNABLE
at sun.misc.Unsafe.defineClass(Native Method)
at sun.reflect.ClassDefiner.defineClass(ClassDefiner.java:45)
at
sun.reflect.MethodAccessorGenerator$1.run(MethodAccessorGenerator.java:381)
at java.security.AccessController.doPrivileged(Native Method)
at
sun.reflect.MethodAccessorGenerator.generate(MethodAccessorGenerator.java:377)
at
sun.reflect.MethodAccessorGenerator.generateConstructor(MethodAccessorGenerator.java:76)
at
sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:30)
at
sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:2
at java.lang.reflect.Constructor.newInstance(Constructor.java:513)
at java.lang.Class.newInstance0(Class.java:355)
at java.lang.Class.newInstance(Class.java:308)
at
java.util.ResourceBundle$Control.newBundle(ResourceBundle.java:2295)
at java.util.ResourceBundle.loadBundle(ResourceBundle.java:1364)
at java.util.ResourceBundle.findBundle(ResourceBundle.java:1328)
at java.util.ResourceBundle.findBundle(ResourceBundle.java:1282)
at java.util.ResourceBundle.getBundleImpl(ResourceBundle.java:1224)
at java.util.ResourceBundle.getBundle(ResourceBundle.java:822)
at sun.util.resources.LocaleData$1.run(LocaleData.java:127)
at java.security.AccessController.doPrivileged(Native Method)
at sun.util.resources.LocaleData.getBundle(LocaleData.java:125)
at sun.util.resources.LocaleData.getTimeZoneNames(LocaleData.java:97)
at
sun.util.TimeZoneNameUtility.getBundle(TimeZoneNameUtility.java:115)
at
sun.util.TimeZoneNameUtility.retrieveDisplayNames(TimeZoneNameUtility.java:80)
at java.util.TimeZone.getDisplayNames(TimeZone.java:399)
at java.util.TimeZone.getDisplayName(TimeZone.java:350)
at java.text.SimpleDateFormat.subFormat(SimpleDateFormat.java:1110)
at java.text.SimpleDateFormat.format(SimpleDateFormat.java:899)
at java.text.SimpleDateFormat.format(SimpleDateFormat.java:869)
at java.text.DateFormat.format(DateFormat.java:316)
at
org.apache.nutch.net.protocols.HttpDateFormat.toString(HttpDateFormat.java:92)
- locked <0x24ef2e88> (a java.text.SimpleDateFormat)
at
org.apache.nutch.protocol.file.FileResponse.getFileAsHttpResponse(FileResponse.java:209)
at
org.apache.nutch.protocol.file.FileResponse.<init>(FileResponse.java:160)
at
org.apache.nutch.protocol.file.File.getProtocolOutput(File.java:77)
at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:628)
--
View this message in context: http://lucene.472066.n3.nabble.com/Stack-Trace-from-Crawling-filesystem-OutOfMemoryError-PermGen-Space-tp1569423p1569423.html
Sent from the Nutch - User mailing list archive at Nabble.com.