Posted to dev@nutch.apache.org by Earl Cahill <ca...@yahoo.com> on 2005/10/07 04:39:05 UTC
dtrace and nutch
I'm about to switch a box over to Solaris 10, in part
so I can try to help out with Nutch profiling via
DTrace. I'm wondering if anyone has tried it.
In my limited experience, DTrace kicks. Some info is
here:
http://www.sun.com/bigadmin/content/dtrace/
I think it could really aid in profiling a running
crawl. For all the emails that say "several hours in,
my fetch slows down," I am hoping that DTrace could
work wonders.
A brief overview. DTrace is a tracing and profiling tool
along the lines of strace or truss. Rather unobtrusively,
DTrace can attach to a process and report on what is
going on, from kernel space to user space. DTrace has a
language, called D, which allows you to hook into what is
being traced. Rumor has it that Java is very well covered,
and I think it will give you method names and the like.
You can hook in and say: when this method gets called,
increment a counter, track nanoseconds, and report every
two seconds on what gets called the most and what takes
the most time, from user space to kernel space.
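To make the count-and-time idea concrete, here is a rough sketch of the
bookkeeping involved. It is Python, not D, and it instruments from inside
the process with a decorator, which DTrace itself would not need; it only
illustrates the two aggregations described above (calls per method and
total nanoseconds per method). The names traced, report, fetch_page and
parse_page are made up for illustration and are not Nutch or DTrace names.

```python
import time
from collections import defaultdict
from functools import wraps

# Two aggregations keyed by function name: call count and total elapsed ns.
call_counts = defaultdict(int)
total_ns = defaultdict(int)


def traced(func):
    """Count each call to func and accumulate its wall-clock nanoseconds."""
    @wraps(func)
    def wrapper(*args, **kwargs):
        start = time.perf_counter_ns()
        try:
            return func(*args, **kwargs)
        finally:
            # Record the call even if func raised.
            call_counts[func.__name__] += 1
            total_ns[func.__name__] += time.perf_counter_ns() - start
    return wrapper


def report(top=5):
    """Return (most called, most time) as lists of (name, value) pairs."""
    by_calls = sorted(call_counts.items(), key=lambda kv: kv[1], reverse=True)
    by_time = sorted(total_ns.items(), key=lambda kv: kv[1], reverse=True)
    return by_calls[:top], by_time[:top]


# Hypothetical stand-ins for crawl methods; these are not Nutch code.
@traced
def fetch_page(url):
    time.sleep(0.001)  # pretend to wait on the network
    return len(url)


@traced
def parse_page(text):
    return text.split()


if __name__ == "__main__":
    for _ in range(3):
        fetch_page("http://example.com/")
    parse_page("a b c")
    most_called, most_time = report()
    print("most called:", most_called)
    print("most time (ns):", most_time)
```

In a long-running crawl, report() would be called from a timer every two
seconds or so. The point of DTrace is that the same tallies could be
gathered from outside the process, without touching the code being measured.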
I think it would be nice to pick a few key methods (or
all of them) and write some D that would track what is
going on in those methods during a crawl.
I saw a notice where Sun engineers had a booth and
offered a free iPod (I think) to anyone who had an
application that they couldn't speed up. Not sure
how it all worked out, but quite the claim.
Anyway, a world of possibilities. Just wondering if
anyone is running Nutch on Solaris, has played with
DTrace, or is interested in doing so.
Earl
__________________________________
Yahoo! Mail - PC Magazine Editors' Choice 2005
http://mail.yahoo.com
Re: dtrace and nutch
Posted by Earl Cahill <ca...@yahoo.com>.
This explains pretty well what I am talking about:
http://blogs.sun.com/roller/page/ahl/20050418#dtracing_java
Earl