You are viewing a plain text version of this content. The canonical link for it is here.
Posted to general@james.apache.org by Jason Hunter <jh...@acm.org> on 2007/11/08 03:36:48 UTC
MarkMail: Powered by Apache James
Some of you may have seen my announcement this morning on community@
about MarkMail, a new email archiving service.
http://markmail.org/message/q2dtugicwt6kzfdj
Apache James lives at the heart of the email ingestion pipeline, and I
wanted to write in here to say thanks to the people who built such a
useful tool. Having a Java-based extensible mail processor really
simplified my life. If you see me at ApacheCon, let me know: I've got a
T-shirt for you.
Funny story: It was when hitting some odd James behavior that MarkMail
had its first "self hosting" moment. I noticed I could move files
manually into the spool directory but they wouldn't be recognized until
after a server restart. I searched on Google: "james server restart",
"apache james server restart", and things like this. No luck. James is
just too common a word. Then I remembered we'd loaded all the Apache
emails onto our test system! Duh. So I tried "list:james restart respool".
http://markmail.org/search/?q=list%3Ajames+restart+respool
Bingo!
I do have a lingering question whose answer I couldn't find in the archive:
* Is there any way to easily employ multiple match conditions
(subject is X, sender contains Y) without setting up extra processors?
-jh-
Re: MarkMail: Powered by Apache James
Posted by Danny Angus <da...@apache.org>.
On 08/11/2007, Jason Hunter <jh...@acm.org> wrote:
> 4,000,000 emails through James one day. :)
That never gets old!
d.
Re: MarkMail: Powered by Apache James
Posted by Jason Hunter <jh...@acm.org>.
Norman Maurer wrote:
> It whould be really cool if you could create an issue on jira and grant
> rights, so we can include it to our trunk.
Will do.
> BTW if you are intressted in mbox maybe mstor.sf.net is a good choice
> for you too ?
Funny you should mention mstor. We use it already. When we bulk load
from an existing mbox archive on the filesystem, we use mstor to parse
the files. Because both it and James expose the JavaMail API, we can
use the same back-end email-to-xml conversion code.
For testing purposes, I wrote an mstor-driven client that sends each
email in the mbox file through the James listener via SMTP. I pushed
4,000,000 emails through James one day. :)
-jh-
Re: MarkMail: Powered by Apache James
Posted by Norman Maurer <no...@apache.org>.
It whould be really cool if you could create an issue on jira and grant
rights, so we can include it to our trunk.
thx
BTW if you are intressted in mbox maybe mstor.sf.net is a good choice
for you too ?
bye
Norman
Am Donnerstag, den 08.11.2007, 10:56 -0800 schrieb Jason Hunter:
> Danny Angus wrote:
> > As for complex conditions you might try JSeive, or alternatively write
> > a rules based matcher yourself.
>
> Thanks for letting me know I wasn't just overlooking something. I think
> writing my own matcher would suffice for my needs.
>
> FYI, that reminds me, I wrote my own MboxMailRepository. I didn't
> contribute it back because mine only needed to be write-only (I write
> every incoming email to Mark Logic and an mbox file). In my mbox
> version I fixed a few issues with the existing one:
>
> * Reports exceptions if the mbox dir isn't there.
> * Closes files in finally block.
> * Nicely exceptions out if no disk space.
> * Ensures "envsender" in From line is one word.
>
> I'd be happy to send in the rewritten version if someone wants to put
> these fixes into the trunk.
>
> -jh-
Re: MarkMail: Powered by Apache James
Posted by Norman Maurer <no...@apache.org>.
Am Donnerstag, den 08.11.2007, 19:12 +0000 schrieb Danny Angus:
> On 08/11/2007, Jason Hunter <jh...@acm.org> wrote:
>
> > I'd be happy to send in the rewritten version if someone wants to put
> > these fixes into the trunk.
>
> Ill do that, all contributions are more than welcome :-)
> If you could create an issue in JIRA, attach the patch/file whatever
> and assign it to me that would help us not to forget it!
>
> d.
+1
Norman
Re: MarkMail: Powered by Apache James
Posted by Danny Angus <da...@apache.org>.
On 08/11/2007, Jason Hunter <jh...@acm.org> wrote:
> I'd be happy to send in the rewritten version if someone wants to put
> these fixes into the trunk.
Ill do that, all contributions are more than welcome :-)
If you could create an issue in JIRA, attach the patch/file whatever
and assign it to me that would help us not to forget it!
d.
Re: MarkMail: Powered by Apache James
Posted by Jason Hunter <jh...@acm.org>.
Danny Angus wrote:
> As for complex conditions you might try JSeive, or alternatively write
> a rules based matcher yourself.
Thanks for letting me know I wasn't just overlooking something. I think
writing my own matcher would suffice for my needs.
FYI, that reminds me, I wrote my own MboxMailRepository. I didn't
contribute it back because mine only needed to be write-only (I write
every incoming email to Mark Logic and an mbox file). In my mbox
version I fixed a few issues with the existing one:
* Reports exceptions if the mbox dir isn't there.
* Closes files in finally block.
* Nicely exceptions out if no disk space.
* Ensures "envsender" in From line is one word.
I'd be happy to send in the rewritten version if someone wants to put
these fixes into the trunk.
-jh-
RE: JAMES at ApacheCon US
Posted by "Noel J. Bergman" <no...@devtech.com>.
Danny Angus wrote:
> Noel J. Bergman <no...@devtech.com> wrote:
> > Aren't I doing it for you?
> Oh!, I thought that they were pretty down on that.
Somewhat to my surprise after the initial comments, it was decided to run it, anyway.
> > Seems that I am: http://us.apachecon.com/us2007/program/talk/1861.
> > I still need your slides.
> You'll need my notes too?
Whatever you've got.
--- Noel
Re: JAMES at ApacheCon US
Posted by Danny Angus <da...@apache.org>.
On 08/11/2007, Noel J. Bergman <no...@devtech.com> wrote:
> > Aren't I doing it for you?
Oh!, I thought that they were pretty down on that.
>
> Seems that I am: http://us.apachecon.com/us2007/program/talk/1861. I still need your slides.
You'll need my notes too?
>
> Jason, do you want to guest-star and add some information (ad hoc if you want) about how JAMES is core to MarkMail?
>
> --- Noel
>
>
>
>
Re: JAMES at ApacheCon US
Posted by Norman Maurer <no...@apache.org>.
Am Donnerstag, den 08.11.2007, 10:46 -0800 schrieb Jason Hunter:
> Noel J. Bergman wrote:
> > Noel J. Bergman wrote:
> >
> >>> Unfortunately I won't be at ACUS, and I've had to pull my James talk :-(
> >> Aren't I doing it for you?
> >
> > Seems that I am: http://us.apachecon.com/us2007/program/talk/1861. I still need your slides.
> >
> > Jason, do you want to guest-star and add some information (ad hoc if you want) about how JAMES is core to MarkMail?
>
> Oh sure, I'd enjoy that. I was already planning to attend that talk.
>
I really whould like to be there :-/ Hopefully nexttime on ApacheCon EU
08
bye
Norman
Re: JAMES at ApacheCon US
Posted by Jason Hunter <jh...@acm.org>.
Noel J. Bergman wrote:
> Noel J. Bergman wrote:
>
>>> Unfortunately I won't be at ACUS, and I've had to pull my James talk :-(
>> Aren't I doing it for you?
>
> Seems that I am: http://us.apachecon.com/us2007/program/talk/1861. I still need your slides.
>
> Jason, do you want to guest-star and add some information (ad hoc if you want) about how JAMES is core to MarkMail?
Oh sure, I'd enjoy that. I was already planning to attend that talk.
-jh-
RE: JAMES at ApacheCon US
Posted by "Noel J. Bergman" <no...@devtech.com>.
Noel J. Bergman wrote:
> > Unfortunately I won't be at ACUS, and I've had to pull my James talk :-(
> Aren't I doing it for you?
Seems that I am: http://us.apachecon.com/us2007/program/talk/1861. I still need your slides.
Jason, do you want to guest-star and add some information (ad hoc if you want) about how JAMES is core to MarkMail?
--- Noel
JAMES at ApacheCon US
Posted by "Noel J. Bergman" <no...@devtech.com>.
> Unfortunately I won't be at ACUS, and I've had to pull my James talk :-(
Aren't I doing it for you?
--- Noel
Re: MarkMail: Powered by Apache James
Posted by Danny Angus <da...@apache.org>.
Jason,
Great stuff! I love it when people use James :-) Most people seem to
quietly get on with it, thanks for sharing with us. I already loved
MarkMail, this makes me love it even more!
Unfortunately I won't be at ACUS, and I've had to pull my James talk :-(
As for complex conditions you might try JSeive, or alternatively write
a rules based matcher yourself.
We did, at one time, have *some* ideas for constructing nesting and
conditionality in matcher configs, but it hasn't got off the ground.
@Work we recently developed an xml schema for constructing complex
conditions, perhaps I should resurrect the idea.
d.
RE: MarkMail: Powered by Apache James
Posted by "Noel J. Bergman" <no...@devtech.com>.
> > drools seems particularly popular right now but most JSR-94 implementation would
> > probably do the job.
And don't forget that we already have JSieve.
--- Noel
Re: MarkMail: Powered by Apache James
Posted by Danny Angus <da...@apache.org>.
On 09/11/2007, Robert Burrell Donkin <ro...@gmail.com> wrote:
> drools seems particularly popular right now but most JSR-94 implementation would
> probably do the job. i suspect that the work invovled wouldn't be
> great. anyone fancy taking this one on?
Drools was exactly my first thought, its a wonder that we hadn't
thought of it before.
I'll raise an issue in JIRA so we don't forget. Hopefully I'll be able
to do the job too.
d.
Re: MarkMail: Powered by Apache James
Posted by Robert Burrell Donkin <ro...@gmail.com>.
On Nov 9, 2007 11:10 AM, Jason Hunter <jh...@acm.org> wrote:
> Robert Burrell Donkin wrote:
>
> > any chance of adding searchable archives for some of the email
> > specification lists? (apache is a rank 9 site but most of the email
> > specification archives seem to be buried on low ranked sites and are
> > effectively invisible to conventional search engines)
>
> Like the piano player at the bar, we take requests. :)
>
> Which exact lists are you referring to? The more details you can give,
> the easier it is to include them.
http://www3.ietf.org/proceedings/94mar/charters/imap-charter.html
* General Discussion <im...@cac.washington.edu>
* To Subscribe <im...@cac.washington.edu>
* Archive <ftp.cac.washington.edu:~/imap/imap_archive>
http://www.ietf.org/html.charters/sieve-charter.html
General Discussion: ietf-mta-filters@imc.org
To Subscribe: ietf-mta-filters-request@imc.org
In Body: body=subscribe
Archive: http://www.imc.org/ietf-mta-filters/mail-archive/
http://www.imc.org/ietf-imapext/
would be very useful
but generally, w3c (http://www.w3.org/Mail/) and IEFT lists
(http://www.ietf.org/WG-WEB-Mail.html) would be great
- robert
Re: MarkMail: Powered by Apache James
Posted by Jason Hunter <jh...@acm.org>.
Robert Burrell Donkin wrote:
> any chance of adding searchable archives for some of the email
> specification lists? (apache is a rank 9 site but most of the email
> specification archives seem to be buried on low ranked sites and are
> effectively invisible to conventional search engines)
Like the piano player at the bar, we take requests. :)
Which exact lists are you referring to? The more details you can give,
the easier it is to include them.
-jh-
Re: MarkMail: Powered by Apache James
Posted by Robert Burrell Donkin <ro...@gmail.com>.
On Nov 8, 2007 9:02 AM, Stefano Bagnara <ap...@bago.org> wrote:
> Jason Hunter ha scritto:
> > Some of you may have seen my announcement this morning on community@
> > about MarkMail, a new email archiving service.
> > [...]
> > Apache James lives at the heart of the email ingestion pipeline, and I
> > wanted to write in here to say thanks to the people who built such a
> > useful tool. Having a Java-based extensible mail processor really
> > simplified my life. If you see me at ApacheCon, let me know: I've got a
> > T-shirt for you.
> > [...]
>
> Cool! Thank you for sharing your experience!
+1
(for those who don't know, jason is one of the greats from the last
days of java@apache and the earliest days of jakarta so it's
especially cool to find out what he's been working on)
been meaning to blog about markmail since this concept is one of my
long term email itches - great to see a working implementation :-)
any chance of adding searchable archives for some of the email
specification lists? (apache is a rank 9 site but most of the email
specification archives seem to be buried on low ranked sites and are
effectively invisible to conventional search engines)
> > I do have a lingering question whose answer I couldn't find in the archive:
> >
> > * Is there any way to easily employ multiple match conditions
> > (subject is X, sender contains Y) without setting up extra processors?
>
> No, unfortunately there's no way. Programmatically it would be easy to
> support something similar, but it's hard to find out an easy xml
> configuration to define complex expressions, so we never added this feature.
a rules engine supporting xml would probably do the job. drools seems
particularly popular right now but most JSR-94 implementation would
probably do the job. i suspect that the work invovled wouldn't be
great. anyone fancy taking this one on?
- robert
Re: MarkMail: Powered by Apache James
Posted by Stefano Bagnara <ap...@bago.org>.
Jason Hunter ha scritto:
> Some of you may have seen my announcement this morning on community@
> about MarkMail, a new email archiving service.
> [...]
> Apache James lives at the heart of the email ingestion pipeline, and I
> wanted to write in here to say thanks to the people who built such a
> useful tool. Having a Java-based extensible mail processor really
> simplified my life. If you see me at ApacheCon, let me know: I've got a
> T-shirt for you.
> [...]
Cool! Thank you for sharing your experience!
> I do have a lingering question whose answer I couldn't find in the archive:
>
> * Is there any way to easily employ multiple match conditions
> (subject is X, sender contains Y) without setting up extra processors?
No, unfortunately there's no way. Programmatically it would be easy to
support something similar, but it's hard to find out an easy xml
configuration to define complex expressions, so we never added this feature.
Stefano