Posted to dev@oodt.apache.org by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> on 2012/06/22 15:58:54 UTC

New Met Extractors for Crawler

Hey Ricky,

I just read your page:

https://cwiki.apache.org/confluence/display/OODT/MetExtractors+for+Crawler

Super awesome! My +1: the ProdTypePatternMetExtractor sounds super
useful.

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattmann@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++


Re: New Met Extractors for Crawler

Posted by "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov>.
Hey Ricky,

On Jun 22, 2012, at 5:37 PM, Nguyen, Ricky wrote:

> Thanks guys. It overlaps slightly with AutoDetectProductCrawler and FilenameTokenMetExtractor. But IMHO, it's less work than defining mime-type mappings, and I'm not restricted to using a single token separator.

Yep, gotcha. I'm all for having things that do similar things but in easier
ways. I think it adds functionality and provides users with another option
to leverage.

> 
> Does it sound like something good for OODT 0.5?

+1 for sure!

Cheers,
Chris



Re: New Met Extractors for Crawler

Posted by "Nguyen, Ricky" <rn...@chla.usc.edu>.
Thanks guys. It overlaps slightly with AutoDetectProductCrawler and FilenameTokenMetExtractor. But IMHO, it's less work than defining mime-type mappings, and I'm not restricted to using a single token separator.

Does it sound like something good for OODT 0.5?
-ricky

On Jun 22, 2012, at 11:20 AM, Brian Foster wrote:

+1

On Jun 22, 2012, at 06:58 AM, "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> wrote:

Hey Ricky,

I just read your page:

https://cwiki.apache.org/confluence/display/OODT/MetExtractors+for+Crawler

Super awesome! My +1: the ProdTypePatternMetExtractor sounds super
useful.

Cheers,
Chris



Re: New Met Extractors for Crawler

Posted by Brian Foster <ho...@me.com>.
+1

On Jun 22, 2012, at 06:58 AM, "Mattmann, Chris A (388J)" <ch...@jpl.nasa.gov> wrote:

Hey Ricky,

I just read your page:

https://cwiki.apache.org/confluence/display/OODT/MetExtractors+for+Crawler

Super awesome! My +1: the ProdTypePatternMetExtractor sounds super
useful.

Cheers,
Chris
