You are viewing a plain text version of this content. The canonical link for it is here.
Posted to dev@community.apache.org by Karanjeet Singh <ka...@usc.edu> on 2016/01/31 21:52:20 UTC

DRAT is now scanning Apache SVN code base!

Hello Everyone,

With great pleasure, I would like to introduce DRAT (Distributed Release
Audit Tool) which is a distributed, parallelized wrapper around Apache RAT
to inspect for appropriate open source licensing in software projects.
DRAT was started by my advisor, Chris Mattmann, in an effort to get RAT
working on a ver large code base. RAT uses Apache OODT, Apache Tika, and
Apache Solr.

We are now auditing the complete Apache SVN code base to check for proper
licenses. Until now, we have scanned 171 / 191 repositories and
illustrated the statistics for 133 of them through D3 visualization
located at http://drat.dyndns.org:8080/dratviz

Projects should check out the MIME analysis of the code base and click
around. Please also note due to the sheer size of the Apache code bases
and the fact that we scanned and included all revisions in the Apache SVN
repo, DRAT is not running in real time. We are running DRAT on the NSF
Super Computer Wrangler, which has a petabyte of flash storage and the
ability to stand up Hadoop and Spark clusters. We are also working on a
paper describing our results.

Please send feedback to myself (Karanjeet Singh <ka...@usc.edu>),
Professor Mattmann <ma...@usc.edu> and/or irds-L@mymaillists.usc.edu.

Thanks & Regards,
Karanjeet Singh
C.S. Graduate Student
University of Southern California
karanjes@usc.edu | +1-213-675-9583

Fwd: DRAT is now scanning Apache SVN code base!

Posted by Jacques Le Roux <ja...@les7arts.com>.
Just FYI: OFBiz has still not be scanned http://drat.dyndns.org:8080/dratviz/

Jacques

-------- Message transféré --------
Sujet : 	DRAT is now scanning Apache SVN code base!
Date : 	Sun, 31 Jan 2016 12:52:20 -0800
De : 	Karanjeet Singh <ka...@usc.edu>
Répondre à : 	dev@community.apache.org
Pour : 	dev@community.apache.org
Copie à : 	Christian Alan Mattmann <ma...@usc.edu>



Hello Everyone,

With great pleasure, I would like to introduce DRAT (Distributed Release
Audit Tool) which is a distributed, parallelized wrapper around Apache RAT
to inspect for appropriate open source licensing in software projects.
DRAT was started by my advisor, Chris Mattmann, in an effort to get RAT
working on a ver large code base. RAT uses Apache OODT, Apache Tika, and
Apache Solr.

We are now auditing the complete Apache SVN code base to check for proper
licenses. Until now, we have scanned 171 / 191 repositories and
illustrated the statistics for 133 of them through D3 visualization
located at http://drat.dyndns.org:8080/dratviz

Projects should check out the MIME analysis of the code base and click
around. Please also note due to the sheer size of the Apache code bases
and the fact that we scanned and included all revisions in the Apache SVN
repo, DRAT is not running in real time. We are running DRAT on the NSF
Super Computer Wrangler, which has a petabyte of flash storage and the
ability to stand up Hadoop and Spark clusters. We are also working on a
paper describing our results.

Please send feedback to myself (Karanjeet Singh <ka...@usc.edu>),
Professor Mattmann <ma...@usc.edu> and/or irds-L@mymaillists.usc.edu.

Thanks & Regards,
Karanjeet Singh
C.S. Graduate Student
University of Southern California
karanjes@usc.edu | +1-213-675-9583




Re: DRAT is now scanning Apache SVN code base!

Posted by Don Cunningham <ot...@gmail.com>.
On Feb 3, 2016 4:07 AM, "Don Cunningham" <ot...@gmail.com> wrote:

> On Jan 31, 2016 3:52 PM, "Karanjeet Singh" <ka...@usc.edu> wrote:
>
>> Hello Everyone,
>>
>> With great pleasure, I would like to introduce DRAT (Distributed Release
>> Audit Tool) which is a distributed, parallelized wrapper around Apache RAT
>> to inspect for appropriate open source licensing in software projects.
>> DRAT was started by my advisor, Chris Mattmann, in an effort to get RAT
>> working on a ver large code base. RAT uses Apache OODT, Apache Tika, and
>> Apache Solr.
>>
>> We are now auditing the complete Apache SVN code base to check for proper
>> licenses. Until now, we have scanned 171 / 191 repositories and
>> illustrated the statistics for 133 of them through D3 visualization
>> located at http://drat.dyndns.org:8080/dratviz
>>
>> Projects should check out the MIME analysis of the code base and click
>> around. Please also note due to the sheer size of the Apache code bases
>> and the fact that we scanned and included all revisions in the Apache SVN
>> repo, DRAT is not running in real time. We are running DRAT on the NSF
>> Super Computer Wrangler, which has a petabyte of flash storage and the
>> ability to stand up Hadoop and Spark clusters. We are also working on a
>> paper describing our results.
>>
>> Please send feedback to myself (Karanjeet Singh <ka...@usc.edu>),
>> Professor Mattmann <ma...@usc.edu> and/or irds-L@mymaillists.usc.edu.
>>
>> Thanks & Regards,
>> Karanjeet Singh
>> C.S. Graduate Student
>> University of Southern California
>> karanjes@usc.edu | +1-213-675-9583
>>
>

Re: DRAT is now scanning Apache SVN code base!

Posted by Don Cunningham <ot...@gmail.com>.
On Jan 31, 2016 3:52 PM, "Karanjeet Singh" <ka...@usc.edu> wrote:

> Hello Everyone,
>
> With great pleasure, I would like to introduce DRAT (Distributed Release
> Audit Tool) which is a distributed, parallelized wrapper around Apache RAT
> to inspect for appropriate open source licensing in software projects.
> DRAT was started by my advisor, Chris Mattmann, in an effort to get RAT
> working on a ver large code base. RAT uses Apache OODT, Apache Tika, and
> Apache Solr.
>
> We are now auditing the complete Apache SVN code base to check for proper
> licenses. Until now, we have scanned 171 / 191 repositories and
> illustrated the statistics for 133 of them through D3 visualization
> located at http://drat.dyndns.org:8080/dratviz
>
> Projects should check out the MIME analysis of the code base and click
> around. Please also note due to the sheer size of the Apache code bases
> and the fact that we scanned and included all revisions in the Apache SVN
> repo, DRAT is not running in real time. We are running DRAT on the NSF
> Super Computer Wrangler, which has a petabyte of flash storage and the
> ability to stand up Hadoop and Spark clusters. We are also working on a
> paper describing our results.
>
> Please send feedback to myself (Karanjeet Singh <ka...@usc.edu>),
> Professor Mattmann <ma...@usc.edu> and/or irds-L@mymaillists.usc.edu.
>
> Thanks & Regards,
> Karanjeet Singh
> C.S. Graduate Student
> University of Southern California
> karanjes@usc.edu | +1-213-675-9583
>