Posted to commits@community.apache.org by hb...@apache.org on 2015/06/21 12:48:01 UTC

svn commit: r1686701 - in /comdev/projects.apache.org: STRUCTURE.txt scripts/README.txt scripts/import/parsepmcs.py

Author: hboutemy
Date: Sun Jun 21 10:48:01 2015
New Revision: 1686701

URL: http://svn.apache.org/r1686701
Log:
updated documentation

Modified:
    comdev/projects.apache.org/STRUCTURE.txt
    comdev/projects.apache.org/scripts/README.txt
    comdev/projects.apache.org/scripts/import/parsepmcs.py

Modified: comdev/projects.apache.org/STRUCTURE.txt
URL: http://svn.apache.org/viewvc/comdev/projects.apache.org/STRUCTURE.txt?rev=1686701&r1=1686700&r2=1686701&view=diff
==============================================================================
--- comdev/projects.apache.org/STRUCTURE.txt (original)
+++ comdev/projects.apache.org/STRUCTURE.txt Sun Jun 21 10:48:01 2015
@@ -18,15 +18,22 @@ Layout of comdev/projects.apache.org:
 /site/json/projects:
         - Contains project-specific data extracted from projects' DOAP files.
 
+/site/doap:
+        - Contains the equivalent DOAP data storage: 1 directory per committee,
+          containing pmc.rdf generated rdf (by parsecommittees.py) + pmc-doap.rdf copied (by parsepmcs.py)
+          + copied DOAP files (by rdfparse.py) from projects managed by the PMC
+
 Suggested cron setup:
     scripts/cronjobs/parsechairs.py - daily
     scripts/cronjobs/parsecommitters.py - daily/hourly (whatever we need/want)
     scripts/cronjobs/podlings.py - daily
     scripts/cronjobs/countaccounts.py - weekly
+    scripts/cronjobs/parsereleases.py - daily
 
 Stuff to run manually when needed:
     scripts/import/parsecommittees.py - requires committee-info.txt to be present
-    scripts/import/addpmc.py - manual run whenever a new PMC is founded
+    scripts/import/parsepmcs.py
+    scripts/import/rdfparse.py
 
 Webserver required:
 To test the site locally, a webserver is required or you'll get

Modified: comdev/projects.apache.org/scripts/README.txt
URL: http://svn.apache.org/viewvc/comdev/projects.apache.org/scripts/README.txt?rev=1686701&r1=1686700&r2=1686701&view=diff
==============================================================================
--- comdev/projects.apache.org/scripts/README.txt (original)
+++ comdev/projects.apache.org/scripts/README.txt Sun Jun 21 10:48:01 2015
@@ -4,41 +4,43 @@ various sources:
 1. updating data (cronjobs)
 
 - countaccounts.py: Extract from LDAP monthly statistics on Unix accounts created
-  in: foundation/accounts-evolution.json + ldapsearch
-  out: foundation/accounts-evolution.json
+  in: json/foundation/accounts-evolution.json + ldapsearch
+  out: json/foundation/accounts-evolution.json
 
 - parsechairs.py: Fetches current VPs from the foundation website.
   in: http://www.apache.org/foundation/
-  out: foundation/chairs.json
+  out: json/foundation/chairs.json
 
 - parsecommitters.py: Fetches and parses the committer (LDAP) list via
   people.apache.org.
   in: http://people.apache.org/committer-index.html
-  out: foundation/people.json + foundation/groups.json
+  out: json/foundation/people.json + json/foundation/groups.json
   List of committers with reference to groups (people.json) and groups with corresponding committers (groups.json)
 
 - podlings.py: Reads podlings.xml from the incubator site and creates a JSON
   with timeline data, as well as current podling projects information.
   in: http://incubator.apache.org/podlings.xml
-  out: foundation/podlings.json + foundation/podlings-evolution.json
+  out: json/foundation/podlings.json + json/foundation/podlings-evolution.json
   Current list of podlings (podlings.json) and monthly statistics on podlings (podlings-evolution.json)
 
 - parsereleases.py
   in: http://www.apache.org/dist/
-  out: foundation/releases.json
+  out: json/foundation/releases.json
 
 
 2. importing data (import)
 
 - parsecommittees.py: Parses committee-info.txt to detect new and retired committees
-  in: foundation/committees.json + foundation/committees-retired.json + committee-info.txt (https://svn.apache.org/repos/private/committers/board/committee-info.txt)
-  out: foundation/committees.json + foundation/committees-retired.json
+  in: json/foundation/committees.json + json/foundation/committees-retired.json
+      + committee-info.txt (https://svn.apache.org/repos/private/committers/board/committee-info.txt)
+      + https://svn.apache.org/repos/asf/infrastructure/site-tools/trunk/projects/pmc_list.xml + PMC data .rdf files
+  out: json/foundation/committees.json + json/foundation/committees-retired.json + doap/{committeeId}/pmc.rdf
 
 - parsepmcs.py: imports PMC data (RDF) from the old projects.apache.org site.
   in: https://svn.apache.org/repos/asf/infrastructure/site-tools/trunk/projects/pmc_list.xml + PMC data .rdf files
-  out: foundation/pmcs.json
+  out: json/foundation/pmcs.json + doap/{committeeId}/pmc-doap.rdf
 
 - rdfparse.py: Parses existing RDF(DOAP) files from the old projects.a.o and
   turns them into JSON objects.
   in: https://svn.apache.org/repos/asf/infrastructure/site-tools/trunk/projects/files.xml + projects' DOAP files
-  out: projects/*.json + foundation/projects.json
+  out: json/projects/*.json + json/foundation/projects.json

Modified: comdev/projects.apache.org/scripts/import/parsepmcs.py
URL: http://svn.apache.org/viewvc/comdev/projects.apache.org/scripts/import/parsepmcs.py?rev=1686701&r1=1686700&r2=1686701&view=diff
==============================================================================
--- comdev/projects.apache.org/scripts/import/parsepmcs.py (original)
+++ comdev/projects.apache.org/scripts/import/parsepmcs.py Sun Jun 21 10:48:01 2015
@@ -1,3 +1,5 @@
+# TODO: merge into parsecommittees.py
+
 from xml.dom import minidom
 import xml.etree.ElementTree as ET
 import re, urllib.request