You are viewing a plain text version of this content. The canonical link for it is here.
Posted to commits@any23.apache.org by si...@apache.org on 2012/01/12 15:08:36 UTC
svn commit: r1230547 - /incubator/any23/site/src/site/apt/
Author: simonetripodi
Date: Thu Jan 12 14:08:34 2012
New Revision: 1230547
URL: http://svn.apache.org/viewvc?rev=1230547&view=rev
Log:
s/deri/apache/
Modified:
incubator/any23/site/src/site/apt/any23-plugins.apt
incubator/any23/site/src/site/apt/configuration.apt
incubator/any23/site/src/site/apt/dev-csv-extractor.apt
incubator/any23/site/src/site/apt/dev-data-conversion.apt
incubator/any23/site/src/site/apt/dev-data-extraction.apt
incubator/any23/site/src/site/apt/dev-microformat-extractors.apt
incubator/any23/site/src/site/apt/dev-validation-fix.apt
incubator/any23/site/src/site/apt/dev-xpath-extractor.apt
incubator/any23/site/src/site/apt/extractors.apt
incubator/any23/site/src/site/apt/getting-started.apt
incubator/any23/site/src/site/apt/plugin-office-scraper.apt
Modified: incubator/any23/site/src/site/apt/any23-plugins.apt
URL: http://svn.apache.org/viewvc/incubator/any23/site/src/site/apt/any23-plugins.apt?rev=1230547&r1=1230546&r2=1230547&view=diff
==============================================================================
--- incubator/any23/site/src/site/apt/any23-plugins.apt (original)
+++ incubator/any23/site/src/site/apt/any23-plugins.apt Thu Jan 12 14:08:34 2012
@@ -9,9 +9,9 @@ Any23 Plugins
A plugin is a standard <Maven3> module containing any implementation of
- * {{{./xref/org/deri/any23/plugin/ExtractorPlugin.html}ExtractorPlugin}}
+ * {{{./xref/org/apache/any23/plugin/ExtractorPlugin.html}ExtractorPlugin}}
- * {{{./xref/org/deri/any23/cli/Tool.html}Tool}}
+ * {{{./xref/org/apache/any23/cli/Tool.html}Tool}}
* How to Register a Plugin
@@ -28,7 +28,7 @@ Any23 Plugins
TODO: plugin support in Any23 Service
Any implementation of <ExtractorPlugin> will automatically registered to the
- {{{./xref/org/deri/any23/extractor/ExtractorRegistry.html}ExtractorRegistry}}.
+ {{{./xref/org/apache/any23/extractor/ExtractorRegistry.html}ExtractorRegistry}}.
Any detected implementation of <Tool> will be listed by the <ToolRunner>
command-line tool in <any23-root/><<bin/any23tools>> .
@@ -46,7 +46,7 @@ Any23 Plugins
An <Extractor Plugin> is a class:
- * implementing the {{{./xref/org/deri/any23/plugin/ExtractorPlugin.html}ExtractorPlugin}} interface;
+ * implementing the {{{./xref/org/apache/any23/plugin/ExtractorPlugin.html}ExtractorPlugin}} interface;
* packaged under <<org.deri.any23.plugin>> .
@@ -79,7 +79,7 @@ Any23 Plugins
An <Tool Plugin> is a class:
- * implementing the {{{./xref/org/deri/any23/cli/Tool.html}Tool}} interface;
+ * implementing the {{{./xref/org/apache/any23/cli/Tool.html}Tool}} interface;
* packaged under <<org.deri.any23.cli>> .
@@ -127,7 +127,7 @@ public class Version implements Tool {
* Crawler CLI Tool
- The {{{./xref/org/deri/any23/cli/Crawler.html}Crawler CLI Tool}} is an extension of the
- {{{./xref/org/deri/any23/cli/Rover.html}Rover CLI Tool}} to add site crawling basic
+ The {{{./xref/org/apache/any23/cli/Crawler.html}Crawler CLI Tool}} is an extension of the
+ {{{./xref/org/apache/any23/cli/Rover.html}Rover CLI Tool}} to add site crawling basic
capabilities. More information about the <CLI> can be found at
{{{./getting-started.html#crawler-tool}Getting Started - Crawler Tool}} section.
Modified: incubator/any23/site/src/site/apt/configuration.apt
URL: http://svn.apache.org/viewvc/incubator/any23/site/src/site/apt/configuration.apt?rev=1230547&r1=1230546&r2=1230547&view=diff
==============================================================================
--- incubator/any23/site/src/site/apt/configuration.apt (original)
+++ incubator/any23/site/src/site/apt/configuration.apt Thu Jan 12 14:08:34 2012
@@ -5,7 +5,7 @@ Configuration
The core module contains the main library code and the command-line implementation.
The main library configuration parameters are managed by the
- {{{./xref/org/deri/any23/configuration/DefaultConfiguration.html} Configuration}}
+ {{{./xref/org/apache/any23/configuration/DefaultConfiguration.html} Configuration}}
class. The default values are declared within the {{{http://any23.googlecode.com/svn/trunk/any23-core/src/main/resources/default-configuration.properties} default-configuration.properties}}
file. The following sections explain how to override the default configuration.
@@ -30,7 +30,7 @@ any23-core/bin/$ ANY23_OPTS="-Dany23.htt
** Override Default Configuration Programmatically
- The {{{./xref/org/deri/any23/configuration/Configuration.html} Configuration}}
+ The {{{./xref/org/apache/any23/configuration/Configuration.html} Configuration}}
properties can be accessed in read-only mode just retrieving the configuration <<singleton>> instance.\
Such instance is <immutable>:
@@ -40,7 +40,7 @@ final String propertyValue = immutableCo
...
+----------------------------------------------------------------------------------------------
- To obtain a <modifiable> {{{./xref/org/deri/any23/configuration/Configuration.html} Configuration}}
+ To obtain a <modifiable> {{{./xref/org/apache/any23/configuration/Configuration.html} Configuration}}
instead it is possible to use the <<copy()>> method.\
One of the <<Any23>> constructors accepts a <<Configuration>> object that allows to customize the behavior
of the <<Any23>> instance for its entire life-cycle.
@@ -55,7 +55,7 @@ final Any23 any23 = new Any23(modifiable
* Use of ExtractionParameters
It is possible to customize the behavior of a single data extraction by providing an
- {{{./xref/org/deri/any23/extractor/ExtractionParameters.html} ExtractionParameters}}
+ {{{./xref/org/apache/any23/extractor/ExtractionParameters.html} ExtractionParameters}}
instance to one the <Any23#extract()> methods accepting it. <<ExtractionParameters>> allows to customize any <property> and <flag>
other then the <<specific extraction options>>.\
If no custom parameters are specified the default configuration values are used.
Modified: incubator/any23/site/src/site/apt/dev-csv-extractor.apt
URL: http://svn.apache.org/viewvc/incubator/any23/site/src/site/apt/dev-csv-extractor.apt?rev=1230547&r1=1230546&r2=1230547&view=diff
==============================================================================
--- incubator/any23/site/src/site/apt/dev-csv-extractor.apt (original)
+++ incubator/any23/site/src/site/apt/dev-csv-extractor.apt Thu Jan 12 14:08:34 2012
@@ -1,6 +1,6 @@
CSV Extractor Algorithm
- The {{{./xref/org/deri/any23/extractor/csv/CSVExtractor.html}CSV Extractor}} produces
+ The {{{./xref/org/apache/any23/extractor/csv/CSVExtractor.html}CSV Extractor}} produces
an RDF representation of a CSV file compliant with the {{{http://www.ietf.org/rfc/rfc4180.txt}RFC 4180}}
and that foresees an header.
Such extractor relies on the presence of an header to use the named fields as RDF properties.
Modified: incubator/any23/site/src/site/apt/dev-data-conversion.apt
URL: http://svn.apache.org/viewvc/incubator/any23/site/src/site/apt/dev-data-conversion.apt?rev=1230547&r1=1230546&r2=1230547&view=diff
==============================================================================
--- incubator/any23/site/src/site/apt/dev-data-conversion.apt (original)
+++ incubator/any23/site/src/site/apt/dev-data-conversion.apt Thu Jan 12 14:08:34 2012
@@ -25,35 +25,35 @@ Data Conversion
useful for the transformation. The facade constructor accepts a list of extractor names, if specified
the extraction will be done only over this list, otherwise the data <MIME Type> will detected and will be applied
all the compatible extractors declared within the
- {{{./xref/org/deri/any23/extractor/ExtractorRegistry.html}ExtractorRegistry}}.
+ {{{./xref/org/apache/any23/extractor/ExtractorRegistry.html}ExtractorRegistry}}.
The <<line 2>> defines the input string containing some {{{http://www.w3.org/TeamSubmission/turtle/}Turtle}} data.
- At <<line 3>> we instantiate a {{{./xref/org/deri/any23/source/StringDocumentSource.html}StringDocumentSource}},
+ At <<line 3>> we instantiate a {{{./xref/org/apache/any23/source/StringDocumentSource.html}StringDocumentSource}},
specifying a content and a the source <URI>.
The <URI> should be the source of the content data, and must be valid.
- Besides the {{{./xref/org/deri/any23/source/StringDocumentSource.html}StringDocumentSource}},
+ Besides the {{{./xref/org/apache/any23/source/StringDocumentSource.html}StringDocumentSource}},
you can also provide input from other sources, such as <HTTP> requests
- and local files. See the classes in the sources {{{./xref/org/deri/any23/source/package-summary.html}package}}.
+ and local files. See the classes in the sources {{{./xref/org/apache/any23/source/package-summary.html}package}}.
The <<line 4>> defines a buffered output stream that will be used to store the data produced by the
writer declared at <<line 5>>.
A writer stores the extracted triples in some destination.
- We use an {{{./xref/org/deri/any23/writer/NTriplesWriter.html}NTriplesWriter}} here that writes
+ We use an {{{./xref/org/apache/any23/writer/NTriplesWriter.html}NTriplesWriter}} here that writes
into a <<ByteArrayOutputStream>>. The main <<RDF>> formats writers are available and it is possible also to store
the triples directly into a <<Sesame>> repository to query them via <<SPARQL>>.
- See {{{./xref/org/deri/any23/writer/RepositoryWriter.html}RepositoryWriter}} and the writer
- {{{./xref/org/deri/any23/writer/package-summary.html}package}}.
+ See {{{./xref/org/apache/any23/writer/RepositoryWriter.html}RepositoryWriter}} and the writer
+ {{{./xref/org/apache/any23/writer/package-summary.html}package}}.
The extractor method invoked at <<line 6>> performs the metadata extraction.
- This method accepts as first argument a {{{./xref/org/deri/any23/source/DocumentSource.html}DocumentSource}} and as
- second argument a {{{./xref/org/deri/any23/writer/TripleHandler.html}TripleHandler}},
+ This method accepts as first argument a {{{./xref/org/apache/any23/source/DocumentSource.html}DocumentSource}} and as
+ second argument a {{{./xref/org/apache/any23/writer/TripleHandler.html}TripleHandler}},
that will receive the sequence parsing events generated by the applied extractors. The extract method defines also
another signature where it is possible to specify a charset encoding for the input data. If <<null>>, the charset
will be auto detected.
- The {{{./xref/org/deri/any23/writer/TripleHandler.html}TripleHandler}} needs to be explicitly closed,
+ The {{{./xref/org/apache/any23/writer/TripleHandler.html}TripleHandler}} needs to be explicitly closed,
this is done safely in a <<finally>> block at <<line 7>>.
The expected output is <UTF-8> encoded at <<line 8>>:
Modified: incubator/any23/site/src/site/apt/dev-data-extraction.apt
URL: http://svn.apache.org/viewvc/incubator/any23/site/src/site/apt/dev-data-extraction.apt?rev=1230547&r1=1230546&r2=1230547&view=diff
==============================================================================
--- incubator/any23/site/src/site/apt/dev-data-extraction.apt (original)
+++ incubator/any23/site/src/site/apt/dev-data-extraction.apt Thu Jan 12 14:08:34 2012
@@ -23,21 +23,21 @@ Data Extraction
the usage of specific extractors.
The <<line 2>> defines the <HTTP User Agent>, used to identify the client during <HTTP> data collection.
- At <<line 3>> we use the runner to create an instance of {{{./xref/org/deri/any23/http/HTTPClient.html}HTTPClient}},
- used by {{{./xref/org/deri/any23/source/HTTPDocumentSource.html}HTTPDocumentSource}} for <HTTP> content fetching.
+ At <<line 3>> we use the runner to create an instance of {{{./xref/org/apache/any23/http/HTTPClient.html}HTTPClient}},
+ used by {{{./xref/org/apache/any23/source/HTTPDocumentSource.html}HTTPDocumentSource}} for <HTTP> content fetching.
- The <<line 4>> instantiates an {{{./xref/org/deri/any23/source/HTTPDocumentSource.html}HTTPDocumentSource}} instance,
- specifying the {{{./xref/org/deri/any23/http/HTTPClient.html}HTTPClient}} and the URL addressing the content
+ The <<line 4>> instantiates an {{{./xref/org/apache/any23/source/HTTPDocumentSource.html}HTTPDocumentSource}} instance,
+ specifying the {{{./xref/org/apache/any23/http/HTTPClient.html}HTTPClient}} and the URL addressing the content
to be processed.
At <<line 5>> we define a buffered output stream used to store data produced by the
- {{{./xref/org/deri/any23/writer/TripleHandler.html}TripleHandler}} defined at <<line 6>>.
+ {{{./xref/org/apache/any23/writer/TripleHandler.html}TripleHandler}} defined at <<line 6>>.
The extraction method at <<line 7>> will run the metadata extraction.
The produced metadata will be written within the passed
- {{{./xref/org/deri/any23/writer/TripleHandler.html}TripleHandler}} instance.
+ {{{./xref/org/apache/any23/writer/TripleHandler.html}TripleHandler}} instance.
- The {{{./xref/org/deri/any23/writer/TripleHandler.html}TripleHandler}} needs to be explicitly closed,
+ The {{{./xref/org/apache/any23/writer/TripleHandler.html}TripleHandler}} needs to be explicitly closed,
this is done safely in a <<finally>> block at <<line 8>>.
The expected output is <UTF-8> encoded at <<line 9>> and is:
@@ -74,11 +74,11 @@ Filter Out Accidental Triples
To remove accidental triples <<Any23>> provides a set of useful filters, located
within the <<org.deri.any23.filter>> package.
- The filter {{{./xref/org/deri/any23/filter/IgnoreTitlesOfEmptyDocuments.html}IgnoreTitlesOfEmptyDocuments}}
- removes triples generated by the {{{./xref/org/deri/any23/extractor/html/TitleExtractor.html}TitleExtractor}}
+ The filter {{{./xref/org/apache/any23/filter/IgnoreTitlesOfEmptyDocuments.html}IgnoreTitlesOfEmptyDocuments}}
+ removes triples generated by the {{{./xref/org/apache/any23/extractor/html/TitleExtractor.html}TitleExtractor}}
whether the document is empty.
- The filter {{{./xref/org/deri/any23/filter/IgnoreAccidentalRDFa.html}IgnoreAccidentalRDFa}} removes accidental
+ The filter {{{./xref/org/apache/any23/filter/IgnoreAccidentalRDFa.html}IgnoreAccidentalRDFa}} removes accidental
<<CSS>> related triples.
+------------------------------------
Modified: incubator/any23/site/src/site/apt/dev-microformat-extractors.apt
URL: http://svn.apache.org/viewvc/incubator/any23/site/src/site/apt/dev-microformat-extractors.apt?rev=1230547&r1=1230546&r2=1230547&view=diff
==============================================================================
--- incubator/any23/site/src/site/apt/dev-microformat-extractors.apt (original)
+++ incubator/any23/site/src/site/apt/dev-microformat-extractors.apt Thu Jan 12 14:08:34 2012
@@ -18,7 +18,7 @@ Microformat Extractors
More specifically:
* Embedding explicitly the logic within the
- {{{./xref/org/deri/any23/extractor/html/package-summary.html}Microformats Extractors}}
+ {{{./xref/org/apache/any23/extractor/html/package-summary.html}Microformats Extractors}}
* Using the default <<Any23>> nesting feature.
@@ -38,7 +38,7 @@ Microformat Extractors
</span>
+----------------------------------------------------------------------------------------------
- Since, as shown below, the {{{./xref/org/deri/any23/extractor/html/HCardExtractor.html}HCardExtractor}}
+ Since, as shown below, the {{{./xref/org/apache/any23/extractor/html/HCardExtractor.html}HCardExtractor}}
contains the code to handle nested hAddress,
+------------------------------
@@ -79,12 +79,12 @@ private boolean addSubMicroformat(String
+-----------------------------------------------------------------------------------------------------
It is higly recommended to decorate the extractors who natively handle the nesting relatioship using the
- {{{./xref/org/deri/any23/extractor/html/annotations/Includes.html}@Includes}} annotation. This annotation,
+ {{{./xref/org/apache/any23/extractor/html/annotations/Includes.html}@Includes}} annotation. This annotation,
if present, avoid the production of <nesting_original> and <nesting_structured> RDF statements.
- The following example shows how the {{{./xref/org/deri/any23/extractor/html/annotations/Includes.html}@Includes}} annotation
- could be used to claim the fact that {{{./xref/org/deri/any23/extractor/html/HCardExtractor.html}HCardExtractor}} natively
- embedds the {{{./xref/org/deri/any23/extractor/html/AdrExtractor.html}AdrExtractor}}.
+ The following example shows how the {{{./xref/org/apache/any23/extractor/html/annotations/Includes.html}@Includes}} annotation
+ could be used to claim the fact that {{{./xref/org/apache/any23/extractor/html/HCardExtractor.html}HCardExtractor}} natively
+ embedds the {{{./xref/org/apache/any23/extractor/html/AdrExtractor.html}AdrExtractor}}.
+----------------------------------------------------------------------------------------------
@Includes( extractors = AdrExtractor.class )
Modified: incubator/any23/site/src/site/apt/dev-validation-fix.apt
URL: http://svn.apache.org/viewvc/incubator/any23/site/src/site/apt/dev-validation-fix.apt?rev=1230547&r1=1230546&r2=1230547&view=diff
==============================================================================
--- incubator/any23/site/src/site/apt/dev-validation-fix.apt (original)
+++ incubator/any23/site/src/site/apt/dev-validation-fix.apt Thu Jan 12 14:08:34 2012
@@ -13,9 +13,9 @@ Validation and Fixing
This pages describes the <<Any23>> rule-based approach, that allows it to detect, fix and correctly extract
RDF from those ill-formed RDFa in XHTML pages.
- More specifically, <<Any23>> allows you to write a {{{./xref/org/deri/any23/validator/Rule.html}Rule}}
- able to detect the errors, a {{{./xref/org/deri/any23/validator/Fix.html}Fix}} containing the logic to fix the problem and a
- {{{./xref/org/deri/any23/validator/Validator.html}Validator}} which acts as a register of rules and fixes. The Validator
+ More specifically, <<Any23>> allows you to write a {{{./xref/org/apache/any23/validator/Rule.html}Rule}}
+ able to detect the errors, a {{{./xref/org/apache/any23/validator/Fix.html}Fix}} containing the logic to fix the problem and a
+ {{{./xref/org/apache/any23/validator/Validator.html}Validator}} which acts as a register of rules and fixes. The Validator
calls all the registered rules and when one of them is applied it calls the associated Fix.
The following code snipped shows how to programmatically detect and fix a very common data error with <<Any23>>.
@@ -42,8 +42,8 @@ Validation and Fixing
</div>
+------------------------------------------------------------------------------------------
- With the <<Any23>> {{{./xref/org/deri/any23/validator/package-summary.html}Validator}} classes it's possible to solve this
- problem simply implementing the {{{./xref/org/deri/any23/validator/Rule.html}Rule}} interface as described below:
+ With the <<Any23>> {{{./xref/org/apache/any23/validator/package-summary.html}Validator}} classes it's possible to solve this
+ problem simply implementing the {{{./xref/org/apache/any23/validator/Rule.html}Rule}} interface as described below:
+------------------------------------------------------------------------------------------
public class MissingOpenGraphNamespaceRule implements Rule {
@@ -78,7 +78,7 @@ public class MissingOpenGraphNamespaceRu
}
+------------------------------------------------------------------------------------------
- The {{{./xref/org/deri/any23/validator/rule/MissingOpenGraphNamespaceRule.html}MissingOpenGraphNamespaceRule}} inspects the DOM
+ The {{{./xref/org/apache/any23/validator/rule/MissingOpenGraphNamespaceRule.html}MissingOpenGraphNamespaceRule}} inspects the DOM
structure of the HTML page and if it finds some META tags with some RDFa property (of the OpenGraph Protocol vocabulary, in this case)
it looks for the declaration of that name space. If there is no declaration it return <<true>>, that means that an error has been detected
within the document.
Modified: incubator/any23/site/src/site/apt/dev-xpath-extractor.apt
URL: http://svn.apache.org/viewvc/incubator/any23/site/src/site/apt/dev-xpath-extractor.apt?rev=1230547&r1=1230546&r2=1230547&view=diff
==============================================================================
--- incubator/any23/site/src/site/apt/dev-xpath-extractor.apt (original)
+++ incubator/any23/site/src/site/apt/dev-xpath-extractor.apt Thu Jan 12 14:08:34 2012
@@ -6,4 +6,4 @@ XPath Extractor
activated by a regular expression over the page URL.
When an extraction rule is activated all the variables it defines are
evaluated and then a NQuads template is expanded for generating statements.
- See {{{./xref/org/deri/any23/extractor/xpath/package-summary.html}Javadoc}}.
+ See {{{./xref/org/apache/any23/extractor/xpath/package-summary.html}Javadoc}}.
Modified: incubator/any23/site/src/site/apt/extractors.apt
URL: http://svn.apache.org/viewvc/incubator/any23/site/src/site/apt/extractors.apt?rev=1230547&r1=1230546&r2=1230547&view=diff
==============================================================================
--- incubator/any23/site/src/site/apt/extractors.apt (original)
+++ incubator/any23/site/src/site/apt/extractors.apt Thu Jan 12 14:08:34 2012
@@ -1,6 +1,6 @@
Any23 Extractors
- This page enlists all the Any23 Extractors (see source code {{{./xref/org/deri/any23/extractor/package-summary.html}package}}).
+ This page enlists all the Any23 Extractors (see source code {{{./xref/org/apache/any23/extractor/package-summary.html}package}}).
* Microformat Extractors
@@ -9,68 +9,68 @@ Any23 Extractors
Specific details about *Microformats* extractors can be found {{{./dev-microformat-extractors.html}here}}.
In particular the *Microformats Nesting* representation policy is described {{{./dev-microformat-extractors.html#microformat-nesting}here}}.
- {{{./xref/org/deri/any23/extractor/html/AdrExtractor.html}AdrExtractor}}
+ {{{./xref/org/apache/any23/extractor/html/AdrExtractor.html}AdrExtractor}}
- {{{./xref/org/deri/any23/extractor/html/GeoExtractor.html}GeoExtractor}}
+ {{{./xref/org/apache/any23/extractor/html/GeoExtractor.html}GeoExtractor}}
- {{{./xref/org/deri/any23/extractor/html/HCalendarExtractor.html}HCalendar}}
+ {{{./xref/org/apache/any23/extractor/html/HCalendarExtractor.html}HCalendar}}
- {{{./xref/org/deri/any23/extractor/html/HCardExtractor.html}HCard}}
+ {{{./xref/org/apache/any23/extractor/html/HCardExtractor.html}HCard}}
- {{{./xref/org/deri/any23/extractor/html/HListingExtractor.html}HListing}}
+ {{{./xref/org/apache/any23/extractor/html/HListingExtractor.html}HListing}}
- {{{./xref/org/deri/any23/extractor/html/HResumeExtractor.html}HResume}}
+ {{{./xref/org/apache/any23/extractor/html/HResumeExtractor.html}HResume}}
- {{{./xref/org/deri/any23/extractor/html/HReviewExtractor.html}HReview}}
+ {{{./xref/org/apache/any23/extractor/html/HReviewExtractor.html}HReview}}
- {{{./xref/org/deri/any23/extractor/html/SpeciesExtractor.html}SpeciesExtractor}}
+ {{{./xref/org/apache/any23/extractor/html/SpeciesExtractor.html}SpeciesExtractor}}
- {{{./xref/org/deri/any23/extractor/html/LicenseExtractor.html}LicenseExtractor}}
+ {{{./xref/org/apache/any23/extractor/html/LicenseExtractor.html}LicenseExtractor}}
- {{{./xref/org/deri/any23/extractor/html/XFNExtractor.html}XFNExtractor}}
+ {{{./xref/org/apache/any23/extractor/html/XFNExtractor.html}XFNExtractor}}
- {{{./xref/org/deri/any23/extractor/html/HRecipeExtractor.html}HRecipeExtractor}}
+ {{{./xref/org/apache/any23/extractor/html/HRecipeExtractor.html}HRecipeExtractor}}
* RDFa [1.0 , 1.1]
The following extractors refer to the {{{http://www.w3.org/TR/rdfa-syntax/}RDFa 1.0}}
and {{{http://www.w3.org/TR/rdfa-core/}RDFa 1.1}} specifications.
- {{{./xref/org/deri/any23/extractor/rdfa/RDFaExtractor.html}RDFaExtractor}}
+ {{{./xref/org/apache/any23/extractor/rdfa/RDFaExtractor.html}RDFaExtractor}}
* Microdata
The following extractors refer to the {{{http://dev.w3.org/html5/md/}Microdata specifications}}.
- {{{./xref/org/deri/any23/extractor/microdata/MicrodataExtractor.html}MicrodataExtractor}}
+ {{{./xref/org/apache/any23/extractor/microdata/MicrodataExtractor.html}MicrodataExtractor}}
* RDF
- {{{./xref/org/deri/any23/extractor/rdf/RDFXMLExtractor.html}RDFXMLExtractor}}
+ {{{./xref/org/apache/any23/extractor/rdf/RDFXMLExtractor.html}RDFXMLExtractor}}
- {{{./xref/org/deri/any23/extractor/rdf/NQuadsExtractor.html}NQuadsExtractor}}
+ {{{./xref/org/apache/any23/extractor/rdf/NQuadsExtractor.html}NQuadsExtractor}}
- {{{./xref/org/deri/any23/extractor/rdf/TurtleExtractor.html}TurtleExtractor}}
+ {{{./xref/org/apache/any23/extractor/rdf/TurtleExtractor.html}TurtleExtractor}}
- {{{./xref/org/deri/any23/extractor/rdf/NTriplesExtractor.html}NTriplesExtractor}}
+ {{{./xref/org/apache/any23/extractor/rdf/NTriplesExtractor.html}NTriplesExtractor}}
* Metadata Extractors
- {{{./xref/org/deri/any23/extractor/html/TitleExtractor.html}TitleExtractor}}
+ {{{./xref/org/apache/any23/extractor/html/TitleExtractor.html}TitleExtractor}}
- {{{./xref/org/deri/any23/extractor/html/HTMLMetaExtractor.html}HTMLMetaExtractor}}
+ {{{./xref/org/apache/any23/extractor/html/HTMLMetaExtractor.html}HTMLMetaExtractor}}
- {{{./xref/org/deri/any23/extractor/html/HeadLinkExtractor.html}HeadLinkExtractor}}
+ {{{./xref/org/apache/any23/extractor/html/HeadLinkExtractor.html}HeadLinkExtractor}}
- {{{./xref/org/deri/any23/extractor/html/ICBMExtractor.html}ICBMExtractor}}
+ {{{./xref/org/apache/any23/extractor/html/ICBMExtractor.html}ICBMExtractor}}
- {{{./xref/org/deri/any23/extractor/html/TurtleHTMLExtractor.html}TurtleHTMLExtractor}}
+ {{{./xref/org/apache/any23/extractor/html/TurtleHTMLExtractor.html}TurtleHTMLExtractor}}
* Content Extractors
- {{{./xref/org/deri/any23/extractor/xpath/XPathExtractor.html}XPath Extractor}} (<<Experimental>>)
+ {{{./xref/org/apache/any23/extractor/xpath/XPathExtractor.html}XPath Extractor}} (<<Experimental>>)
- {{{./xref/org/deri/any23/extractor/csv/CSVExtractor.html}CSV Extractor}} (See the extraction {{{./dev-csv-extractor.html}algorithm}}.)
+ {{{./xref/org/apache/any23/extractor/csv/CSVExtractor.html}CSV Extractor}} (See the extraction {{{./dev-csv-extractor.html}algorithm}}.)
Get more documentation
Modified: incubator/any23/site/src/site/apt/getting-started.apt
URL: http://svn.apache.org/viewvc/incubator/any23/site/src/site/apt/getting-started.apt?rev=1230547&r1=1230546&r2=1230547&view=diff
==============================================================================
--- incubator/any23/site/src/site/apt/getting-started.apt (original)
+++ incubator/any23/site/src/site/apt/getting-started.apt Thu Jan 12 14:08:34 2012
@@ -289,7 +289,7 @@ any23-service/bin$ ./any23server
from the command line in order to start up the server, then go to {{{http://localhost:8080/}}}
to access the web interface. A live demo version of such service is running at {{{http://any23.org/}}}.
You can also start the server from Java by running the
- {{{./xref/org/deri/any23/servlet/Servlet.html}Any23 Servlet}} class. Maven can be used to create a WAR
+ {{{./xref/org/apache/any23/servlet/Servlet.html}Any23 Servlet}} class. Maven can be used to create a WAR
file for deployment into an existing servlet container such as {{{http://tomcat.apache.org/}Apache Tomcat}}.
* Use <<Any23>> as a Library
Modified: incubator/any23/site/src/site/apt/plugin-office-scraper.apt
URL: http://svn.apache.org/viewvc/incubator/any23/site/src/site/apt/plugin-office-scraper.apt?rev=1230547&r1=1230546&r2=1230547&view=diff
==============================================================================
--- incubator/any23/site/src/site/apt/plugin-office-scraper.apt (original)
+++ incubator/any23/site/src/site/apt/plugin-office-scraper.apt Thu Jan 12 14:08:34 2012
@@ -2,7 +2,7 @@ Office Scraper Plugins
* <Excel Plugin>
- The {{{./xref/org/deri/any23/plugin/officescraper/ExcelPlugin.html}ExcelPlugin}} converts any
+ The {{{./xref/org/apache/any23/plugin/officescraper/ExcelPlugin.html}ExcelPlugin}} converts any
<<Microsoft Excel>> <97-2007> document to <RDF>.
<<TODO: add conversion schema.>>