You are viewing a plain text version of this content. The canonical link for it is here.
Posted to user@any23.apache.org by lewis john mcgibbney <le...@apache.org> on 2018/03/23 21:18:10 UTC
[ANNOUNCE] Apache Any23 2.2
The Apache Any23 Team is pleased to announce the release of Apache Any23
2.2.
*What is Any23?*
Anything To Triples (Any23) is a library, a web service and a command line
tool that extracts structured data in RDF format from a variety of Web
documents. Currently it supports the following input formats:
- RDF/XML <http://www.w3.org/TR/REC-rdf-syntax/>, Turtle
<http://www.w3.org/TeamSubmission/turtle/>, Notation 3
<http://www.w3.org/DesignIssues/Notation3>
- RDFa <http://www.w3.org/TR/xhtml-rdfa-primer/> with RDFa1.1 prefix
mechanism
<http://www.w3.org/TR/2010/WD-rdfa-core-20100422/#scoping-of-prefix-mappings>
- Microformats1 <http://microformats.org/> and Microformats2
<http://microformats.org/wiki/microformats-2>: hAdr, hCard, hCalendar,
hEntry, hEvent, hGeo, hItem, hListing, hProduct, hProduct, hRecipie,
hResume, hReview, License, Species, XFN, etc
- JSON-LD <http://json-ld.org/>: JSON for Linking Data. a lightweight
Linked Data format based on the already successful JSON format and provides
a way to help JSON data interoperate at Web-scale.
- HTML5 Microdata <http://dev.w3.org/html5/md/>: (such as Schema.org
<http://schema.org>)
- CSV <http://www.ietf.org/rfc/rfc4180.txt>: Comma Separated Values with
separator autodetection.
- Vocabularies: Extraction support for Dublin Core Terms
<http://dublincore.org/>, Description of a Career
<http://www.w3.org/wiki/DescriptionOfACareerVocabulary>, Description Of
A Project <https://github.com/edumbill/doap/wiki>, Friend Of A Friend
<http://xmlns.com/foaf/spec/>, GEO Names
<http://www.geonames.org/ontology/>, ICAL
<http://www.w3.org/2002/12/cal/icaltzd#>, lkif-core
<https://github.com/RinkeHoekstra/lkif-core>, Open Graph Protocol
<http://ogp.me/>, BBC Programmes Ontology <http://purl.org/ontology/po/>,
RDF Review Vocabulary <http://vocab.org/review/terms.html>, schema.org,
VCard <http://www.w3.org/2006/vcard/ns>, BBC Wildlife Ontology
<http://purl.org/ontology/wo/> and XHTML
<http://www.w3.org/1999/xhtml/vocab/>... and more!
- YAML <http://www.yaml.org/>: human friendly data serialization
standard for all programming languages.
- Additionally, as of 2.1 Any23 provides functionality to extract
triples using the Open Information Extraction (Open IE) system
<https://github.com/allenai/openie-standalone>. The Open IE system runs
over sentences and creates extractions that represent relations in text, in
the case of Any23, this results in triples.
*Downloads*
http://any23.apache.org/download.html
*Release Notes:*
https://s.apache.org/YmRb
Have Fun,
Lewis, on behalf of the Apache Any23 Project Management Committee
--
http://home.apache.org/~lewismc/
http://people.apache.org/keys/committer/lewismc