Apache Anything To Triples (Any23) 是一个库、web服务和命令行工具用来从微格式、RDF、为数据、RDF/XML、Turtle、N-Tripes 和 Quards)中抽取结构化数据。
Apache Any23 1.0 正式发布,当前支持的输入格式如下:
* RDF/XML, Turtle, Notation 3
* RDFa with RDFa1.1 prefix mechanism
* Microformats: Adr, Geo, hCalendar, hCard, hListing, hRecipe, hReview,
License, XFN and Species
* HTML5 Microdata: (such as Schema.org)
* JSON-LD: JSON for Linking Data. a lightweight Linked Data format based
on the already successful JSON format and provides a way to help JSON data
interoperate at Web-scale.
* CSV: Comma Separated Values with separator autodetection.
* Vocabularies: Extraction support for CSV, Dublin Core Terms, Description
of a Career, Description Of A Project, Friend Of A Friend, GEO Names, ICAL,
lkif-core, Open Graph Protocol, BBC Programmes Ontology, RDF Review
Vocabulary, schema.org, VCard, BBC Wildlife Ontology and XHTML.
项目主页:http://s.apache.org/Ull
下载地址:http://any23.apache.org/download.html
来自:开源中国社区