PREFIX schema: PREFIX dbpedia: PREFIX dc: PREFIX v: PREFIX sysont: PREFIX sh: PREFIX skos: PREFIX ov: PREFIX rdf: PREFIX vann: PREFIX geo: PREFIX projects: PREFIX people: PREFIX xsd: PREFIX rdfs: PREFIX foaf: PREFIX owl: PREFIX projectfunding: PREFIX dbr: PREFIX doap: PREFIX lod2: PREFIX void: PREFIX aiiso: PREFIX site: PREFIX dct: PREFIX sioc: PREFIX groups: PREFIX partner: PREFIX aksw: PREFIX sioct: PREFIX dcterms: PREFIX content: PREFIX : projects:REX a aksw:AlumniProject; aksw:hookline "Web-Scale Extension of RDF Knowledge Bases"; aksw:publicationTag "rex"; site:content "Introduction\n============\n\nThe **Web RDF Extraction Framework**, **REX**, addresses the problem of extracting RDF data from templated websites. To this end, REX provide a generic architecture that allows learning XPath wrappers from unlabelled Web pages using knowledge from the Linked Open Data Cloud. REX is to be regarded as a skeleton that is to be fleshed out for your purposes. Still, REX is also a running system as it provides [running implementations](https://github.com/AKSW/REX/wiki \"wiki\") for all of its interfaces.\n\nIn contrast to existing frameworks to RDF extraction using XPath wrappers, REX provides a consistency layer which ensure that the new knowledge extracted is logically consistent with the knowledge already available in the input knowledge base. This website gives an overview of the framework. All technical details can be found on the [Github page's wiki](https://github.com/AKSW/REX/wiki \"Wiki\"). There you will also find:\n\n- The [Java documentation](http://aksw.github.io/REX/ \"Javadocs\") for the coders out there.\n- A [manual](https://github.com/AKSW/REX/wiki \"Manual\") to help you run the framework before you customize it for your purposes.\n- A [ticket system](https://github.com/AKSW/REX/issues \"Tickets\") in case you find some bugs or have some feature request.\n\nArchitecture\n============\n\n $\"The$ \n\nTo facilitate the implementation of extraction processes, the framework provides the four layer-architecture shown in Figure 1. The data for the extraction is first to be gathered from the Web (or any other source of your choice). To this end, interfaces are provided. Each of the modules in each of the layers is provided as an interface. Moreover, an initial implementation of each interface is provided (see [Java Docs](http://aksw.github.io/REX/ \"Javadocs\")).\n\n- The **extraction layer** allows for gathering data from the Web and consists of two modules: The [crawler](https://github.com/AKSW/REX/tree/master/src/main/java/org/aksw/rex/crawler \"Crawler\") gathers website content from the Web while the [domain identifier](https://github.com/AKSW/REX/tree/master/src/main/java/org/aksw/rex/domainidentifier \"Domain identifier\") helps detecting web site domains that contain information pertaining to a given property.\n- The **storage layer** provides interfaces for managing and storing structured data as well as unstructured data. \n- The **induction layer** contains all modules that allow to learn XPath expressions. The core module here is the [XPath Learner](https://github.com/AKSW/REX/blob/master/src/main/java/org/aksw/rex/xpath/XPathLearner.java \"Code\").\n- The **generation layer** allows integration approaches for generating and validating RDF data. The default generator relies on [AGDISTIS](http://aksw.org/projects/AGDISTIS \"AGDISTIS\") and [ORE](http://dl-learner.org \"ORE\").\n\nEvaluation\n==========\nWith REX, we also aimed to provide a baseline system for the extraction of RDF from templated websites. Thus, in addition to providing at least one implementation for all the interfaces, we also evaluated the basic REX. The data we used for the evaluation can be found here. \n\nWhat next?\n=========\nThere are several things you can do.\n\n1. Run REX: Simply follow the [steps in the manual](https://github.com/AKSW/REX/wiki/4---Run \"Run REX\").\n2. Extend REX: Please check out the [installation instructured](https://github.com/AKSW/REX/wiki/3---Installing \"Manual\").\n3. Point out bugs: Please use the [issue tracker](https://github.com/AKSW/REX/issues \"Issues\").\n\nNow you're on. Please extend REX and help improving the extraction of RDF from the Web."^^sysont:Markdown; dct:abstract "REX is an RDF extraction framework for Web data that can learn XPath wrappers from unlabelled Web pages using knowledge from the Linked Open Data Cloud."; doap:browse ; doap:bug-database ; doap:description ; doap:maintainer people:AxelNgonga; doap:programming-language "Java"; doap:wiki ; rdfs:label "REX"; skos:altLabel "Web RDF Extraction Framework"; foaf:logo .