LIMES

http://aksw.org/Projects/LIMES an entity of type: AlumniProject

LIMES is a link discovery framework for the Web of Data. It implements time-efficient approaches for large-scale link discovery based on the characteristics of metric spaces. It is easily configurable via a configuration file as well as through a graphical user interface. LIMES can be downloaded as standalone tool for carrying out link discovery or as a Java library.
xsd:string

General Overview

LIMES implements novel time-efficient approaches for link discovery in metric spaces. Our approaches facilitate different approximation techniques to compute estimates of the similarity between instances. These estimates are then used to filter out a large amount of those instance pairs that do not suffice the mapping conditions. By these means, LIMES can reduce the number of comparisons needed during the mapping process by several orders of magnitude. The approaches implemented in LIMES include the original LIMES original LIMES algorithm for edit distances, HR3, HYPPO, and ORCHID. Additionally, LIMES supports the first planning technique for link discovery HELIOS , that minimizes the overall execution of a link specification, without any loss of completeness. Moreover, LIMES implements supervised and unsupervised machine-learning algorithms for finding accurate link specifications. The algorithms implemented here include the supervised, active and unsupervised versions of EAGLE and WOMBAT.

Architecture

The LIMES framework consists of eight main modules of which each can be extended to accommodate new or improved functionality. The central modules of LIMES is the controller module, which coordinates the matching process. The matching process is carried out as follows: First, the controller calls the configuration module, which reads the configuration file and extracts all the information necessary to carry out the comparison of instances, including the URL of the SPARQL-endpoints of the knowledge bases S (source) and T(target), the restrictions on the instances to map (e.g., their type), the expression of the metric to be used and the threshold to be used.

Given that the configuration file is valid w.r.t. the LIMES Specification Language (LSL), the query module is called. This module uses the configuration for the target and source knowledge bases to retrieve instances and properties from the SPARQL-endpoints of the source and target knowledge bases that adhere to the restrictions specified in the configuration file. The query module writes its output into a file by invoking the cache module. Once all instances have been stored in the cache, the controller chooses between performing Link Discovery or Machine Learning. For Link Discovery, LIMES will re-write, plan and execute the Link Specification (LS) included in the configuration file, by calling the rewriter, planner and engine modules resp. The main goal of LD is to identify the set of links (mapping) that satisfy the conditions opposed by the input LS. For Machine Learning, LIMES calls the machine learning algorithm included in the configuration file, to identify an appropriate LS to link S and T. Then it proceeds in executing the LS. For both taks, the mapping will be stored in the output file choosen by the user in the configuration file. The results are finally stored into a RDF or a XML file.

Evaluation Results

The algorithms implemented in LIMES were published in several papers. Below are links to evaluation results.

Running LIMES

  • Download the LIMES package (includes a user manual) and run it locally on your server
  • You can either execute LIMES using the graphical interface or run LIMES via the command line as a Java executable package.
sysont:Markdown
Dr. Axel-C. Ngonga Ngomo

inverse relations

9 resources BOA
DEER
DEQA
GeoLift
QROWD
SAGE
SAKE
SCMS
SLIPO
9 resources Dr. Axel-C. Ngonga Ngomo
Daniel Obraczka
Prof. Dr. Jens Lehmann
Kevin Dressler
Klaus Lyko
Kleanthi Georgala
Dr. Matthias Wauer
Mofeed Hassan
Dr. Mohamed Ahmed Sherif
1 resources Tommaso Soru
by (Editors: ) [Bibsonomy of ]
5.219992ms