25 datasets found

Filter Results
  • SNB dataset

    The LDBC-SNB Data Generator (DATAGEN) is the responsible of providing the data sets used by all the LDBC benchmarks. This data generator is designed to produce directed labeled...
  • LIVED - Long Device Level Energy Data

    LIVED Data Set Description LIVED stands for Long Device Level Energy Data and contains measurements collected from smart plugs multi-sensors as depicted. The data has been...
  • Production Industry - Printing Machine

    The dataset provides log data from production industry, in particular from offset printing machines. The log data are simulated by mimicking algorithms. An offset printing...
  • IT Industry - Software Events

    The dataset provides log data from IT industry, in particular the logging of events of a software suite for operational IT service management. At different levels of logged data...
  • BENGAL

    This family of datasets for named entity recognition, entity disambiguation and relation extraction are generated automatically out of RDF data using natural language generation...
  • Weidmüller Energy and Injection Molding Data Set

    Weidmüller Energy and Injection Molding Data Set General Description This data set consists of simulated data using based on real measurements. The sensor measurements in the...
  • TWIG

    Dataset generator for Twitter data based on the Twitter7 dataset.
  • Linked Software Dependencies

    Access 475,000+ npm JavaScript libraries as 150,000,000+ RDF triples using TPF, HDT or Turtle
  • Linked Connections

    Linked Connections Mimicking algorithm: https://github.com/PoDiGG/podigg-lc Linked Connections is a method for publishing transit data using a low-cost API. It does this by...
  • GeoNames

    GeoNames covers all countries and contains over 8 million placenames
  • DBPedia

    DBpedia is a crowd-sourced community effort to extract structured information from Wikipedia and make this information available on the Web. DBpedia allows you to ask...
  • Linked Open Vocabularies

    LOV stands for Linked Open Vocabularies. This name is derived from LOD, standing for Linked Open Data. Let's assume that the reader is somehow familiar with the latter concept,...
  • Virtual International Authority File

    The VIAF® (Virtual International Authority File) combines multiple name authority files into a single OCLC-hosted name authority service. The goal of the service is to lower the...
  • TLC Trip Record Data

    This dataset includes trip records from all trips completed in yellow and green taxis in NYC in 2014 and select months of 2015. Records include fields capturing pick-up and...
  • RISIS

    Research Infrastructure for research and innovation policy studies
  • Energy Map Germany

    CSV data of development of solar energy within Germany with installation date, location, nominal capacity, gps information - 1.5 Mio entries
  • BioASQ

    A challenge in large-scale biomedical semantic indexing and question answering
  • Next Bike

    Live information of GPS position of arround 20.0000 bicicles in about 70 cities (http://www.nextbike.net/ - bike rental)
  • CER Smart Metering Project

    The Smart Metering Electricity Customer Behaviour Trials (CBTs) took place during 2009 and 2010 with over 5,000 Irish homes and businesses participating. The purpose of the...
  • GitHub Data

    GitHub is how people build software and is home to the largest community of open source developers in the world, with over 12 million people contributing to 31 million projects...
You can also access this registry using the API (see API Docs).