LinkedData

Revision as of 16:52, 22 December 2008 by 140.147.236.194 (Talk)

Revision as of 16:52, 22 December 2008 by 140.147.236.194 (Talk)

Introduction

The Web is increasingly understood as a global information space consisting not just of linked documents, but also of Linked Data. The Linked Data principles provide a basis for realizing this Web of Data, or Semantic Web. Since early 2007 numerous data sets have been published on the Web according to these principles, in domains as broad as music, books, geographical information, films, people, events, reviews and photos. In combination these data sets consist of over 2 billion RDF triples, interlinked by more than 3 million triples that cross data sets. As this Web of Linked Data continues to grow, and an increasing number of applications are developed that exploit these data sets, there is a growing need for data publishers, researchers, developers and Web practitioners to understand Linked Data principles and practice. This tutorial will address those needs, and provide participants with a solid foundation from which to begin publishing Linked Data on the Web, as well as to implement applications that consume Linked Data from the Web. The tutorial will also draw heavily from material used workshops at previous Linked Data workshops at the W3C and the International Semantic Web Conference.

Aims and Overview

The aim of this tutorial is to provide participants with a detailed conceptual understanding of how to publish Linked Data on the Web, and a gentle introduction to the practical and technical steps that make up the publishing process. In addition the tutorial will cover best practices in topics such as minting URIs for published data sets, vocabulary selection, choosing what RDF data to expose, and interlinking distributed data sets. Specific patterns will be presented for publishing different forms of data set, such as data from static files, relational databases and existing Web APIs. Participants will also be shown how to debug published data.

The second focus of the tutorial will be applications that consume Linked Data from the Web. We will give an overview about existing Linked Data browsers, Web of Data search engines as well as Linked Data Mashups and cover the existing software frameworks that can be used to build applications on top of the Data Web.

Lastly the tutorial will be to provide a collaborative space to explore putting some of the ideas of linked data into practice. The idea is that people can split off into groups, or work independently on adding linked data support to an existing application, exploring modeling issues of what vocabularies to use for particular data sets, and brain storming about new vocabularies that may be needed in the library world.

The tutorial will combine presentations by the tutors with demonstrations, interactive sessions, and group discussion. Other than a broad technical understanding of the Web development and Web publishing process, and a basic conceptual understanding of the Semantic Web, there are no pre-requisites for participation in the tutorial, which will be of interest to the full spectrum of code4lib2009 attendees, including researchers, developers, data managers/publishers and those seeking to comercially exploit the Semantic Web by publishing Linked Data.

Content, Approach and Schedule

The tutorial will be based primarily on material from the How to Publish Linked Data on the Web online tutorial, the key resource in the field of Linked Data publishing, supplemented by practical examples to illustrate key issues and give participants hands-on experience of creating Linked Data.

Program Outline

  • Introduction
    • The Web of Data
    • Linked Data Principles
  • Publishing Linked Data on the Web
    • Practical Task: creating a description of a book with RDF
    • Minting URIs and Content Negotiation
    • Vocabulary Selection
    • Manual and Automatic Interlinking
    • Publishing SPARQL Endpoints
  • Consuming Linked Data from the Web
    • Linked Data Browsers
    • Linked Data Search Engines
    • Linked Data Mashups
    • Libraries for implementing your own applications
  • Breakout Sessions
    • Break out into groups or work independently on putting some of the ideas into practice
    • Talk about publishing some dataset you are familiar with as linked data
    • Hack linked data support into an existing application you have
    • Explore the use of existing vocabularies for library data, or brainstorm about new ones
  • Review, Conclusions and Outlook
    • Results of breakout sessions
    • Linked Data prospects in the year ahead
    • Upcoming directions and challenges for linked data in the library/archives space

Participants

  • Ed Summers (Library of Congress)
  • Mike Giarlo (Library of Congress)
  • Jay Luker (Ex Libris)
  • Mark Diggory (DSpace)
  • Ross Singer (Talis)
  • Corey Harper (NYU)
  • Dan Chudnov (hopefully)