Third International Workshop on Consuming Linked Data (COLD2012)

November 12, 2012

Boston, USA

[News] - [Important Dates] - [Objectives] - [Program] - [Topics] - [Submissions] - [Proceedings] - [Organization] - [Committees] - [Contact] - [History]

Abstract

The quantity of published Linked Data is increasing dramatically. However, applications that consume Linked Data are not yet widespread. Current approaches lack methods for seamless integration of Linked Data from multiple sources, dynamic discovery of available data and data sources, provenance and information quality assessment, application development environments, and appropriate end user interfaces. Addressing these issues requires well-founded research, including the development and investigation of concepts that can be applied in systems which consume Linked Data from the Web. Following the success of the 1st International Workshop on Consuming Linked Data, we organize the second edition of this workshop in order to provide a platform for discussion and work on these open research problems. The main objective is to provide a venue for scientific discourse — including systematic analysis and rigorous evaluation — of concepts, algorithms and approaches for consuming Linked Data.

News

2012-11-07: Linked Data Gathering will be at The Rattlesnake Bar, just 3 minutes walking from the conference hotel. Please RSVP here.
2012-11-02: The program has been published.
2012-09-21: The workshop proceedings are online as CEUR-WS.org Vol-905.
2012-08-27: Bart van Leeuwen and Evan Sandhaus will be Invited Speakers.
2012-08-24: The list of accepted papers is now online.
2012-07-26: The deadline has been extended. Abstracts due July 31. Papers due August 4.
2012-05-15: The workshop has been accepted for ISWC 2012.
2012-06-15: Call for Papers finalised.

Important Dates

Abstract submission deadline: July 31, 2012, 23.59 Hawaii time
Paper submission deadline: August 4, 2012, 23.59 Hawaii time
Acceptance notification: August 24, 2012
Camera-ready versions of accepted papers: September 10, 2012
Workshop date: November 12, 2012

Accepted Papers

A Heuristic-Based Approach for Planning Federated SPARQL Queries (Gabriela Montoya, Maria-Esther Vidal and Maribel Acosta)
Analyses of RDF Triples in Sample Datasets (Jakub Starka, Martin Svoboda and Irena Mlynkova)
Extending the WebID Protocol with Access Delegation (Sebastian Tramp, Henry Story, Andrei Sambra, Philipp Frischmuth, Michael Martin and Sören Auer)
Integrating Linked Metadata Repositories into the Web of Data (Gofran Shukair, Nikolaos Loutas and Vassilios Peristeras)
Learning from the History of Distributed Query Processing - A Heretic View on Linked Data Management (Heiko Betz, Francis Gropengießer, Katja Hose and Kai-Uwe Sattler)
Licenses Compatibility and Composition in the Web of Data (Serena Villata and Fabien Gandon)
MapXplore: Linked Data in the App Store (Csaba Veres)
Producing and Consuming Linked Open Data on Art with a Local Community (Fuyuko Matsumura, Iwao Kobayashi, Fumihiro Kato, Tetsuro Kamura, Ikki Ohmukai and Hideaki Takeda)
QB4OLAP: A Vocabulary for OLAP Cubes on the Semantic Web (Lorena Etcheverry and Alejandro A. Vaisman)
Spamming in Linked Data (Ali Hasnain, Mustafa Al-Bakri, Luca Costabello, Zijie Cong, Ian Davis and Tom Heath)
The Callimachus Project: RDFa as a Web Template Language (Steve Battle, David Wood, James Leigh and Luke Ruth)

Objectives

The term Linked Data refers to a practice for publishing and interlinking structured data on the Web. Since the practice has been proposed in 2006, a grass-roots movement has started to publish and to interlink multiple open databases on the Web following the Linked Data principles. Due to conference workshops, tutorials, and general evangelism an increasing number of data publishers such as the BBC, Thomson Reuters, The New York Times, the Library of Congress, and the UK and US governments have adopted Linked Data principles. The ongoing effort resulted in bootstrapping the Web of Data which, today, comprises billions of RDF triples including millions of links between data sources. The published datasets include data about books, movies, music, radio and television programs, reviews, scientific publications, genes, proteins, medicine, and clinical trials, geographic locations, people, companies, statistical and census data, etc.

Access to Linked Data presents exciting opportunities for the next generation of Web-based applications: data from different providers can be aggregated and fragmentary information from multiple sources can be integrated to achieve a more comprehensive view. While a few applications, such as the BBC music guide have used Linked Data to significant benefit, the deployment methodology has been to harvest the data of interest from the Web to create a private, disconnected repository for each specific application. Such an approach can only be the beginning; new concepts to consume Linked Data are required in order to exploit the Web of Linked Data to its full potential. The concepts, patterns and tools necessary are very different from situations when resource identifiers are local or known a-priori, whole-repository queries are possible, access to the repository is reliable and relevant data sources are known to be trustworthy.

Several open issues that make the development of Linked Data based applications a challenging or still impossible task. These issues include the lack of approaches for seamless integration of Linked Data from multiple sources, for dynamic, on-the-fly discovery of available data, for information quality assessment, and for elaborate end user interfaces. These open issues can only be addressed appropriately when they are conceived as research problems that require the development and systematic investigation of novel approaches. The International Workshop on Consuming Linked Data (COLD) aims to provide a platform for the presentation and discussion of such approaches. Our main objective is to receive submissions that present scientific discussion (including systematic evaluation) of concepts and approaches, instead of exposition of features implemented in Linked Data based applications. For practical systems without formalization or evaluation we refer interested participants to other offerings at ISWC, such as the Semantic Web Challenge or the Demo Track. As such, we see our workshop as orthogonal to these events.

Program

9:00-9:10: Workshop Introduction
09:10 - 10:10: Keynote: Real-time Emergency Response Using Semantic Web Technology (Bart van Leeuwen - netage.nl and Fire Fighter at Fire-department Amsterdam-Amstelland )
Abstract: The incidents that Fire Fighters are being dispatched to are by nature unpredictable. This means that their information demand will change from occasion to occasion and timely as well. Netage.nl started developing and deploying small scale Semantic Technology based solutions at Fire Department Amsterdam-Amstelland. The agile nature of the Semantic Web allowed to create simple solutions to answer the questions that were really asked by the operational personnel. Today more than 15 Fire Stations in the greater Amsterdam area use real time Linked Open Data to supply their Fire Fighters with information. Started with small operational questions, development is slowly moving towards meeting the national paradigm shift on public fire safety. This endeavor has not been unnoticed and resulted in national and international partnerships to promote and implement the ideas outside Amsterdam as well.

Bio: Bart van Leeuwen has been the owner of netage.nl for 16 years. He has a lot of experience in "outside the box" thinking to help his customers get to the right solution. For Bart, technology is never the answer to business questions. Technology is an enabler, and should be treated as such. He has a lot of experience with Data management solutions like Lotus Notes, DB2, Postgres and their integration tools. Bart is also a professional fire fighter at Fire-department Amsterdam-Amstelland where his field experience combined with technological background resulted in ground breaking innovations on operational information delivery.

Querying

10:10 - 10:30: Learning from the History of Distributed Query Processing - A Heretic View on Linked Data Management (Heiko Betz, Francis Gropengießer, Katja Hose and Kai-Uwe Sattler)
10:30 - 11:00 BREAK
11:00 - 11:20: A Heuristic-Based Approach for Planning Federated SPARQL Queries (Gabriela Montoya, Maria-Esther Vidal and Maribel Acosta)

Dataset Analysis

11:20 - 11:40: Analyses of RDF Triples in Sample Datasets (Jakub Starka, Martin Svoboda and Irena Mlynkova)
11:40 - 12:00: Spamming in Linked Data (Ali Hasnain, Mustafa Al-Bakri, Luca Costabello, Zijie Cong, Ian Davis and Tom Heath)

Authentication and Licenses

12:00 - 12:20: Extending the WebID Protocol with Access Delegation (Sebastian Tramp, Henry Story, Andrei Sambra, Philipp Frischmuth, Michael Martin and Sören Auer)
12:20 - 14:00 LUNCH
14:00 - 14:20: Licenses Compatibility and Composition in the Web of Data (Serena Villata and Fabien Gandon)

Linked Data Applications

14:20 - 14:40: Producing and Consuming Linked Open Data on Art with a Local Community (Fuyuko Matsumura, Iwao Kobayashi, Fumihiro Kato, Tetsuro Kamura, Ikki Ohmukai and Hideaki Takeda)
14:40 - 15:00: MapXplore: Linked Data in the App Store (Csaba Veres)
15:00 - 15:20: The Callimachus Project: RDFa as a Web Template Language (Steve Battle, David Wood, James Leigh and Luke Ruth)
15:20 - 16:00 BREAK
16:00 - 16:50: Keynote: Linked Data at The New York Times: The First 161 Years (Evan Sandhaus - New York Times)
Abstract: The New York Times committment to Linked Data began over 160 years ago.
Starting in 1851, The New York Times has always catalogued its archival articles using a controlled vocabulary of people, places, organizations and descriptors. In 2009 The New York Times started publishing this vocabulary as linked data using semantic web standards. In 2011 The Times announced the launch of several RESTful Semantic APIs. And in late 2012 and early 2013, The Times will migrate its entire process for vocabulary management to a system designed around the principles of Linked Data.
In my remarks, I will survey the history of Semantic publishing at The New York Times, outline our semantic strategy, detail the business-case for linked data at The Times and provide an in-depth explanation of our new vocabulary management system.

Bio: Evan Sandhaus is The Lead Architect for Semantic Platforms at The New York Times Company. In his six years with The Times, Mr. Sandhaus has directed strategy and technology for The New York Times Linked Open Data Initiative; developed a semantic technology for identifying key concepts in large text datasets; engineered a patented system for purging template text from Web content; and collaborated with The Linguistic Data Consortium to release and promote The New York Times Annotated Corpus, a collection of 1.8 million richly annotated Times articles published from 1987

Vocabularies

16:50 - 17:10: QB4OLAP: A Vocabulary for OLAP Cubes on the Semantic Web (Lorena Etcheverry and Alejandro A. Vaisman)
17:10 - 17:30: Integrating Linked Metadata Repositories into the Web of Data (Gofran Shukair, Nikolaos Loutas and Vassilios Peristeras)

17:30 - 17:45: Closing Remarks
20:00 - ... : Linked Data Gathering at The Rattlesnake Bar, just 3 minutes walking from the conference hotel. Please RSVP here.

Topics of Interest

Relevant topics for COLD 2012 include but are not limited to:

Live Linked Data (i.e., algorithms and applications that make use of Linked Data at runtime)
Architectures for consuming Linked Data (e.g., Dataspaces)
Handling additional web data (e.g., microformats, microdata, schema.org, APIs, JSON, Open Graph Protocol, Twitter Cards...)
Web scale data management (indexing, crawling, etc.)
Query processing over multiple linked datasets
Search in the Web of Data
Auto-discovery of URIs and data
Caching and replication
Dataset dynamics
Reasoning on Linked Data from multiple sources
Information quality and trustworthiness of Linked Data
User interface research for the interaction with the Web of Data

Submissions

We seek novel technical research papers in the context of consuming Linked Data with a length of up to 12 pages.

Paper submissions must be formatted in the style of the Springer Publications format for Lecture Notes in Computer Science (LNCS).

Please submit your paper via EasyChair at http://www.easychair.org/conferences/?conf=cold2012

Submissions that do not comply with the formatting of LNCS or that exceed the page limit will be rejected without review.

We note that the author list does not need to be anonymized, as we do not have a double-blind review process in place.

Submissions will be peer reviewed by three independent reviewers. Accepted papers have to be presented at the workshop proceedings.

Proceedings

The workshop proceedings are online as CEUR-WS.org Vol-905.

Workshop Organization

The workshop will be co-located with the 11th International Semantic Web Conference (ISWC) in Boston, USA, and will be held on November, 2012.

The workshop will also consist of:

Opening session: This will permit introduction of the workshop topics, goals, participants, and expected outcomes.
Keynote speaker: Bart van Leeuwen and Evan Sandhaus
Research Track: Accepted research papers will be presented at the workshop.
Communication: Networked communication will be encouraged during the workshop using IRC, microblogging and other services, provided with the official hashtag (#cold2012) to follow the live-stream of the event.

Organizing Committee

Programme Committee

Jose Luis Ambite, University of Southern California, USA
Cosmin Basca, University of Zurich, Switzerland
Christian Bizer, Freie Universität Berlin, Germany
Gong Cheng, Nanjing University, China
Oscar Corcho, Universidad Politecnica de Madrid, Spain
Richard Cyganiak, DERI, Ireland
Aba-Sah Dadzie, University of Sheffield, UK
Christina Feilmayr, Johannes Kepler University of Linz, Austria
Yolanda Gil, University of Southern California, USA
Hugh Glaser, University of Southampton, UK
Claudio Gutierrez, Universidad de Chile, Chile
Michael Hausenblas, DERI, Ireland
Tom Heath, Talis, UK
Ralf Heese, Freie Universität Berlin, Germany
Ivan Herman, W3C
Katja Hose, Max-Planck-Institut für Informatik, Germany
Hak-Lae Kim, Samsung R&D, Korea
Pablo Mendes, Freie Universität Berlin, Germany
Giuseppe Pirro, Free University of Bolzano, Italy
Axel Polleres, Siemens AG Österreich, Austria
Kai-Uwe Sattler, TU Illmenau, Germany
Matthew Rowe, Open University, UK
Bernhard Schandl, University of Vienna, Austria
Sebastian Speiser, Karlsruhe Institute of Technology (KIT), Germany
Raphael Troncy, EURECOM, France
Boris Villazon-Terrazas, Universidad Politecnica de Madrid, Spain
Jun Zhao, University of Oxford, UK

Contact

For further information about the workshop, please contact the workshops chairs at cold.org.ws@googlemail.com

History

COLD 2012 is the third edition of the Consuming Linked Data workshop series. The second edition was COLD 2011, and the second one COLD 2010.

Acknowledgements

The workshop is partly supported by the PlanetData project.