Second International Workshop on Consuming Linked Data (COLD2011)

October 23, 2011

Bonn, Germany

[News] - [Important Dates] - [Objectives] - [Program] - [Topics] - [Submissions] - [Proceedings] - [Organization] - [Committees] - [Contact] - [History]

Abstract

The quantity of published Linked Data is increasing dramatically. However, applications that consume Linked Data are not yet widespread. Current approaches lack methods for seamless integration of Linked Data from multiple sources, dynamic discovery of available data and data sources, provenance and information quality assessment, application development environments, and appropriate end user interfaces. Addressing these issues requires well-founded research, including the development and investigation of concepts that can be applied in systems which consume Linked Data from the Web. Following the success of the 1st International Workshop on Consuming Linked Data, we organize the second edition of this workshop in order to provide a platform for discussion and work on these open research problems. The main objective is to provide a venue for scientific discourse — including systematic analysis and rigorous evaluation — of concepts, algorithms and approaches for consuming Linked Data.

News

2011-10-24: COLD 2011 was a big success. Read Juan's report. Thanks to all presenters, our program committee, and all participants!
2011-10-18: Daniel P. Miranker's keynote will be about "Removing the Potholes in the Road to the Semantic Web" (see our program for an abstract).
2011-09-27: The (tentative) program is now online.
2011-09-27: The workshop proceedings are online as CEUR-WS.org Vol-782.

Important Dates

Abstract submission deadline: August 15, 2011, 23.59 Hawaii time
Paper submission deadline: August 19, 2011, 23.59 Hawaii time (extended from August 15, 2011)
Acceptance notification: September 6, 2011
Camera-ready versions of accepted papers: September 14, 2011
Workshop date: October 23, 2011

Objectives

The term Linked Data refers to a practice for publishing and interlinking structured data on the Web. Since the practice has been proposed in 2006, a grass-roots movement has started to publish and to interlink multiple open databases on the Web following the Linked Data principles. Due to conference workshops, tutorials, and general evangelism an increasing number of data publishers such as the BBC, Thomson Reuters, The New York Times, the Library of Congress, and the UK and US governments have adopted Linked Data principles. The ongoing effort resulted in bootstrapping the Web of Data which, today, comprises billions of RDF triples including millions of links between data sources. The published datasets include data about books, movies, music, radio and television programs, reviews, scientific publications, genes, proteins, medicine, and clinical trials, geographic locations, people, companies, statistical and census data, etc.

Access to Linked Data presents exciting opportunities for the next generation of Web-based applications: data from different providers can be aggregated and fragmentary information from multiple sources can be integrated to achieve a more comprehensive view. While a few applications, such as the BBC music guide have used Linked Data to significant benefit, the deployment methodology has been to harvest the data of interest from the Web to create a private, disconnected repository for each specific application. Such an approach can only be the beginning; new concepts to consume Linked Data are required in order to exploit the Web of Linked Data to its full potential. The concepts, patterns and tools necessary are very different from situations when resource identifiers are local or known a-priori, whole-repository queries are possible, access to the repository is reliable and relevant data sources are known to be trustworthy.

Several open issues that make the development of Linked Data based applications a challenging or still impossible task. These issues include the lack of approaches for seamless integration of Linked Data from multiple sources, for dynamic, on-the-fly discovery of available data, for information quality assessment, and for elaborate end user interfaces. These open issues can only be addressed appropriately when they are conceived as research problems that require the development and systematic investigation of novel approaches. The International Workshop on Consuming Linked Data (COLD) aims to provide a platform for the presentation and discussion of such approaches. Our main objective is to receive submissions that present scientific discussion (including systematic evaluation) of concepts and approaches, instead of exposition of features implemented in Linked Data based applications. For practical systems without formalization or evaluation we refer interested participants to other offerings at ISWC, such as the Semantic Web Challenge or the Demo Track. As such, we see our workshop as orthogonal to these events.

Program

9:00-9:10: Workshop Introduction ( Olaf Hartig, Andreas Harth, Juan Sequeda )
9:10-10:10: Keynote: Removing the Potholes in the Road to the Semantic Web ( Daniel P. Miranker, The University of Texas at Austin, USA)
Abstract: The W3C published its first recommendation defining RDF in 1999. One context encouraging further development was that RDF was just the start of a standardized web technology that would enable unprecedented data integration. Twelve years later we have an ecosystem of interoperable ontology editors, triplestores, SPARQL implementations and so on. Yet the rate of commercial adoption of the technology is disappointing. The talk will detail places where the semantic web fails to relate to the market and steps we might take to smooth the road to the adoption of the Semantic Web.
Bio: Professor Miranker arrived at the University of Texas in 1986 after completing his Ph.D. in Computer Science at Columbia University and his B.S. in Mathematics at MIT. Professor Miranker's current work concerns applying the Semantic Web to solving relational and biological data integration problems. Particular projects span SPARQL execution on relational databases and data mining methods that automate data integration. Application subject matter includes both biodiversity and molecular databases.
Professor Miranker is perhaps best known for his early work on the TREAT algorithm; a forward-chaining rule evaluation method originally intended to support the parallel evaluation of rule systems against the contents of a database in a map-reduce like environment. He is coauthor of the 1998 PODS Best Paper, "A Lower Bound Theorem for Indexing Schemes and its Application to Multidimensional Range Queries". In 1999 he founded a dotcom that survived the bust. The company developed and sold semantic software used in the implementation of B2B marketplaces.
10:10-10:30: Consuming Linked Data within a Large Educational Organization ( Fouad Zablith, Mathieu D'Aquin, Stuart Brown, Liam Green-Hughes ) - view paper
10:30-11:00: Coffee Break
11:00-11:20: Crowdsourcing Tasks within Linked Data Management ( Elena Simperl, Barry Norton, Denny Vrandecic ) - view paper
11:20-11:40: Using Tag Clouds to Quickly Discover Patterns in Linked Data Sets ( Xingjian Zhang, Jeff Heflin ) - view paper
11:40-12:00: LODWheel - JavaScript-based Visualization of RDF Data ( Magnus Stuhr, Dumitru Roman, David Norheim ) - view paper
12:00-12:20: Identifying Information Needs by Modelling Collective Query Patterns ( Khadija Elbedweihy, Suvodeep Mazumdar, Amparo E. Cano, Stuart N. Wrigley, Fabio Ciravegna ) - view paper
12:20-14:00: Lunch Break
14:00-14:20: One Query to Bind Them All ( Daniel M. Herzig, Thanh Tran ) - view paper
14:20-14:40: Incremental SPARQL Evaluation for Query Answering on Linked Data ( Florian Schmedding ) - view paper
14:40-15:00: SPLENDID: SPARQL Endpoint Federation Exploiting VOID Descriptions ( Olaf Görlitz, Steffen Staab ) - view paper
15:00-15:15: LDIF - Linked Data Integration Framework ( Andreas Schultz, Andrea Matteini, Robert Isele, Christian Bizer, Christian Becker ) - view paper
15:15-15:30: The Information Workbench as a Self-Service Platform for Linked Data Applications ( Peter Haase, Michael Schmidt, Andreas Schwarte ) - view paper
15:30-16:00: Coffee Break
16:00-16:20: Extracting Core Knowledge from Linked Data ( Valentina Presutti, Lora Aroyo, Alessandro Adamou, Balthasar Schopman, Aldo Gangemi, Guus Schreiber ) - view paper
16:20-16:40: Defining and Executing Assessment Tests on Linked Data for Statistical Analysis ( Benjamin Zapilko, Brigitte Mathiak ) - view paper
16:40-16:55: A Research Agenda for Linked Closed Data ( Marcus Cobden, Jennifer Black, Nicholas Gibbins, Les Carr, Nigel Shadbolt ) - view paper
16:55-17:10: Towards Green Linked Data ( Julia Hoxha, Anisa Rula, Basil Ell ) - view paper
17:10-17:30: Closing Discussion
18:00-19:00: ISWC 2011 Linked Data-a-thon
19:30- ... : Linked Data Gathering (location: Bönnsch brewery restaurant)

Topics of Interest

Relevant topics for COLD 2011 include but are not limited to:

Web scale data management (indexing, crawling, etc.)
Query processing over multiple linked datasets
Search in the Web of Data
Auto-discovery
- of URIs,
- of additional data that is not from the authoritative source of a URI,
- of relevant linked datasets in general
Caching and replication
Dataset dynamics
- processing change notifications,
- keeping consistency,
- temporal tracking of linked datasets
Reasoning on Linked Data from multiple sources
Knowledge discovery deriving insights from the Web of Data
Information quality of Linked Data
- information quality assessment,
- trustworthiness,
- provenance
User interface research for the interaction with the Web of Data
- user interaction and usability,
- visualizing Linked Data,
- natural language interfaces

Submissions

We seek full technical research papers with a length of up to 12 pages. In addition to these full papers, researchers are invited to submit short vision papers and short systems/demo paper with a length of up to 6 pages, respectively; vision and systems/demo papers must be clearly marked as such.

Paper submissions must be formatted in the style of the Springer Publications format for Lecture Notes in Computer Science (LNCS).

Please submit your paper via EasyChair at http://www.easychair.org/conferences/?conf=cold2011

Submissions that do not comply with the formatting of LNCS or that exceed the page limit will be rejected without review.

We note that the author list does not need to be anonymized, as we do not have a double-blind review process in place.

Submissions will be peer reviewed by three independent reviewers. Accepted papers have to be presented at the workshop proceedings.

Proceedings

The workshop proceedings are online as CEUR-WS.org Vol-782.

Workshop Organization

The workshop will be co-located with the 10th International Semantic Web Conference (ISWC) in Bonn, Germany, and will be held on October 23, 2011.

To attend the workshop you have to register for the conference using the ISWC registration system.

The workshop will also consist of:

Opening session: This will permit introduction of the workshop topics, goals, participants, and expected outcomes.
Keynote speaker: Prof. Daniel P. Miranker (University of Texas at Austin, US) will give a kaynote at the workshop.
Research Track: Accepted research papers will be presented at the workshop.
Visions Track: Accepted visions papers will be presented at the workshop.
Systems/Demo Track: Accepted systems/demo papers will be presented at the workshop.
Communication: Networked communication will be encouraged during the workshop using IRC, microblogging and other services, provided with the official hashtag (#cold2011) to follow the live-stream of the event.

Organizing Committee

Programme Committee

Jose Luis Ambite, University of Southern California, USA
Cosmin Basca, University of Zurich, Suisse
Sean Bechhofer, University of Manchester, UK
Christian Bizer, Freie Universität Berlin, Germany
Gong Cheng, Nanjing University, China
Oscar Corcho, Universidad Politecnica de Madrid, Spain
Richard Cyganiak, DERI, Ireland
Li Ding, Rensselaer Polytechnic Institute, USA
Christina Feilmayr, Johannes Kepler University of Linz, Austria
Fabien Gandon, INRIA - Edelweiss, France
Yolanda Gil, University of Southern California, USA
Hugh Glaser, University of Southampton, UK
Paul Groth, Vrije Universiteit Amsterdam, Netherlands
Claudio Gutierrez, Universidad de Chile, Chile
Harry Halpin, University of Edinburgh, UK
Michael Hausenblas, DERI, Ireland
Tom Heath, Talis, UK
Ralf Heese, Freie Universität Berlin, Germany
Ivan Herman, W3C
Katja Hose, Max-Planck-Institut für Informatik, Germany
Hak-Lae Kim, Samsung R&D, Korea
Lalana Kagal, Massachusetts Institute of Technology, USA
Ian Millard, University of Southampton, UK
Alexandre Passant, DERI, Ireland
Axel Polleres, DERI, Ireland
Kai-Uwe Sattler, TU Illmenau, Germany
Bernhard Schandl, University of Vienna, Austria
Simon Schenk, University of Koblenz-Landau, Germany
Raphael Troncy, EURECOM, France
Boris Villazon-Terrazas, Universidad Politecnica de Madrid, Spain
Denny Vrandecic, Karlsruhe Institute of Technology, Germany

Contact

For further information about the workshop, please contact the workshops chairs at cold.org.ws@googlemail.com

History

COLD 2011 is the second edition of the Consuming Linked Data workshop series. The first edition was COLD 2010.

Acknowledgements

The workshop is partly supported by the PlanetData project.