Second International Workshop on Consuming Linked Data (COLD2011)
October 23, 2011
Bonn, Germany
Abstract
The quantity of published Linked Data is increasing
dramatically. However, applications that consume Linked Data are not
yet widespread. Current approaches lack methods for seamless integration
of Linked Data from multiple sources, dynamic discovery of
available data and data sources, provenance and information quality
assessment, application development environments, and appropriate
end user interfaces. Addressing these issues requires well-founded
research, including the development and investigation of concepts
that can be applied in systems which consume Linked Data from the
Web.
Following the success of the 1st International Workshop on Consuming Linked Data, we organize the second edition of this workshop in order to
provide a platform for discussion and work on these open research
problems. The main objective is to provide a venue for scientific
discourse — including systematic analysis and rigorous evaluation —
of concepts, algorithms and approaches for consuming Linked
Data.
News
- 2011-10-24: COLD 2011 was a big success. Read Juan's report. Thanks to all presenters, our program committee, and all participants!
- 2011-10-18: Daniel P. Miranker's keynote will be about "Removing the Potholes in the Road to the Semantic Web" (see our program for an abstract).
- 2011-09-27: The (tentative) program is now online.
- 2011-09-27: The workshop proceedings are online as CEUR-WS.org Vol-782.
Important Dates
- Abstract submission deadline: August 15, 2011, 23.59 Hawaii time
- Paper submission deadline: August 19, 2011, 23.59 Hawaii time (extended from August 15, 2011)
- Acceptance notification: September 6, 2011
- Camera-ready versions of accepted papers: September 14, 2011
- Workshop date: October 23, 2011
Objectives
The term Linked Data refers
to a practice for publishing and interlinking structured data on the Web. Since the practice has been proposed in
2006, a grass-roots movement has started to publish and to interlink
multiple open databases on the Web following the Linked Data principles. Due to
conference workshops, tutorials, and general evangelism an
increasing number of data publishers such as the
BBC, Thomson Reuters, The New York Times, the Library of Congress,
and the UK and US governments have adopted Linked Data principles.
The ongoing
effort resulted in bootstrapping the Web of Data which, today, comprises
billions of RDF triples including millions of links between data sources. The
published datasets include data about books,
movies, music, radio and television programs, reviews, scientific
publications, genes, proteins, medicine, and clinical trials,
geographic locations, people, companies, statistical and census
data, etc.
Access to Linked Data presents exciting
opportunities for the next generation of Web-based applications:
data from different providers can be aggregated and
fragmentary information from multiple sources can be integrated to
achieve a more comprehensive view. While a few applications, such as the
BBC music guide have used Linked Data to significant benefit, the deployment methodology
has been to harvest the data of interest from the
Web to create a private, disconnected repository for each specific
application. Such an approach can only be the beginning; new concepts
to consume Linked Data are
required in order to exploit the Web of Linked
Data to its full potential. The concepts,
patterns and tools necessary are very different from situations
when resource identifiers are local or known a-priori,
whole-repository queries are possible, access to the repository is
reliable and relevant data sources are known to
be trustworthy.
Several open issues that make the development of Linked Data based applications a
challenging or still impossible task. These issues include the lack
of approaches for seamless integration of Linked
Data from multiple sources, for dynamic,
on-the-fly discovery of available data, for
information quality assessment, and for elaborate end user
interfaces. These open issues can only be addressed appropriately
when they are conceived as research problems that require the
development and systematic investigation of novel approaches. The
International Workshop on Consuming Linked Data
(COLD) aims to provide a platform for the presentation and
discussion of such approaches. Our main objective is to receive
submissions that present scientific discussion (including
systematic evaluation) of concepts and approaches, instead of
exposition of features implemented in Linked
Data based applications. For practical systems
without formalization or evaluation we refer interested
participants to other offerings at ISWC, such as the Semantic Web
Challenge or the Demo Track. As such, we see our workshop as orthogonal to these events.
Program
- 9:00-9:10: Workshop Introduction ( Olaf Hartig, Andreas Harth, Juan Sequeda )
-
9:10-10:10: Keynote: Removing the Potholes in the Road to the Semantic Web (
Daniel P. Miranker, The University of Texas at Austin, USA)
Abstract: The W3C published its first recommendation defining RDF in 1999. One context encouraging further development was that RDF was just the start of a standardized web technology that would enable unprecedented data integration. Twelve years later we have an ecosystem of interoperable ontology editors, triplestores, SPARQL implementations and so on. Yet the rate of commercial adoption of the technology is disappointing. The talk will detail places where the semantic web fails to relate to the market and steps we might take to smooth the road to the adoption of the Semantic Web.
Bio: Professor Miranker arrived at the University of Texas in 1986 after completing his Ph.D. in Computer Science at Columbia University and his B.S. in Mathematics at MIT. Professor Miranker's current work concerns applying the Semantic Web to solving relational and biological data integration problems. Particular projects span SPARQL execution on relational databases and data mining methods that automate data integration. Application subject matter includes both biodiversity and molecular databases.
Professor Miranker is perhaps best known for his early work on the TREAT algorithm; a forward-chaining rule evaluation method originally intended to support the parallel evaluation of rule systems against the contents of a database in a map-reduce like environment. He is coauthor of the 1998 PODS Best Paper, "A Lower Bound Theorem for Indexing Schemes and its Application to Multidimensional Range Queries". In 1999 he founded a dotcom that survived the bust. The company developed and sold semantic software used in the implementation of B2B marketplaces.
-
10:10-10:30:
Consuming Linked Data within a Large Educational Organization
(
Fouad Zablith,
Mathieu D'Aquin,
Stuart Brown,
Liam Green-Hughes
)
- view paper
- 10:30-11:00: Coffee Break
-
11:00-11:20:
Crowdsourcing Tasks within Linked Data Management
(
Elena Simperl,
Barry Norton,
Denny Vrandecic
)
- view paper
-
11:20-11:40:
Using Tag Clouds to Quickly Discover Patterns in Linked Data Sets
(
Xingjian Zhang,
Jeff Heflin
)
- view paper
-
11:40-12:00:
LODWheel - JavaScript-based Visualization of RDF Data
(
Magnus Stuhr,
Dumitru Roman,
David Norheim
)
- view paper
-
12:00-12:20:
Identifying Information Needs by Modelling Collective Query Patterns
(
Khadija Elbedweihy,
Suvodeep Mazumdar,
Amparo E. Cano,
Stuart N. Wrigley,
Fabio Ciravegna
)
- view paper
- 12:20-14:00: Lunch Break
-
14:00-14:20:
One Query to Bind Them All
(
Daniel M. Herzig,
Thanh Tran
)
- view paper
-
14:20-14:40:
Incremental SPARQL Evaluation for Query Answering on Linked Data
(
Florian Schmedding
)
- view paper
-
14:40-15:00:
SPLENDID: SPARQL Endpoint Federation Exploiting VOID Descriptions
(
Olaf Görlitz,
Steffen Staab
)
- view paper
-
15:00-15:15:
LDIF - Linked Data Integration Framework
(
Andreas Schultz,
Andrea Matteini,
Robert Isele,
Christian Bizer,
Christian Becker
)
- view paper
-
15:15-15:30:
The Information Workbench as a Self-Service Platform for Linked Data Applications
(
Peter Haase,
Michael Schmidt,
Andreas Schwarte
)
- view paper
- 15:30-16:00: Coffee Break
-
16:00-16:20:
Extracting Core Knowledge from Linked Data
(
Valentina Presutti,
Lora Aroyo,
Alessandro Adamou,
Balthasar Schopman,
Aldo Gangemi,
Guus Schreiber
)
- view paper
-
16:20-16:40:
Defining and Executing Assessment Tests on Linked Data for Statistical Analysis
(
Benjamin Zapilko,
Brigitte Mathiak
)
- view paper
-
16:40-16:55:
A Research Agenda for Linked Closed Data
(
Marcus Cobden,
Jennifer Black,
Nicholas Gibbins,
Les Carr,
Nigel Shadbolt
)
- view paper
-
16:55-17:10:
Towards Green Linked Data
(
Julia Hoxha,
Anisa Rula,
Basil Ell
)
- view paper
-
17:10-17:30:
Closing Discussion
-
18:00-19:00:
ISWC 2011 Linked Data-a-thon
-
19:30- ... :
Linked Data Gathering (location: Bönnsch brewery restaurant)
Topics of Interest
Relevant topics for COLD 2011 include but are not limited
to:
- Web scale data management (indexing,
crawling, etc.)
- Query processing over multiple linked
datasets
- Search in the Web of Data
- Auto-discovery
- of URIs,
- of additional data that is not from the
authoritative source of a URI,
- of relevant linked datasets in general
- Caching and replication
- Dataset dynamics
- processing change notifications,
- keeping consistency,
- temporal tracking of linked datasets
- Reasoning on Linked Data
from multiple sources
- Knowledge discovery deriving insights from the Web of Data
- Information quality of Linked Data
- information quality assessment,
- trustworthiness,
- provenance
- User interface research for the interaction with the Web of Data
- user interaction and usability,
- visualizing Linked Data,
- natural language interfaces
Submissions
We seek full technical research papers with a length of up to 12 pages. In addition to these full papers, researchers are invited to submit short vision papers and short systems/demo paper with a length of up to 6 pages, respectively; vision and systems/demo papers must be clearly marked as such.
Paper submissions must be formatted in the style of the
Springer Publications format for Lecture Notes in Computer Science (LNCS).
Please submit your paper via EasyChair at http://www.easychair.org/conferences/?conf=cold2011
Submissions that do not comply with the formatting of LNCS or
that exceed the page limit will be rejected without review.
We note that the author list does not need to be anonymized, as
we do not have a double-blind review process in place.
Submissions will be peer reviewed by three independent
reviewers. Accepted papers have to be presented at the workshop proceedings.
Proceedings
The workshop proceedings are online as CEUR-WS.org Vol-782.
Workshop Organization
The workshop will be co-located with the 10th International Semantic Web Conference (ISWC) in Bonn, Germany, and will be held on October 23, 2011.
To attend the workshop you have to register for the conference using the ISWC registration system.
The workshop will also consist of:
- Opening session: This will permit introduction of the workshop topics, goals, participants, and expected outcomes.
- Keynote speaker: Prof. Daniel P. Miranker (University of Texas at Austin, US) will
give a kaynote at the workshop.
- Research Track: Accepted research papers will be presented at the workshop.
- Visions Track: Accepted visions papers will be presented at the workshop.
- Systems/Demo Track: Accepted systems/demo papers will be presented at the workshop.
- Communication: Networked communication will be encouraged during the workshop using IRC,
microblogging and other services, provided with the official hashtag (#cold2011) to follow the live-stream of the event.
Organizing Committee
Programme Committee
- Jose Luis Ambite, University of Southern California, USA
- Cosmin Basca, University of Zurich, Suisse
- Sean Bechhofer, University of Manchester, UK
- Christian Bizer, Freie Universität Berlin, Germany
- Gong Cheng, Nanjing University, China
- Oscar Corcho, Universidad Politecnica de Madrid, Spain
- Richard Cyganiak, DERI, Ireland
- Li Ding, Rensselaer Polytechnic Institute, USA
- Christina Feilmayr, Johannes Kepler University of Linz, Austria
- Fabien Gandon, INRIA - Edelweiss, France
- Yolanda Gil, University of Southern California, USA
- Hugh Glaser, University of Southampton, UK
- Paul Groth, Vrije Universiteit Amsterdam, Netherlands
- Claudio Gutierrez, Universidad de Chile, Chile
- Harry Halpin, University of Edinburgh, UK
- Michael Hausenblas, DERI, Ireland
- Tom Heath, Talis, UK
- Ralf Heese, Freie Universität Berlin, Germany
- Ivan Herman, W3C
- Katja Hose, Max-Planck-Institut für Informatik, Germany
- Hak-Lae Kim, Samsung R&D, Korea
- Lalana Kagal, Massachusetts Institute of Technology, USA
- Ian Millard, University of Southampton, UK
- Alexandre Passant, DERI, Ireland
- Axel Polleres, DERI, Ireland
- Kai-Uwe Sattler, TU Illmenau, Germany
- Bernhard Schandl, University of Vienna, Austria
- Simon Schenk, University of Koblenz-Landau, Germany
- Raphael Troncy, EURECOM, France
- Boris Villazon-Terrazas, Universidad Politecnica de Madrid, Spain
- Denny Vrandecic, Karlsruhe Institute of Technology, Germany
Contact
For further information about the workshop,
please contact the workshops chairs at cold.org.ws@googlemail.com
History
COLD 2011 is the second edition of the Consuming Linked Data workshop series. The first edition was COLD 2010.
Acknowledgements
The workshop is partly supported by the PlanetData project.