| 
          
            | 
  
    |  |  
    | Objectives |  
    |  |  
    |  |  
    |  |  
    | In recent years we have witnessed substantial exploitation of search technologies, both at web and enterprise scale. However, the representation of user queries and information in existing search appliances is still almost exclusively achieved by simple syntax-based descriptions (i.e. keyword queries matched against bag-of-words document representation). While these systems have shown to work well for many common search needs, they work on the basis of rough approximations and usually fail to address more complex tasks such as aggregation and information analytics. On the other hand, recent advances in the field of semantic technologies have resulted in tools and standards that allow for the articulation of domain knowledge at a high level of expressivity. Semantic repositories and reasoning engines have now advanced to a state where querying and processing of this knowledge can scale to large-scale scenarios. As such, semantic technologies are posed to provide significant contributions to IR problems. More expressive descriptions of resources are achieved through the representation of the resource content in terms of concepts and structured data (OWL, RDF). The recent media interest around Wolfram Alpha, PowerSet (acquired by Microsoft Bing) and Yahoo! SearchMonkey show the expectations regarding the impact of semantic search The other way around, we have also seen the successful adoption of ideas from IR to the problem of search in semantic (Web) data, which is due to the increasing size of the Semantic Web. Popular examples include the Linking Open Data project, the large body of data in forms of Microformats and RDFa data associated with text. Common to these scenarios is that the search is focused not on a document collection, but on semantic data (which may be possibly linked to or embedded in textual information). Search and ranking large amount of semantic data on the Web is another key topic addressed by this workshop.  |  
    |  |  |  
          
            | 
   |  |   | Challenges |   |  |   |  |   |  |  
    | In this context, challenges for Semantic Search research will include, among others:  
How can semantic technologies be applied to the IR problems? How to address scalability and effectiveness of data Web search (by applying IR technologies)? How to allow web user to exploit the expressiveness of the semantic data on the Web? I.e. how to lower the technical barriers for users to ask complex questions and to interact with web data to obtain concrete answers for complex needs? And most importantly, how can this new generation of search systems that successfully exploit semantics for IR or for data Web search can be evaluated and compared  (with standard IR systems or semantic repositories)?  |  
    |  |  |  
          
            | 
   |  |   | Topics of Interests |   |  |   |  |   |  |  
    | Semantic Search is defined through two main directions. First is Semantic-driven IR, the application of semantic technologies to the IR problem. The second is Semantic Data Search, which mainly deals with the retrieval of semantic data.  Main topics of interest for the envisioned workshop contributions include (but are not limited to) the following: Semantic-driven IR 
Semantic Data SearchExpressive Document ModelsKnowledge Extraction for Building Expressive Document RepresentationMatching and Ranking based on Expressive Document RepresentationInfrastructure for Semantic-driven IR 
Interaction Paradigms for Semantic SearchCrawling, Storage and Indexing of Semantic DataSemantic Data Search and RankingData Web Search: Search in Multi-Data-Source, Multi-Repository ScenariosDealing with Vague, Incomplete and Dirty Semantic DataInfrastructure for Searching Semantic Data on the Web 
Evaluation of semantic searchNatural Language InterfacesKeyword-based Query InterfacesHybrid Query Interfaces (A Combination of NL, Keywords, Forms, Facets, and Formal Queries)Visualization of Semantic Data and Expressive Document Representation on the Web 
Evaluation Methodologies for Semantic SearchStandard Datasets and Benchmarks for Semantic SearchInfrastructure for Semantic Search Evaluation |  
    |  |  |  
          
            | 
   |  |   | Program |   |  |   |  |   |  |  
    | Each presentation is 25 minutes + 5 minutes for questions. 09:00 - 10:30  Session 109:00 Welcome09:15 Keynote 1: Anatomy of the long tail: on satisfying niche interests  by Bo Pang (Yahoo! Research)
 10:00 Algorithm for answer graph construction for keyword queries on RDF data [PDF]
 Parthasarathy K, Sreenivasa Kumar P and Dominic Damien
 10:30 - 11:00  Coffee Break11:00 - 12:00  Session 211:00 Using Personalized PageRank for Keyword Based Sensor Retrieval [PDF]Lorand Dali, Alexandra Moraru and Dunja Mladenic
 11:30 RDF Visualization using a Three-Dimensional Adjacency Matrix [PDF]
 Mario Arias Gallego, Javier D. Fernández, Miguel A. Martinez-Prieto and Pablo De La Fuente
 12:00 - 13:30  Lunch13:30 - 15:00  Session 313:30 Learning to Rank for Semantic Search [PDF]Lorand Dali and Blaz Fortuna
 14:00 Semantic Information Filtering - Beyond Collaborative Filtering [PDF]
 Ivo Lašek and Peter Vojtáš
 14:30 Two-layered architecture for peer-to-peer concept search [PDF]
 Janakiram Dharanipragada, Fausto Giunchiglia, Harisankar Haridas and Uladzimir Kharkevich
 15:00 - 15:30  Coffee Break15:30 - 17:15  Session 415:30 Keynote 2: Integrating and Browsing Linked Data at Web Scale with SWSE by Andreas Harth (AIFB)
 
16:15 Discussion on Semantic Search evaluation
Abstract:
Traditional search models do not fully exploit the potential that
structured datasets offer for searching and browsing. The talk covers challenges
that arise during collection, integration, analysis, and exploration of large
structured datasets. The talk presents experiments on Web datasets, and presents
a universal method for interactive browsing of such datasets independent of
domain or schema. The overall architecture of SWSE, the Semantic Web Search
Engine, includes many of the traditional search engine components, but the unique
characteristics of collectively-edited structured datasets require application
of significantly refined techniques inside each component. The talk
covers work on the overall architecture and implementation of SWSE,
including a crawler for data acquisition, optimised index
structures and query processing for fast response times, a ranking
component to prioritise data items and data sources for
display, and a user interface to browse the integrated data.
 
Bio:
Andreas works as project lead at Institute AIFB at the Karlsruhe
Institute of Technology, Germany, and has been a Ph.D. student with the
Digital Enterprise Research Institute
(DERI) at the National University of Ireland, Galway since March 2004
under the supervision of  Prof. Stefan Decker. He holds a Dipl.-Inf.
(FH) (a nearby equivalent to the U.S.-style M.Sc.) from Fachhochschule
Würzburg. Andreas worked as intern at Fraunhofer Gesellschaft in
Würzburg (1998/1999) and at IBM's Sillicon Valley Lab in San
Jose, CA (2001). He visited USC's Information Sciences Institute in
Marine del Rey, CA as a research assistant (2003/2004). His research
interest is large-scale data interoperation on the Semantic Web. Andreas
has published over a dozen papers, and is author of several open source
software systems. In addition, he has worked in
numerous EU and national projects, served on various program committees,
and participated in W3C working groups.
 17:15 Close
 
 |  
    |  |  |  
          
            | 
   |  |   | Semantic Search Challenge |   |  |   |  |   |  |  
    | Building on the success of previous year's Semantic Search evaluation, 
we call for participation in the Semantic Search Challenge 2011, a 
competition that requires participants to answer queries of varying 
complexity based on a set of structured data collected from the 
Web. This year's competition will consists of two tracks. The first "Entity Search Track" consists of 
queries that refer to one particular entity. The second "List Search 
Track" consists of complex queries with multiple possible answers. The 
winners of each track will receive a prize of $500 dollars (sponsored 
by 
Yahoo!). 
The results of the evaluation will be presented at the 4th 
International 
Semantic Search Workshop (SemSearch 2011), co-located with the World 
Wide Web Conference (WWW 2011) in Hyderabad, India. However, you don't 
need to attend the workshop to participate in the Challenge. 
 Datasets Each track of the competition requires participants to rank 
objects in the same collection, but using different types of queries. Collection The collection is a sample of Linked Data crawled from publicly
 available sources. The dataset consists of a set of RDF triples with 
provenance information for each triple. This dataset has been previously
 made available as part of the Billion Triple Challenge 2009, see some statistics of the dataset and details of the format.
 We have modified this dataset, because the original Billion Triple 
Challenge 2009 dataset contains blank nodes. We will not deal with blank
 nodes in this evaluation and thus require participants to encode blank 
nodes according to the following rule: BNID map to 
http://example.org/URLEncode(BNID), where BNID is the blank node id. 
Since the blank node ids in that dataset are unique, this convention is 
sufficient to map blank nodes to obtain distinct URIs. Instead of 
encoding the blank nodes using this convention, participants can also 
download the following version of the Billion Triple Challenge 2009 
dataset where blank nodes are have been already converted to URIs: Download dataset. Query set #1(Entity Search track)
 For the competition in this track, we provide a set of queries 
that are focused on the task of entity search. These queries are a 
subset of the queries in the Yahoo! Search Query Tiny Sample dataset available under the Yahoo! Webscope. We selected from this dataset a number of entity queries, i.e. queries that refer to one particular entity. See sample queries. 
 
 Query set #2(List Search track)
 The goal of this track is select objects that match particular 
criteria. These queries have been hand-written by the organizing 
committee. See sample queries. 
 The format of these query sets is one query per line in plain text format using UTF-8 encoding.
 The final queries will be posted when the competition begins on March 1st, 2011. The  deadline for submitting results is the March 21st, 2011 . For further news and discussions related to SemSearch 2011 and the 
Semantic Search Challenge, please register at 
http://tech.groups.yahoo.com/group/semsearcheval/. For further details on this challenge, please refer to http://semsearch.yahoo.com/ (This site will go online on the 21st of February).
 |  
    |  |  |  
          
            | 
   |  |   | Organizers |   |  |   |  |   |  |  
    | 
Marko Grobelnik, Jožef  
Stefan Institute, Ljubljana, SloveniaPeter Mika, Yahoo! 
Research, Barcelona, SpainThanh Tran Duc, 
Institute AIFB, University of Karlsruhe (TH), GermanyHaofen Wang, Apex 
Lab, Shanghai Jiao Tong University, China |  
    |  |  |  
          
            | 
   |  |   | Program Committee |   |  |   |  |   |  |  
    | 
Bettina Berendt, Katholieke Universiteit Leuven, BelgiumPablo Castells, Universidad Autonoma de Madrid, SpainPhilipp Cimiano, Semantic Computing Group, Bielefeld University, GermanyGong Cheng, Nanjing University, ChinaMathieu d'Aquin, KMI, Open University, EnglandMiriam Fernandez, KMI, Open University, EnglandBlaz Fortuna, Jožef Stefan Institute, SloveniaNorbert Fuhr, Universität Duisburg-Essen, GermanyLise Getoor, University Maryland, USAPeter Haase, Fluid Operations, Waldorf, GermanyHarry Halpin, University of Edinburgh, ScotlandAndreas Harth, Institute AIFB, Karlsruhe Institute of Technology, GermanyMichiel Hildebrand, Centre for Mathematics and Computer Science Amsterdam, NetherlandsAidan Hogan, DERI, Galway, IrelandGuenter Ladwig, Institute AIFB, Karlsruhe Institute of Technology, GermanyAxel Polleres, Deri, Galway, IrelandYuzhong Qu, Nanjing University, ChinaDanh Lephuoc, Deri, Galway, IrelandDaniel Schwabe, Departamento de Informática, BrazilSergej Sizov, University of Koblenz-Landau, GermanyRudi Studer, Institute AIFB, University of Karlsruhe, GermanyKavitha Srinivas, IBM Research, Hawthorne, USACao Hoang Tru, HCMC University of Technology, HCMC, VietnamGiovanni Tummarello, Deri, Galway, IrelandYong Yu, Apex Lab, Shanghai Jiao Tong University, ChinaIlya Zaihrayeu, University of Trento, ItalyHugo Zaragoza, Yahoo! Research Barcelona, Spain |  
    |  |  |  
          
            | 
   |  |   | Submission and Proceedings |   |  |   |  |   |  |  
    | For submissions, the following rules apply:  
    Full technical papers: up to 10 pages in ACM format Short position or demo papers: up to 5 pages in ACM format  Submissions must be formatted using the WWW2011 templates.  Submissions will be peer reviewed by three independent reviewers. 
Accepted papers will be presented at the workshop and 
included in the workshop proceedings. 
We will pursue a journal special issue with the  topics of the 
workshop if we receive an appropriate number of high-quality 
submissions. 
Details on the proceedings and camera-ready formatting will be 
announced upon notification of the authors. 
Please use the following link to the submission system to submit your 
paper: EasyChair Submission System for SemSearch11 at 
http://www.easychair.org/conferences/?conf=semsearch11  |  
    |  |  |  |  | 
          
            | 
              
                |  |  
                | News |  
                |  |  
    | 2011-2-15: Sample queries are posted! 2011-2-15: Semantic Search Challenge 2011 (sponsored by Yahoo!) is open
 |  
                |  |  
			  
			    |  |  
			    | Important Dates |  
			    |  |  
			    | Deadline for paper submission: February 26th, 2011 (12.00 AM, GMT)
 Deadline for submitting results to the challenge: March 21st, 2011 (12.00 AM, GMT)
 Notification of acceptance:March 14th, 2011
 Camera-ready versions: March 21st, 2011
 WWW'11 Conference: March 28th - April 1st, 2011
 Workshop Day: March 29, 2011
 |  
			    |  |  
              
                |  |  
                | Quick Links |  
                |  |  
                | 
		  
		    |  |  
		    | Workshop Support |  
		    |  |  
		    |   The SEALS initiative is sponsoring and supporting the evaluation of 
search tools. The goal of the SEALS project is to provide an 
independent, open, scalable, extensible and sustainable infrastructure 
(the SEALS Platform) that allows the remote evaluation of semantic 
technologies thereby providing an objective comparison of the different 
existing semantic technologies. This will allow researchers and users to
 effectively compare the available technologies, helping them to select 
appropriate technologies and advancing the state of the art through 
continuous evaluation.
 |  
		    |  |  |  
                |  |  |  |