SWEO Community Project: Linking Open Data on the Seamntic Web
Semantic Web Search Engines
This page collects links to Semantic Web Search Engines. Semantic Web Search Engines use robots to crawl RDF data from the Web and provide search and navigation facilities over crawled data.
The page is part of the community project wiki:SweoIG/TaskForces/CommunityProjects/LinkingOpenData
Semantic Web Search Engines
Falcons developed by IWS China, currently indexes 10 million RDF documents.
Sindice developed by DERI Ireland, currently indexes over 20 million RDF documents. See also their ISWC2007 paper Sindice.com: Weaving the Open Linked Data
Watson developed by KMi, UK. See also the list of papers and presentations about Watson
Yahoo! MicroSearch Search engine over RDF, RDFa and Microformats crawled from the Web. See also Documentation
Semantic Web Search Engine (SWSE). Developed by DERI Ireland. See also their paper MultiCrawler: A Pipelined Architecture for Crawling and Indexing Semantic Web Data.
Swoogle. Developed by ebiquity group at UMBC USA, currently indexes 2.3 million RDF documents. See also their papers about Swoogle
Semantic Web Search developed by Intellidimension Inc.
Uriqr developed by KMi, UK (currently unavailable; not dead, just hibernating - TomHeath, 2007-11-06)
Zitgist Search (still in beta)
Related Technologies
Semantic Web Crawling: a Sitemap Extension. The sitemap extension allows Data publishers can state where RDF is located and which alternative means are provided to access it (Linked Data, SPARQL endpoint, RDF dump). Semantic Web clients and Semantic Web crawlers can use this information to access RDF data in the most efficient way for the task they have to perform.
Related Publications, Blog Posts and Websites
Gong Cheng, Weiyi Ge, Honghan Wu and Yuzhong Qu: Searching Semantic Web Objects Based on Class Hierarchies, LDOW 2008.
Kyumars Sheykh Esmaili, Hassan Abolhassani: A Categorization Scheme for Semantic Web Search Engines