Semantic search

From HLWIKI Canada

Jump to: navigation, search
Semantic search is linked to web 2.0 & web 3.0
Are you interested in contributing to HLWIKI Canada - hlwiki.ca? contact: dean.giustini@ubc.ca

To browse other articles on a range of HSL topics, see the wiki index.

Contents

Introduction

See also GoPubMed, PubReMiner and Semantic web

Semantic search tools seek to improve the accuracy of web searching by considering context (or meaning) of terms as they occur in desirable web documents. Instead of Google's PageRank to predict relevancy, semantic search tools use the science of meaning in language to produce relevant results for users. The goal on semantic search is to deliver information in context rather than have to sort through lists of loosely related keyword results. Some authors regard semantic search tools as a set of techniques to retrieve knowledge from richly structured data sources which enable technologies to articulate domain knowledge at a sophisticated level. Semantic tools on the web are likely to rely on metadata to describe documents and bring them together in information retrieval. This language will help to describe and retrieve documents much like what we encounter in medical databases. Metadata is defined as 'data about data' and the major standard in the field is Dublin Core.

For more background, see the Semantic Media wiki.

Cognition Search http://www.cognition.com/

Cognition search uses a natural language mapping technology and a blend of linguistics and mathematical algorithms to locate and collate content. In effect, this helps computers to find meaning (or related concepts) with the words we use in searches. CS understands the relationship between words and phrases (meaning), paraphrases (a "finger" or a "digit") and taxonomies (a "finger" is part of a "hand", a "cow" is a "bovine" and is a "mammal"). Cognition search permits searching across four domains:

  • Law - Public.Resource.org (1,858 volumes; 675,704 files of federal case law). US Supreme Court Decisions and Court of Appeals decisions from 1950 onward
  • Medicine - Medline, abstracts for biomedical information from international literature; database covers medicine, nursing, pharmacy, dentistry, veterinary medicine; fields with no direct medical connection, such as molecular evolution (~19 million files).
  • Wiki - English version of Wikipedia
  • Bible - New English Translation of Gospels of Matthew, Luke, John and Mark

Duck Duck Go http://duckduckgo.com/

When searching Duck Duck Go, the service brings up the most 'official' page first and if the search terms are linked to a Wikipedia page, a short blurb will appear as well as related search terms at the top. DDG features special category pages, and recognizes calculations, phone numbers, zip codes, ISBNs and product codes, as well as street and IP addresses.

Exalead http://exalead.com

Exalead offers a host of enterprise 2.0 options to narrow searches based on image size, color and content. These features are appearing in other search engines.

Evri.com http://evri.com

Evri is a technology company developing products that change the way consumers discover and engage with content on the Web. Some publishers have used Evri’s semantic platform on their sites, including the most prestigious news organizations such as the Washington Post, Hearst Publishing, Yahoo! and the Times of London. With over 2 million pages across 500 categories, several content recommendation applications and a feature-rich API platform, Evri is rapidly improving access to information.

Factbites http://www.factbites.com/

The aim of the engine is to return meaningful sentences for the search query. Factbites offers you real, meaningful sentences that are right on topic - a technique that lies between a site summary and summary of results.

Falcons http://iws.seu.edu.cn/services/falcons/objectsearch/index.jsp

Falcons is a keyword-based search engine for the Semantic Web. Falcons provides keyword-based search for URIs identifying objects, concepts (classes and properties), and documents on the Semantic Web.

Hakia - http://hakia.com

hakia is a semantic search technology company. The mission of hakia is to deploy semantic search solutions to meet the challenges of elevated user expectations, business efficiency, and lowest cost.

JANE http://biosemantics.org/jane/index.php

Have you recently written a paper, but you're not sure to which journal you should submit it? Or maybe you want to find relevant articles to cite in your paper? Or are you an editor, and do you need to find reviewers for a particular paper? Try Jane

Kosmix http://kosmix.com

Kosmix takes its concept further by providing users with a dashboard of contentcalled – Your guide to the We. The focus is on informational search and makes it suitable for topics when you want information rather than looking for a specific answer or URL. Kosmix received $20 million of funding from Time Warner in late 2008. Its content aggregating technology will become more important as content on the web grows.

Lexxe http://lexxe.com

Leksi is derived from a linguistic term "Lexical" meaning "related to words". It emphasizes language processing from the level of words and the meanings associated with them. It has been exploring more intelligent ways to find information for users in a more meaningful way. Method will bring more accurate and relevant search results than the current search tools.

Lumifi http://lumifi.com

Lumifi resides in your browser and allows you to store searches, bookmark sites, write notes and export to share with colleagues and friends. As mentioned all of these features reside in the Dashboard layout.

NEPOMUK - The Social Semantic Desktop http://nepomuk.semanticdesktop.org/xwiki/bin/view/Main1/

Networked Environment for Personalized, Ontology-based Management of Unified Knowledge (NEPOMUK) brings together researchers, industrial software developers, and representative industrial users, to develop a comprehensive solution for extending the personal desktop into a collaboration environment which supports both the personal information management and the sharing and exchange across social and organizational relations.

NLM Plus http://nlmplus.com/

NLMplus is an innovative semantic search and discovery application, developed by WebLib LLC, a small business in Maryland, in response to a challenge contest by the National Library of Medicine (NLM) to make use of NLM’s vast collection of biomedical data and services for the benefit of the Library’s diverse worldwide user communities.

Powerset http://www.powerset.com/

Powerset is applying its natural language processing to search, aiming to improve how we find information by unlocking the meaning in ordinary human language. Powerset is a search and discovery experience for Wikipedia and improves the entire search process. In the search box, you can express yourself in keywords, phrases, or simple questions. On the results page, questions are answered directly, and aggregated from across multiple articles.

PureDiscovery http://www.purediscovery.com/

When it comes to search, - “Meaning Matters.” It’s time for a search engine that thinks like we do, learns like we can, and interacts with us in a human-like way. PureDiscovery KnowledgeGraph is a leap forward for search that allows users to interact with a search engine conversationally as opposed to using a cryptic search language. Armed with our new approach, your organization will both maximize on-target results and minimize unproductive search time. PureDiscovery is leading a radical reinvention of search, based on a core set of beliefs.

Quertle http://www.quertle.info/

Quertle ("Relationship-driven biomedical research -- Intelligent semantic queries") goes beyond simple term matching to identify the most salient information in the literature. Using a combination of linguistic methods, Quertle finds facts defined within documents, creating its own database of nearly 200 million relationships, and is able to report the ones that are relevant to your query. Quertle's approach is based on a thorough understanding of biology and chemistry and was built from the ground up to address the unique needs of this technical literature.

Semager http://www.semager.de/

A semantic search engine out of Germany. You should look at its analysis page for a website - interesting data there. I like looking into its list of related terms, which allows you to tag surf other aspects of the inquiries meaning.

SemanticWebSearch http://www.semanticwebsearch.com/query/

SWS is a search engine that precisely locates and gathers information on the web. It provides a standard search interface that is able to describe the information people need. On a traditional search engine you might search for information about a person by using 'John' and 'Smith' which may produce results ranging from biblical passages to iron working techniques. SWS allows you to search for persons (foaf:Person) that have the correct first name (foaf:firstName) 'John' and surname (foaf:surname) 'Smith'. You can perform another search to locate news articles (rss:item) authored by (dc:creator) John Smith. This will allow you to describe your search need using well-defined vocabulary (dc:creator, foaf:Person, ...) on the semantic web. Only information that exactly matches your search is returned in our results.

Semantic Web Search Engine http://swse.deri.org/

SenseBot http://www.sensebot.net/

SenseBot (Beta) is a semantic search engine that generates a text summary of web pages on the topic of your search. It uses text mining and multidocument summarization to extract sense from Web pages and present it to the user in a coherent manner. A "Semantic Cloud" of concepts is displayed above the summary, allowing to steer the focus of the results. See some results.

Sindice, semantic web index http://sindice.com/

Billion pieces of reusable information can already be found across hundreds of millions web pages which embed RDF and Microformats. Start consuming this data today with Sindice Data Web services.

Standle http://www.standle.com

Standle is devoted to finding great search engines for users and more accurate search results through those best search engines in their specific areas.

Swingly http://www.swingly.com

TipTop http://beta.tiptopbest.com/aboutus.html

TipTop Technologies, Inc. is an emerging Silicon Valley-based company founded during the summer of 2008, whose first consumer-facing product on the Internet was launched at FeelTipTop.com in June 2009. Through building some unique and powerful technology at the outset, TipTop is well-positioned to take up a leadership position in the growing market of semantic-driven products both in the consumer and the enterprise space.

TrueKnowledge http://www.trueknowledge.com/

The world's first AI question-answering platform. We are using our unique semantic technology to build the first internet-scale platform for directly answering the world's questions. As knowledge is added to the platform we understand and answer more and more.

Truevert http://www.truevert.com

The goal of Truevert is to provide users with information focused on their interest. It provides a scalable, accurate, and powerful tool that parses meaning of language the same way people do - by its context. Truevert is focused on green, environmental awareness. All searches are done from the point of view of environmental and social concern but does not depend on a taxonomy, ontology, thesaurus, dictionary, or require authors to categorize content (as in the so-called semantic web).

URLClassifier Service http://www.urlclassifier.com

A semantic web-service for online extracting topics from URLs -- great search possibilities

Watson http://watson.kmi.open.ac.uk/WatsonWUI/

This is the Watson Web interface for searching ontologies and semantic documents using keywords. The interface is subject to frequent evolutions and improvements. At the moment, enter a set of keywords (e.g. "cat dog old_lady") and a list of URIs of semantic documents will appear where keywords are as identifiers or in classes, properties, and individuals. You can use "wildcards" in the keywords (e.g., "ca? dog*"). You can restrict to particular types of entities (classes, properties or individuals) and elements within entities (local name, label, comment or any literal). For example, you can express queries like "give me the classes or the individuals using the term car in the name or in the label".

Yebol http://www.yebol.com

Yebol's mission is to build the world's knowledge base and provide knowledge based search (semantics) and services. Yebol utilizes a combination of patented algorithms paired with human knowledge to build a Web directory for each query and each user. Instead of the common “listing” of Web search queries, Yebol automatically clusters and categorizes search terms, Web sites, pages and contents.

Yummly http://www.yummly.com/

Yummly is the world’s first semantic recipe search and recommendation platform. Yummly enables you to find and customize recipes based on your personal taste, nutritional and dietary preferences. The site aggregates recipes from cooking websites, and is fully integrated with Facebook.

Zitgist http://zitgist.com/

Use the Browser Application to browse data sources of the Semantic Web. Its UMBEL (Upper-level Mapping and Binding Exchange Layer) is a reference structure for placing content and data in context with other data. It is comprised of 20,000 subject concepts and their relationships — with one another and with external vocabularies and named entities.

References

Personal tools