Richard MacManus: 10 Examples of Semantic Apps

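Richard MacManus regards semantic applications as one of the highlights of October's Web 2.0 Summit in San Francisco, and he lists 10 examples of semantic apps. Note, however, that this is not a "top 10" ranking.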

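1. What is a semantic app?

A core element of a semantic app is that it tries to determine the meaning of text and other data, and then create connections for users. Nova Spivack (founder of Twine, introduced below) likewise names data portability and connectibility as key features of semantic apps.

In a post titled "Top-Down: A New Approach to the Semantic Web", Alex Iskold describes two ways of building semantic apps:

Bottom up: embedding semantic annotations (metadata) right into the data;

Top down: relying on analysis of existing information; the ultimate top-down solution would be a natural language processor that understands text and information the way people do.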

 


To repeat, this is not a top 10 ranking. Richard MacManus's original post is at http://www.readwriteweb.com/archives/10_semantic_apps_to_watch.php. If you have similar apps to recommend, you can add them in the comments on that post.


This article comes from a CSDN blog; please credit the source when reposting: http://blog.csdn.net/zhengyun_ustc/articles/1932192.aspx

=======================================================================

 

One of the highlights of October's Web 2.0 Summit in San Francisco was the emergence of 'Semantic Apps' as a force. Note that we're not necessarily talking about the Semantic Web, which is the W3C initiative led by Tim Berners-Lee that touts technologies like RDF, OWL and other standards for metadata. Semantic Apps may use those technologies, but not necessarily. This was a point made by the founder of one of the Semantic Apps listed below, Danny Hillis of Freebase (who is as much a tech legend as Berners-Lee).

The purpose of this post is to highlight 10 Semantic Apps. We're not touting this as a 'Top 10', because there is no way to rank these apps at this point - many are still non-public apps, e.g. in private beta. It reflects the nascent status of this sector, even though people like Hillis and Spivack have been working on their apps for years now.

What is a Semantic App?

Firstly let's define "Semantic App". A key element is that the apps below all try to determine the meaning of text and other data, and then create connections for users. Another of the founders mentioned below, Nova Spivack of Twine, noted at the Summit that data portability and connectibility are keys to these new semantic apps - i.e. using the Web as platform.

In September Alex Iskold wrote a great primer on this topic, called Top-Down: A New Approach to the Semantic Web. In that post, Alex Iskold explained that there are two main approaches to Semantic Apps:

1) Bottom Up - involves embedding semantic annotations (metadata) right into the data.
2) Top Down - relies on analyzing existing information; the ultimate top-down solution would be a full-blown natural language processor, which is able to understand text like people do.
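
The difference between the two approaches can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `data-type` attribute is a made-up annotation scheme (not RDF, OWL or any real standard), and the "top-down" side is a crude capitalization pattern standing in for real natural language processing.

```python
import re
from html.parser import HTMLParser


class AnnotationReader(HTMLParser):
    """Bottom-up: read the (type, text) pairs a publisher embedded in the page."""

    def __init__(self):
        super().__init__()
        self.facts = []
        self._pending_type = None

    def handle_starttag(self, tag, attrs):
        # "data-type" is a hypothetical attribute used only for this sketch.
        self._pending_type = dict(attrs).get("data-type")

    def handle_data(self, data):
        if self._pending_type and data.strip():
            self.facts.append((self._pending_type, data.strip()))
            self._pending_type = None


def bottom_up(html):
    reader = AnnotationReader()
    reader.feed(html)
    return reader.facts


def top_down(text):
    """Top-down: guess at meaning from plain text with no annotations at all."""
    # Runs of two or more capitalized words are treated as candidate names.
    return re.findall(r"\b(?:[A-Z][a-z]+ )+[A-Z][a-z]+\b", text)


annotated = ('<p><span data-type="person">Danny Hillis</span> founded '
             '<span data-type="product">Freebase</span>.</p>')
plain = "Danny Hillis founded Freebase."

print(bottom_up(annotated))
print(top_down(plain))
```

The annotated page yields typed facts for both the person and the product, while the naive pattern only finds the two-word name in the plain sentence and cannot say what kind of thing it is. That gap is why the top-down camp leans on much heavier language analysis.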

Now that we know what Semantic Apps are, let's take a look at some of the current leading (or promising) products...

Freebase

Freebase aims to "open up the silos of data and the connections between them", according to founder Danny Hillis at the Web 2.0 Summit. Freebase is a database that has all kinds of data in it and an API. Because it's an open database, anyone can enter new data in Freebase. An example page in the Freebase db looks pretty similar to a Wikipedia page. When you enter new data, the app can make suggestions about content. The topics in Freebase are organized by type, and you can connect pages with links, semantic tagging. So in summary, Freebase is all about shared data and what you can do with it.

Powerset

Powerset (see our earlier coverage) is a natural language search engine. The system relies on semantic technologies that have only become available in the last few years. It can make "semantic connections", which help build its semantic database. The idea is that meaning and knowledge get extracted automatically by Powerset. The product isn't yet public, but it has been riding a wave of publicity throughout 2007.

Twine

Twine claims to be the first mainstream Semantic Web app, although it is still in private beta. See our in-depth review. Twine automatically learns about you and your interests as you populate it with content - a "Semantic Graph". When you put in new data, Twine picks out and tags certain content with semantic tags - e.g. the name of a person. An important point is that Twine creates new semantic and rich data. But it's not all user-generated. They've also done machine learning against Wikipedia to 'learn' about new concepts. And they will eventually tie into services like Freebase. At the Web 2.0 Summit, founder Nova Spivack compared Twine to Google, saying it is a "bottom-up, user generated crawl of the Web".

AdaptiveBlue

AdaptiveBlue are makers of the Firefox plugin, BlueOrganizer. They also recently launched a new version of their SmartLinks product, which allows web site publishers to add semantically charged links to their site. SmartLinks are browser 'in-page overlays' (similar to popups) that add additional contextual information to certain types of links, including links to books, movies, music, stocks, and wine. AdaptiveBlue supports a large list of top web sites, automatically recognizing and augmenting links to those properties.

SmartLinks works by understanding specific types of information (in this case links) and wrapping them with additional data. SmartLinks takes unstructured information and turns it into structured information by understanding a basic item on the web and adding semantics to it.

[Disclosure: AdaptiveBlue founder and CEO Alex Iskold is a regular RWW writer]

Hakia

Hakia is one of the more promising Alt Search Engines around, with a focus on natural language processing methods to try and deliver 'meaningful' search results. Hakia attempts to analyze the concept of a search query, in particular by doing sentence analysis. Most other major search engines, including Google, analyze keywords. The company told us in a March interview that the future of search engines will go beyond keyword analysis - search engines will talk back to you and in effect become your search assistant. One point worth noting here is that, currently, Hakia has limited post-editing/human interaction for the editing of hakia Galleries, but the rest of the engine is 100% computer powered.

Hakia has two main technologies:

1) QDEX Infrastructure (which stands for Query Detection and Extraction) - this does the heavy lifting of analyzing search queries at a sentence level.

2) SemanticRank Algorithm - this is essentially the science they use, made up of ontological semantics that relate concepts to each other.

Talis

Talis is a 40-year-old UK software company which has created a semantic web application platform. They are a bit different from the other nine companies profiled here, as Talis has released a platform and not a single product. The Talis platform is kind of a mix between Web 2.0 and the Semantic Web, in that it enables developers to create apps that allow for sharing, remixing and re-using data. Talis believes that Open Data is a crucial component of the Web, yet there is also a need to license data in order to ensure its openness. Talis has developed its own content license, called the Talis Community License, and recently they funded some legal work around the Open Data Commons License.

According to Dr Paul Miller, Technology Evangelist at Talis, the company's platform emphasizes "the importance of context, role, intention and attention in meaningfully tracking behaviour across the web." To find out more about Talis, check out their regular podcasts - the most recent one features Kaila Colbin (an occasional AltSearchEngines correspondent) and Branton Kenton-Dau of VortexDNA.

UPDATE: Marshall Kirkpatrick published an interview with Dr Miller the day after this post. Check it out here.

TrueKnowledge

Venture-funded UK semantic search engine TrueKnowledge unveiled a demo of its private beta earlier this month. It reminded Marshall Kirkpatrick of the still-unlaunched Powerset, but it's also reminiscent of the very real Ask.com "smart answers". TrueKnowledge combines natural language analysis, an internal knowledge base and external databases to offer immediate answers to various questions. Instead of just pointing you to web pages where the search engine believes it can find your answer, it will offer you an explicit answer and explain the reasoning path by which that answer was arrived at. There's also an interesting-looking API at the center of the product. "Direct answers to humans and machine questions" is the company's tagline.

Founder William Tunstall-Pedoe said he's been working on the software for the past 10 years, really putting time into it since coming into initial funding in early 2005.

TripIt

TripIt is an app that manages your travel planning. Emre Sokullu reviewed it when it presented at TechCrunch40 in September. With TripIt, you forward incoming bookings to plans@tripit.com and the system manages the rest. Their patent-pending "itinerator" technology is a baby step in the semantic web - it extracts useful information from these mails and makes a well-structured and organized presentation of your travel plan. It pulls in information from Wikipedia for the places that you visit. It uses microformats - the iCal format, which is well integrated into Google Calendar and other calendar software.

The company claimed at TC40 that "instead of dealing with 20 pages of planning, you just print out 3 pages and everything is done for you". Their future plans include a recommendation engine which will tell you where to go and who to meet.
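
To make the "extract, then structure" idea concrete, here is a minimal Python sketch. It is not TripIt's "itinerator": the e-mail layout, the field labels and both function names are assumptions invented for the illustration, and the calendar output is a simplified VEVENT fragment that leaves out properties a complete iCalendar file needs (such as UID and DTSTAMP).

```python
import re
from datetime import datetime


def parse_booking(email_body):
    """Pull flight, route and departure time out of a plain-text confirmation."""
    flight = re.search(r"Flight:\s*(\w+)", email_body)
    route = re.search(r"From:\s*(\w{3})\s+To:\s*(\w{3})", email_body)
    departs = re.search(r"Departs:\s*([\d-]+ [\d:]+)", email_body)
    if not (flight and route and departs):
        return None  # the mail does not match the assumed layout
    return {
        "flight": flight.group(1),
        "origin": route.group(1),
        "destination": route.group(2),
        "departs": datetime.strptime(departs.group(1), "%Y-%m-%d %H:%M"),
    }


def to_ical_event(booking):
    """Render the structured booking as a simplified iCal VEVENT fragment."""
    stamp = booking["departs"].strftime("%Y%m%dT%H%M%S")
    return "\r\n".join([
        "BEGIN:VEVENT",
        f"SUMMARY:Flight {booking['flight']} "
        f"{booking['origin']}-{booking['destination']}",
        f"DTSTART:{stamp}",
        "END:VEVENT",
    ])


# Hypothetical confirmation mail; real ones vary widely between vendors.
sample_email = """Thank you for booking.
Flight: XY123
From: SFO To: JFK
Departs: 2007-11-30 08:15
"""

booking = parse_booking(sample_email)
print(to_ical_event(booking))
```

A real service has to cope with hundreds of vendor-specific layouts, which is where the hard work lies; the sketch only shows how unstructured mail text becomes a record that calendar software can consume.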

ClearForest

ClearForest is one of the companies in the top-down camp. We profiled the product in December '06 and at that point ClearForest was applying its core natural language processing technology to facilitate next generation semantic applications. In April 2007 the company was acquired by Reuters. The company has both a Web Service and a Firefox extension that leverages an API to deliver the end-user application.

The Firefox extension is called Gnosis and it enables you to "identify the people, companies, organizations, geographies and products on the page you are viewing." With one click from the menu, a webpage you view via Gnosis is filled with various types of annotations. For example, it recognizes Companies, Countries, Industry Terms, Organizations, People, Products and Technologies. Each word that Gnosis recognizes gets colored according to its category.

Also, ClearForest's Semantic Web Service offers a SOAP interface for analyzing text, documents and web pages.

Spock

Spock is a people search engine that got a lot of buzz when it launched. Alex Iskold went so far as to call it "one of the best vertical semantic search engines built so far." According to Alex, there are four things that make their approach special:

  • The person-centric perspective of a query
  • Rich set of attributes that characterize people (geography, birthday, occupation, etc.)
  • Usage of tags as links or relationships between people
  • Self-correcting mechanism via user feedback loop

As a vertical engine, Spock knows important attributes that people have: name, gender, age, occupation and location just to name a few. Perhaps the most interesting aspect of Spock is its usage of tags - all frequent phrases that Spock extracts via its crawler become tags; and also users can add tags. So Spock leverages a combination of automated tags and people power for tagging.
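
The combination of automated and user-supplied tags can be sketched as follows. This is a guess at the general shape of the idea, not Spock's actual algorithm: the two-word phrase window, the `min_count` threshold and the function names are all assumptions made for the example.

```python
from collections import Counter


def frequent_phrases(snippets, min_count=2):
    """Return two-word phrases that appear in at least min_count snippets."""
    counts = Counter()
    for snippet in snippets:
        words = snippet.lower().split()
        # A set counts each phrase at most once per crawled snippet.
        counts.update({" ".join(pair) for pair in zip(words, words[1:])})
    return {phrase for phrase, n in counts.items() if n >= min_count}


def person_tags(snippets, user_tags):
    """Merge automatically extracted tags with tags added by users."""
    return frequent_phrases(snippets) | {tag.lower() for tag in user_tags}


# Hypothetical text fragments a crawler might find about one person.
snippets = [
    "computer scientist and inventor",
    "noted computer scientist",
    "inventor and entrepreneur",
]

print(person_tags(snippets, ["Founder"]))
```

Only "computer scientist" occurs in more than one snippet, so it becomes the automated tag, and the user-added "founder" is merged in alongside it.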

Conclusion

What have we missed? ;-) Please use the comments to list other Semantic Apps you know of. It's an exciting sector right now, because Semantic Web and Web 2.0 technologies alike are being used to create new semantic applications. One gets the feeling we're only at the beginning of this trend.
