Google’s Caffeine

Google将要在下个月的OSDI10上发表一篇关于它的新索引系统的文章。
据悉新的索引系统被称为Caffeine(咖啡因)。这咖啡因除了系统细节是悬念外,其名字来由也是个悬念?是缩写?还是有特殊含义?
另外,咖啡因系统(很容易让人联想到生产咖啡~_~)最大的特色是以BigTable为核心,数据库驱动的。大家都知道MapReduce,GFS和BigTable被称为Google的三驾马车。索引系统首先让人想到的是其海量数据,使用BigTable而不是更适合分布式处理的MapReduce来构建索引系统,会让人认为Google技术上又了进一步。例如,这篇评论:
Google moving on from MapReduce: rest of world still catching up
实际上,上文也提到之所以不使用MapReduce是它不适合实时(real-time)处理的场合,MapReduce更适用于顺序(sequence)处理的应用。当然,两者有可能是相互配合的,就像Google News的架构一样。其实,我觉得下面这段话更能说明“咖啡因”的意义:

The big picture is maybe more interesting. We used to view the Web as a large collection of documents—as a library. Indexes updated daily were just fine. We now view the Web as an endless stream of data—like a live meeting between billions of people.

从静态的documents到动态的stream of data,Google对搜索技术的理解的确是领先其它竞争对手的。至于实现的技术,其实他们早就公布了,只是如何使用这些技术去实现这个想法而已。

Advertisements

About chill

不再执意欢喜反倒让人欢喜
This entry was posted in TECH. Bookmark the permalink.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s