Five years ago, we launched a conference based on a simple idea, and that idea grew into a movement. The original Web 2.0 Conference (now the Web 2.0 Summit ) was designed to restore confidence in an industry that had lost its way after the dotcom bust. The Web was far from done, we argued. In fact, it was on its way to becoming a robust platform for a culture-changing generation of computer applications and services.
In our first program, we asked why some companies survived the dotcom bust, while others had failed so miserably. We also studied a burgeoning group of startups and asked why they were growing so quickly. The answers helped us understand the rules of business on this new platform.
Chief among our insights was that "the network as platform" means far more than just offering old applications via the network ("software as a service"); it means building applications that literally get better the more people use them, harnessing network effects not only to acquire users, but also to learn from them and build on their contributions.
From Google and Amazon to Wikipedia, eBay, and craigslist, we saw that the value was facilitated by the software, but was co-created by and for the community of connected users. Since then, powerful new platforms like YouTube, Facebook, and Twitter have demonstrated that same insight in new ways. Web 2.0 is all about harnessing collective intelligence.
Collective intelligence applications depend on managing, understanding, and responding to massive amounts of user-generated data in real time. The "subsystems" of the emerging internet operating system are increasingly data subsystems: location, identity (of people, products, and places), and the skeins of meaning that tie them together. This leads to new levers of competitive advantage: Data is the "Intel Inside" of the next generation of computer applications.
Today, we realize that these insights were not only directionally right, but are being applied in areas we only imagined in 2004. The smartphone revolution has moved the Web from our desks to our pockets. Collective intelligence applications are no longer being driven solely by humans typing on keyboards but, increasingly, by sensors. Our phones and cameras are being turned into eyes and ears for applications; motion and location sensors tell where we are, what we’re looking at, and how fast we’re moving. Data is being collected, presented, and acted upon in real time. The scale of participation has increased by orders of magnitude.
With more users and sensors feeding more applications and platforms, developers are able to tackle serious real-world problems. As a result, the Web opportunity is no longer growing arithmetically; it’s growing exponentially. Hence our theme for this year: Web Squared. 1990-2004 was the match being struck; 2005-2009 was the fuse; and 2010 will be the explosion.
Ever since we first introduced the term "Web 2.0," people have been asking, "What’s next?" Assuming that Web 2.0 was meant to be a kind of software version number (rather than a statement about the second coming of the Web after the dotcom bust), we’re constantly asked about "Web 3.0." Is it the semantic web? The sentient web? Is it the social web? The mobile web? Is it some form of virtual reality?
It is all of those, and more.
The Web is no longer a collection of static pages of HTML that describe something in the world. Increasingly, the Web is the world – everything and everyone in the world casts an "information shadow," an aura of data which, when captured and processed intelligently, offers extraordinary opportunity and mind bending implications. Web Squared is our way of exploring this phenomenon and giving it a name.
Redefining Collective Intelligence: New Sensory Input
To understand where the Web is going, it helps to return to one of the fundamental ideas underlying Web 2.0, namely that successful network applications are systems for harnessing collective intelligence.
Many people now understand this idea in the sense of "crowdsourcing," namely that a large group of people can create a collective work whose value far exceeds that provided by any of the individual participants. The Web as a whole is a marvel of crowdsourcing, as are marketplaces such as those on eBay and craigslist, mixed media collections such as YouTube and Flickr, and the vast personal lifestream collections on Twitter, MySpace, and Facebook.
Many people also understand that applications can be constructed in such a way as to direct their users to perform specific tasks, like building an online encyclopedia (Wikipedia), annotating an online catalog (Amazon), adding data points onto a map (the many web mapping applications), or finding the most popular news stories (Digg, Twine). Amazon’s Mechanical Turk has gone so far as to provide a generalized platform for harnessing people to do tasks that are difficult for computers to perform on their own.
But is this really what we mean by collective intelligence? Isn’t one definition of intelligence, after all, that characteristic that allows an organism to learn from and respond to its environment? (Please note that we’re leaving aside entirely the question of self-awareness. For now, anyway.)
Imagine the Web (broadly defined as the network of all connected devices and applications, not just the PC-based application formally known as the World Wide Web) as a newborn baby. She sees, but at first she can’t focus. She can feel, but she has no idea of size till she puts something in her mouth. She hears the words of her smiling parents, but she can’t understand them. She is awash in sensations, few of which she understands. She has little or no control over her environment.
Gradually, the world begins to make sense. The baby coordinates the input from multiple senses, filters signal from noise, learns new skills, and once-difficult tasks become automatic.
The question before us is this: Is the Web getting smarter as it grows up?
Consider search – currently the lingua franca of the Web. The first search engines, starting with Brian Pinkerton’s webcrawler, put everything in their mouth, so to speak. They hungrily followed links, consuming everything they found. Ranking was by brute force keyword matching.
In 1998, Larry Page and Sergey Brin had a breakthrough, realizing that links were not merely a way of finding new content, but of ranking it and connecting it to a more sophisticated natural language grammar. In essence, every link became a vote, and votes from knowledgeable people (as measured by the number and quality of people who in turn vote for them) count more than others.
Modern search engines now use complex algorithms and hundreds of different ranking criteria to produce their results. Among the data sources is the feedback loop generated by the frequency of search terms, the number of user clicks on search results, and our own personal search and browsing history. For example, if a majority of users start clicking on the fifth item on a particular search results page more often than the first, Google’s algorithms take this as a signal that the fifth result may well be better than the first, and eventually adjust the results accordingly.
Now consider an even more current search application, the Google Mobile Application for the iPhone. The application detects the movement of the phone to your ear, and automatically goes into speech recognition mode. It uses its microphone to listen to your voice, and decodes what you are saying by referencing not only its speech recognition database and algorithms, but also the correlation to the most frequent search terms in its search database. The phone uses GPS or cell-tower triangulation to detect its location, and uses that information as well. A search for "pizza" returns the result you most likely want: the name, location, and contact information for the three nearest pizza restaurants.
All of a sudden, we’re not using search via a keyboard and a stilted search grammar, we’re talking to and with the Web. It’s getting smart enough to understand some things (such as where we are) without us having to tell it explicitly. And that’s just the beginning.
And while some of the databases referenced by the application — such as the mapping of GPS coordinates to addresses — are "taught" to the application, others, such as the recognition of speech, are "learned" by processing large, crowdsourced data sets.
Clearly, this is a "smarter" system than what we saw even a few years ago. Coordinating speech recognition and search, search results and location, is similar to the "hand-eye" coordination the baby gradually acquires. The Web is growing up, and we are all its collective parents.
Cooperating Data Subsystems
In our original Web 2.0 analysis, we posited that the future "internet operating system" would consist of a series of interoperating data subsystems. The Google Mobile Application provides one example of how such a data-driven operating system might work.
In this case, all of the data subsystems are owned by one vendor — Google. In other cases, as with Apple’s iPhoto ‘09, which integrates Flickr and Google Maps as well as Apple’s own cloud services, an application uses cloud database services from multiple vendors.
As we first noted back in 2003, "data is the Intel Inside" of the next generation of computer applications. That is, if a company has control over a unique source of data that is required for applications to function, they will be able to extract monopoly rents from the use of that data. In particular, if a database is generated by user contribution, market leaders will see increasing returns as the size and value of their database grows more quickly than that of any new entrants.
We see the era of Web 2.0, therefore, as a race to acquire and control data assets. Some of these assets — the critical mass of seller listings on eBay, or the critical mass of