Has the Semantic Web Arrived?
This is a question that brings up memories of Passover and the story of the four sons: the wise son, the wicked son, the simple son and the one too young to even ask a question. When pondering the reality and complexities of the semantic web, until becoming a technology entrepreneur I clearly was the “son too young” - I didn’t even know enough to ask a question. But now, after 2+ years working with NLP, data structures, online reputation, etc., I think I’ve finally graduated to being the simple son. And boy, does that feel good. For better or worse, I’ve got some reasonably well-formed thoughts on the topic. I’d say that there is the precise Tim Berners-Lee conception of the semantic web, and my less machine-driven framework for how people can enjoy Web 3.0 services without the full-blown technological breakthroughs associated with Tim’s vision. Per Wikipedia, the layperson’s definition of the semantic web is:
…an evolving extension of the World Wide Web in which web content can be expressed not only in natural language, but also in a form that can be read and used by software agents, thus permitting them to find, share and integrate information more easily. It derives from W3C director Tim Berners-Lee’s vision of the Web as a universal medium for data, information, and knowledge exchange.
At its core, the semantic web comprises a philosophy, a set of design principles, collaborative working groups,
and a variety of enabling technologies. Some elements of the semantic
web are expressed as prospective future possibilities that have yet to
be implemented or realized. Other elements of the semantic web are expressed in formal specifications. Some of these include Resource Description Framework (RDF), a variety of data interchange formats (e.g. RDF/XML, N3, Turtle, N-Triples), and notations such as RDF Schema (RDFS) and the Web Ontology Language (OWL). All of which are intended to provide a formal description of concepts, terms, and relationships within a given knowledge domain.
The essence of which is manifest in Tim Berners-Lee original vision of the semantic web back in 1999:
I have a dream for the Web [in which computers] become capable of
analyzing all the data on the Web – the content, links, and
transactions between people and computers. A ‘Semantic Web’, which
should make this possible, has yet to emerge, but when it does, the
day-to-day mechanisms of trade, bureaucracy and our daily lives will be
handled by machines talking to machines. The ‘intelligent agents’
people have touted for ages will finally materialize.
I personally think Tim is right on. His is the right vision that will take shape and change the way we navigate and extract value from the Internet, over time. That said, are we there yet? If one believes what is coming out of Radar Networks, the answer is: almost. Business 2.0 just ran a very interesting story on Nova Spivak’s company and the ground their covering to make Tim’s vision of the semantic web a reality.
For all the wonders that today’s Web can deliver to your fingertips
— the Norwegian word for ice cream, a seat on the next flight to
Paris, the best price for a Clash CD — it has a fundamental flaw.It’s
basically a compendium of billions of text documents designed to be
read by humans. You can search it for keywords, but the results aren’t
much use until you sort through them to find the page that has the info
you want.To take the Web to the next level — to move from Web
2.0 to Web 3.0 — the information in those documents will have to be
turned into data that a machine can read and evaluate on its own. Only
then will computers be able to take over tasks we now do by hand.********************
Think of it as the difference between two dimensions and three
dimensions. “People will see the Web start to become smarter,” Spivack
says. “Eventually it will have some reasoning capabilities built into
it.”********************
The Web just isn’t very smart yet; one webpage is the same as any
other. It might have a higher Google ranking, but there’s no
distinction based on meaning.The semantic Web in the
Berners-Lee vision acts more like a series of connected databases,
where all information resides in a structured form. Within that
structure is a layer of description that adds meaning that the computer
can understand. (“Semantics” is the branch of linguistics concerned
with meaning.)********************
“It’s in the combination that the real power of this comes out,”
Coates says. “The mashup is an early example of the Web that is to
come. Semantic technologies have not taken off as much as we’d hoped
because people are finding more utility in other Web 2.0 technologies
at the moment. The goal is the most important thing: reusable,
repurposable, and reconnectable data. How we get there is not as
important.”The shift to a semantic Web is still in its very
early days. Spivack envisions a time line of five to seven years. But
the shift is clearly under way. James Hendler, one of the coauthors of
that seminal Scientific American article, sees the same dynamics he did when the Web was first forming.
Right. 5-7 years. While it seems to me that the semantic web is a worthy mission, just like AI, we need to keep things in perspective. Small wins are key. Given the complexity, mass and dynamism of unstructured data populating the Web, it is hard to see a clear path to the the ultimate solution, one that is robust enough to be developed today yet applicable to tomorrow’s Internet. Suffice it to say, Nova and his colleagues have bitten off a lot, but if anyone can crack the case they surely can. I just think that advances in the semantic web will be very incremental, and that big strides in usability and power will be made through human/social means: tagging, recommendations, communities, better measures and applications of reputation, etc. We have already seen tremendous advances in ease-of-use and navigibility of the Web simply through these tools, tools that are available today and not dependent upon the “holy grail” of bleeding-edge algorithms, software and hardware. I’m not saying that Radar Networks isn’t awesome and that they won’t get there; I’m just not holding my breath waiting. I want to get more value out the Internet today and tomorrow, and my guess is that I’ll continue to get more value today through simple, community-based tools and technologies than the semantic web. Just a hunch.
——-
——-
COMMENT:
AUTHOR: Christopher Smith
EMAIL: csmith@ibanknet.com
DATE: 07/05/2007 09:40:46 AM
Hi Roger,
I always enjoy reading your blog.
I collect and analyze bank financials. Two years ago, I was intrigued by the semantic web, but realized it wasn’t “prime time”. At the same time Google announced Google Base. This was the closest practical solution at the time.
At iBanknet.com we are using Google Base to power our search. So if you go to our search, and select one of our Google Bases such as Bank Call Reports reports, you can do a search for the leading banks in NY sorted by assets.
While not the semantic web in the purest form, it’s real, it exists, and it gives you a peek into the potential of the semantic web.
Best Regards,
Chris
——-
COMMENT:
AUTHOR: Roger
EMAIL: roger@monitor110.com
URL: http://www.informationarbitrage.com
DATE: 07/05/2007 09:50:34 AM
Chris, thanks for the comment. Exactly my point. Thanks for the concrete example. Best, Roger
——-
COMMENT:
AUTHOR: Adrian Heilbut
EMAIL: aheilbut@gmail.com
URL: http://www.nonstationary.com
DATE: 07/05/2007 06:12:04 PM
Freebase http://www.freebase.com from Metaweb http://www.metaweb.com looks as if it has the potential to be another important player in the emergence of a more semantic web, and Freebase is already out in an alpha. There has also been some activity around extending wiki platforms with semantic types. http://ontoworld.org
I agree that the success of these efforts will depend very much on the extent to which they enable the gradual, incremental evolution of the tools and information flows that people depend on today. In particular, the technologies ought to be able to operate on the open web, and not only within the walled gardens of a single vendors’ world like Freebase.
One more caveat - I think people need to be careful not to start using ‘semantic web’ technologies to do things that are perfectly well suited to representation by relational data models, which will always retain enormous power. Many of the promises of the ‘semantic web’ can start to sound awfully similar to hype around AI and knowledge representation technologies like Cyc http://www.cyc.com, which go back to the 80s or even further to the old days of hierarchical and network databases.
——-
COMMENT:
AUTHOR: mehnaz
EMAIL: mehnaz@gmail.com
URL:
DATE: 07/12/2007 07:56:01 AM
Silicon Valley Entrepreneur & Strategy Consultant Sramana Mitra, defines web 3.0.Sramana writes a follow-on synthesis explaining why the Semantic Web can only be implemented in a Contextual Domain. Thus, Web 3.0 according to her, is a Verticalized, Contextualized, Personalized Web.
Links: http://sramanamitra.com/blog/1165
http://sramanamitra.com/blog/572
http://sramanamitra.com/blog/775
——-
————