Storobia Archive
We need are tools that can help us locate relevant web documents more efficently and extract information from them automatically. For this to happen we need a method for those tools to better "understand" a web page. Enter the semantic web.
"a web of data that can be processed directly and indirectly by machines"
The idea is that all sorts of documents - not just web pages - can be analysed automatically and relevant data extracted. This would allow greater personal productivity, internet data mining, better communication between suppliers and customers, etc. Semantic blog software could even make the blogosphere less of a labyrinth!
One way of achieving this would be to extend existing data and systems via some form of "metadata": data about data.
|
Key to making the semantic web a reality is interoperability, not just at the XML level but at the higher levels of OWL and ontology. So where should should this be defined?
Many people, especially those in the academic world, favour a centralised definition of a standard ontology to be used by everyone. That would be technically desirable and make the realisation of the semantic web far simpler.
Unfortunately it's also impractical. Most of the information on the web today is not produced by academics or technologists. The idea of bloggers using a centrally defined ontology is unrealistic.
There's also the issue that the web already contains a huge amount of useful semantic data in the form of user defined "tags". This "folksonomy" is unstructured and riddled with contradictions and holes, yet also extremely powerful. It would be foolish to throw it away.
So the challenge facing the semantic web is to somehow bridge the gap between the informal folksonomy of internet users and the more rigorous ontology required for automated information processing. That's not going to be an easy thing to do. If we do achieve it then it might really be "Web 3.0".