Information Age: News, analysis & insight for IT & business leaders

2 September 2010

The hunt for meaning

10 December 2008  

The semantic web may have yet to arrive, but semantic technology is already helping businesses to make sense of their information

The semantic web has a hallowed position among futuristic technologies, not least because its chief champion is Tim Berners-Lee, the man who invented the world wide web.

Unfortunately for Berners-Lee and his acolytes, there is little chance that the semantic revolution – whereby information on the web is supplemented with metadata that describes its meaning, thereby improving the power of search and other information management functions – will happen as quickly, as dramatically or as visibly as the rise of the web in the 1990s.

Instead, the development of the semantic web is more likely to mirror that of Web 2.0 – it will be gradual and partial. It will suit some websites to add semantic metadata to their information; it will not suit others at all.

However, in the (slightly) more self-contained world of enterprise information management, semantic technology has some more immediate applications and is already helping some organisations make sense of their oceans of unstructured data.

The umbrella heading of ‘semantic technology’ includes tools that analyse text in order to divine its meaning as well as formats and standards for codifying and integrating information on the basis of that meaning.

In the first case, there are two approaches. One is statistical: some of the meaning of a document can be retrieved by mathematically analysing the text.

An example of a business application of this approach comes from information service provider Thomson Reuters, a sophisticated adopter of semantic technology in all its forms.

“We applied some statistical semantic technology to our marketing campaigns,” explains Peter Jackson, chief scientist at Thomson Reuters Professional. “We used a document categorisation system to analyse the documents that people visit online, based on word-pairs found in the text. That analysis informs the marketing mail-outs we send them, based on what they are interested in.”
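The kind of word-pair categorisation Jackson describes can be illustrated with a minimal sketch. This is not Thomson Reuters' system: the category names, the word pairs in each profile and the function names are all hypothetical, and a production system would learn its profiles from data rather than hard-code them.

```python
from collections import Counter
import re

def word_pairs(text):
    """Count adjacent lowercase word pairs (bigrams) in a text."""
    words = re.findall(r"[a-z]+", text.lower())
    return Counter(zip(words, words[1:]))

# Hypothetical category profiles: word pairs assumed typical of each topic.
CATEGORY_PAIRS = {
    "tax law": {("tax", "code"), ("capital", "gains"), ("tax", "return")},
    "intellectual property": {("patent", "claim"), ("trade", "mark"), ("patent", "office")},
}

def categorise(text):
    """Pick the category whose word pairs occur most often, or None if none occur."""
    pairs = word_pairs(text)
    scores = {
        category: sum(pairs[p] for p in profile)
        for category, profile in CATEGORY_PAIRS.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None

def interests(visited_documents):
    """Tally categories across the documents a reader has visited."""
    return Counter(c for c in map(categorise, visited_documents) if c)

visited = [
    "The patent office rejected the patent claim.",
    "Capital gains and the tax code explained.",
    "A patent claim must be novel.",
]
profile = interests(visited)
```

A reader whose tally leans towards one category would then, in principle, receive mail-outs on that subject.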

More intriguing is the second approach, an ongoing attempt to train computers to decode the meaning of words based on linguistic principles. “The second approach is to look into our own brains and try to codify what all these words mean,” explains Hans Uszkoreit, professor of computational linguistics at Saarland University and a semantic technology luminary.

The technology available today can recognise defined concepts, such as company names, with 90% accuracy, Uszkoreit explains. It can also divine simple relationships between those concepts (such as which company sells which products). However, the more complex the relationships between the concepts, the lower the rate of recognition currently achievable.

Thomson Reuters has also applied this technology to a business problem in its legal information division. “We developed a system that used natural language processing to identify names of people and companies involved in case reports,” explains Jackson. “It means a customer can now ask for a report on all the attorneys that have represented Microsoft in court, for example. That is a huge development tool for law firms.”

This is emphatically not the same as search technology, Jackson explains. “This is more than just search; you need a system that knows, for example, that the West Bank of Jordan is not a financial institution.”
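The difference between keyword search and typed entity recognition can be shown with a toy example. The sketch below uses a simple dictionary lookup, a far cruder stand-in for the natural language processing Jackson describes; the gazetteer entries, labels and function name are hypothetical.

```python
# Hypothetical gazetteer: known phrases and the kind of thing each one names.
GAZETTEER = {
    "west bank": "LOCATION",
    "microsoft": "COMPANY",
    "bank of england": "COMPANY",
}

def tag_entities(text):
    """Tag known phrases, preferring the longest match at each position."""
    words = text.replace(",", " ").replace(".", " ").split()
    longest = max(len(key.split()) for key in GAZETTEER)
    entities, i = [], 0
    while i < len(words):
        # Try the longest candidate phrase first, then shorter ones.
        for n in range(min(longest, len(words) - i), 0, -1):
            phrase = " ".join(words[i:i + n])
            label = GAZETTEER.get(phrase.lower())
            if label:
                entities.append((phrase, label))
                i += n
                break
        else:
            i += 1  # no known phrase starts here
    return entities

found = tag_entities("Microsoft opened an office near the West Bank.")
```

A plain search for the word "bank" would match this sentence; the typed lookup records "West Bank" as a location, so a query for financial institutions can exclude it.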

Meaningful standards

Standards that have arisen to describe the semantic meaning of a given data set include the resource description framework (RDF) and the web ontology language (OWL). These provide a framework of meanings – an address or a name, to use two prosaic examples – that can be assigned to data.
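RDF expresses data as statements of the form subject, predicate, object. The sketch below mimics that shape with plain Python tuples rather than a real RDF library; every identifier in it (the `ex:` names and the `match` helper) is hypothetical and chosen only for illustration.

```python
# Each statement is a (subject, predicate, object) triple, the basic shape of RDF data.
# All identifiers below are hypothetical.
TRIPLES = [
    ("ex:customer42", "rdf:type", "ex:Person"),
    ("ex:customer42", "ex:name", "Jane Smith"),
    ("ex:customer42", "ex:address", "1 High Street, London"),
    ("ex:supplier7", "rdf:type", "ex:Company"),
    ("ex:supplier7", "ex:name", "Acme Ltd"),
]

def match(subject=None, predicate=None, obj=None):
    """Return the triples that fit a pattern; None acts as a wildcard."""
    return [
        (s, p, o)
        for (s, p, o) in TRIPLES
        if subject in (None, s) and predicate in (None, p) and obj in (None, o)
    ]

names = [o for (_, _, o) in match(predicate="ex:name")]
address = match(subject="ex:customer42", predicate="ex:address")[0][2]
```

Because each value is labelled with what it means (a name, an address), software can ask for "all names" without knowing how any one source happens to store them.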

Not only can these standards help businesses to codify their unstructured data, but they can also help in application development and integration, argues Orestis Terzidis, a director at SAP’s research campus.

“When messages pass between two applications, you have to make sure you copy the data from one input field to the correct corresponding field,” he explains. “One approach to that is to refer to an ontology such as OWL. You can ensure that in both applications the fields are defined as an address, for example, and use the ontology to direct the transfer of data.

“That will shift the integration difficulties from a technical problem to a question of the true definition of things,” he adds, which will allow greater business involvement in integration projects.
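The field-matching idea Terzidis outlines might look something like the following sketch. It is an illustration under stated assumptions, not SAP's approach: the two applications, their field names and the `ex:` concepts are hypothetical, and a plain dictionary stands in for a real ontology.

```python
# Each application declares which shared concept each of its fields represents.
# Field names and concepts are hypothetical; a real system would refer to
# terms defined in an ontology rather than to bare strings.
ORDER_APP = {"cust_name": "ex:Name", "ship_to": "ex:Address"}
BILLING_APP = {"customerName": "ex:Name", "postalAddress": "ex:Address"}

def transfer(record, source, target):
    """Copy values between applications by matching fields on shared concepts."""
    concept_to_field = {concept: field for field, concept in target.items()}
    message = {}
    for field, value in record.items():
        concept = source.get(field)
        if concept in concept_to_field:  # skip fields with no shared meaning
            message[concept_to_field[concept]] = value
    return message

order = {"cust_name": "Jane Smith", "ship_to": "1 High Street", "notes": "fragile"}
billing_record = transfer(order, ORDER_APP, BILLING_APP)
```

Neither application needs to know the other's field names. The integration question becomes whether both sides agree on what an "address" is, which is a matter for the business rather than the programmer.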

Terzidis adds that while there are many ways of introducing ‘meaning’ to the realm of computing for this kind of integration, semantic technology may be one of the most powerful.

“People have compared semantic technology to the relational database,” he says, “in that you can do almost anything that you can do with semantic technology using alternative methods. But with semantic technology you can do it in a simpler and in a more reusable way.”
