Information Age: News, analysis & insight for IT & business leaders

 
Information Age Blog

The social science of sentiment

30 April 2010  

Pete Swabey

In his book ‘The Shock of the Old’, David Edgerton observes that is not always the newest, shiniest technologies that cause the greatest upheaval.

This year’s UK general election is a case in point: the great innovation in media coverage has been televised debates between party leaders (first introduced in the US in 1960) and not, as many had predicted, the use of social media.

Nevertheless, social networks have provided an unprecedented insight into what voters are saying, and a number of pundits have used sentiment analysis technology on social media content to gauge public feeling.

This is a timely test for a technology that has a clear business application – just as political parties can track what people are saying about their policies and personalities, so too can businesses follow “the conversation” around their brands and products.

"We’re trying to treat the language used in microblogs as more or less its own language"
Niklas de Besche, Meltwater Buzz

But how reliable are social media sentiment analysis technologies? Can software really uncover the sentiment behind the context-dependent, self-referential and often highly-ironic language that is used on social networks?

Yesterday I spoke to Niklas de Besche, executive director at social media monitoring provider Meltwater Buzz, which has been tracking sentiment towards the UK’s political parties by analysing blog posts, online video submissions and ‘tweets’. The fruit of this analysis – as of 29th April 2010 – can be seen below.

De Besche explained how Meltwater Buzz has developed an algorithm that analyses social media content and rates it as positive, negative or neutral in sentiment. The company uses Amazon.com’s Mechnical Turk service, which allows customers to ‘crowdsource’ human operatives to perform small tasks, to train that algorithm. This training is an ongoing work in progress, de Besche explained, because the language of the social web is constantly evolving.

He admitted, though, that some forms of social media content are easier to assess than others. Blog posts, for example, can be analysed in detail. The short messages users share in such ‘microblogging’ services as Twitter are far harder to penetrate given their brevity, and analysing the sentiment of tweets has called for its own approach.

"We’re trying to treat the language used in microblogs as more or less its own language," de Besche said.

The company claims “confidently” that its service is 80% accurate. This claim is based on the fact that the algorithm returns a percentage assessment of sentiment (e.g. 65% positive or 42% negative), and only content that returns a percentage of over 20% is categorised as positive or negative; otherwise it is deemed neutral.

"This reduces our accuracy to 80%," said de Besche, "but in turn we are conservative and highly accurate on the posts that are either marked positive or negative."

This means, however, that when Meltwater Buzz categorises content as neutral, it could be that the algorithm just doesn’t “understand” the sentiment.

De Besche argued that this ambiguity comes out in the wash when large quantities of content is analysed. And indeed, only when a large quantity of content is available is automated sentiment analysis required.

Still, social media sentiment analysis remains an inexact science. I wonder whether the technology in its present form can really be used to discover underlying sentiment that would otherwise be invisible, or whether it simply provides seemingly-scientific corroboration for what marketers and policy-makers in tune with their audience already know.

De Besche, however, is positive that accuracy will improve as the technology develops, or in other words that there is no logical reason why software can’t one day understand the sentiment behind all human communication. “We are still at the very early days of what is possible.”


Comments  [1]

Philip Sheldrake
Friday 30th April 2010

If I may be so bold, that is the most confusing and foggiest explanation I have heard in a while regarding the estimation of the accuracy of sentiment analysis.

I think I'm right in saying that Turksters (eg, night watchmen trying to supplement their hourly wage) are used to rate social media contributions as -ve, neutral or +ve, and this is compared to the algorithm's conclusions.

Well, if you speak with Media Tenor, Integrasco, Brandwatch, Alterian, Collective Intellect, Crimson Hexagon, Radian 6, Nielsen, Dow Jones, SAS, et al, you'll find their approaches to the manual assessment of sentiment are restricted to trained specialists rather than warehouse security guards. And the very best automated (read "expensive") services classifying sentiment as -ve, neutral and +ve achieve c. 65%.

To my knowledge, only SAS has claimed slightly higher accuracy during their recent webcast announcing their entry into the social analytics market. Katie Delahaye Paine took part in the webcast, so perhaps she can vouch for their claims. Katie?

Lastly, mathematically, a banding / tolerance of 20% does not an accuracy of 80% (100-20) make. Whoops.

I haven't got my hands on the new service from Meltwater since they stopped white labelling SM2, but I look forward to my invitation! That's if I'm still invited :-)

Report this comment »

People who read this also read...

 

White Papers

Read article

Developing ios Solutions for Business

Whitepapers

Quickly develop and deploy custom iPad and iPhone solutions. With FileMaker Pro, iPad and iPhone solutions can be prototyped and completed in hours or days versus weeks or months. No iOS application programming or design experience is required.

Read article

IDC Spotlight: Access Control and Certification

Whitepapers

Read this brief for best practices on managing user access compliance.

Read article

GPS World

Whitepapers

Is the PREMIER global media brand serving the exploding world of positioning and navigation for OEM, commercial and consumer applications.

More

Latest Posts

Your brain on Twitter

New science reveals that older brains may find social networking services distracting, but also that there are similarities between Twitter and the brain itself

Social judgment

Has the advent of the social network damaged the authority of Britain's legal system?

London’s tech future lies in the City

Playing on London's strengths – namely its reputation as global financial capital – would be the best way to support its technology industry

Reassessing Russia

Parallels CEO Sergei Beloussov sets the record straight on Russia's high tech potential

SpotlightOnSpend reacts to open criticism

Spend analysis software vendor Spikes Cavell responds to a blogger's excoriating analysis of its open data portal

Advertisement
Video ORSYP Survey Surveys
div class="banner">