Twanguages: a language census of Twitter
Written by Jim DeLaHunt   
Friday, 31 July 2009

What "twanguage" do you "tweet"? Twitter, the buzzing conversation of brief web and SMS messages, exploded into wide use in 2009. But just how wide? To how many countries has it spread? And into which languages? I'm aiming to find out.

I've started a project named "Twanguages", a language census of a sample of Twitter's global traffic. I'm curious: which are the top languages? Are #hashtags localised? How does language correlate with location?  And which Unicode character is the most rarely used?

I presented our initial results at the 33rd Internationalization and Unicode Conference (IUC33), held in San Jose, California, on October 14-16, 2009. Take a look at the "Twanguages: a language census for Twitter" IUC33 session page for the paper and slides.

Follow @twanguages on Twitter for news.

Stay tuned!

Last Updated ( Friday, 16 October 2009 )