Twanguages: a language census of Twitter
Written by Jim DeLaHunt
Friday, 31 July 2009
What "twanguage" do you "tweet"? Twitter, the buzzing conversation of brief web and SMS messages, exploded into wide use in 2009. But just how wide? To how many countries has it spread? And into which languages? I'm aiming to find out.
I've started a project named "Twanguages", a language census of a sample of Twitter's global traffic. I'm curious: which are the top languages? Are #hashtags localised? How does language correlate with location? And which Unicode character is the most rarely used?
I presented our initial results at the 33rd Internationalization and Unicode Conference (IUC33), held in San Jose, California, on October 14-16, 2009. Take a look at the "Twanguages: a language census for Twitter" IUC33 session page for the paper and slides.
Follow @twanguages on Twitter for news.
Last updated: Friday, 16 October 2009