Inspired by the great maps from CASA showing the distribution of Twitter languages for London and New York, I decided to take on a self-development project to try to do the same for Melbourne and, in the process, learn some more about Python, SQL and web development.
At the time of writing, the application shows geolocated tweets collected around Melbourne from November 2012 to March 2013. The application allows you to switch between a map of languages (the top 5 languages) and a map of profanity. Tweets are still being collected and the map caches will continue to be updated with new data every fortnight or so.
There are already some interesting stories in the map, such as people tweeting from the airport runways; the lack of Greek on Twitter despite it being the 2nd most spoken language at home (Census 2011); a big mix of languages at universities, the airport and in parks; people tweeting a lot on trams and trains; the lack of Chinese languages; tweeters out at sea (only confident location matches are recorded); and a number of residential houses that like to swear a lot.
A total of 58 languages have been detected so far (the map only shows the 5 with the most tweets against them). For now, Malay is leading the top 5 languages on Twitter (disregarding English), and the outer eastern suburbs are marginally ahead as the area that swears the most.
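For anyone curious how a "top 5, disregarding English" ranking can be worked out, here is a minimal sketch in Python. It is not the code behind the map: the function name, the language codes and the sample data are all made up for illustration, and it assumes each tweet has already been tagged with a language code.

```python
from collections import Counter


def top_languages(tweet_langs, n=5, exclude=("en",)):
    """Return the n most common language codes, skipping excluded ones.

    tweet_langs is an iterable of language codes, one per tweet
    (a hypothetical input format, assumed for this sketch).
    """
    counts = Counter(lang for lang in tweet_langs if lang not in exclude)
    return counts.most_common(n)


# Made-up sample data, one language code per tweet.
sample = ["en", "ms", "ms", "id", "en", "ja", "ms", "id", "es"]
print(top_languages(sample, n=2))
```

Excluding English before counting, rather than dropping it from the results afterwards, means the ranking always returns n other languages when enough are present.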
Feel free to use the comments section to ask any questions or share any observations or hypotheses about patterns within the data. It will be interesting to watch the data evolve over the next few months and see if the patterns change at all. If anyone is interested in knowing more about my learning experiences with this project, I intend to give a quick talk about it at the next Esri Australia developer meetup in Melbourne. I will put my slides up as a separate blog post when that happens.
With web maps getting all the limelight these days, I thought I would put together a static map. Click for a HQ version.