Via Data Mining, Scott Nowson has published his thesis “The Language of Weblogs: A study of genre and individual differences.” I’ll admit to not having read it all yet (it’s 300 pages) but unsurprisingly:
The study concludes by confirming that both gender and personality are projected by language in blogs; furthermore, approaches which take the context of language features into account can be used to detect more variation than those which do not.
It’s unsurprising as one of the defining characteristic of a blog has tended to be that personality comes through, which usually reflects gender. In fact, I’d say the ability to allow yourself to project a personality is a pre-reuqisite for a good blogger, as illustrated in wurk.net‘s questioning of the major blog networks about what they are looking for. The common elements that came through were passion, personality and great writing skills (plus staying power – you have to be able to do this consistently).
Without personality, without putting some of yourself out there, it’s just neutral reporting, and there are plenty of ‘official’ sources for that.
Update: Scott commented about the meaning of personality: “By personality I am referring specifically to the traits of the five factor model (Neuroticism, Extraversion, Openness, Agreeableness and Conscientiousness), which I’ve shown can be projected to varying degrees by language.” His report which I’ve started reading, drives down into the components of personality that are measurable opposed to my assessment of surface personality which is the sum of them. So whilst I can make a snap judgement based on my perceptions, he’s used tools to delve into the underlying traits that are projected throught he language.