So, I imagine some of you have seen wordclouds. For the 0.1% of you haven't, they are visual depictions of word frequencies in a speech, block of text, etc. They're all the rage for corporate communications - "look at us, we're hip! we can move beyond powerpoint templates and start really communicating." (sorry, if you sense bitterness - we may have seen a few yesterday).
Anyway, it got me thinking about what a CGB word cloud would look like. First of all, I imagined the DBD would have 1000000% more inanity than a front page post. For that reason, obviously, I chose to focus on the DBD.
After I made the first rev, I realized that, in addition to filtering out common English words, there are other words repeated on CGB that skew the results: May, 16, PDT, reply, actions, etc (basically, what's on the bottom of each post).
Long story short, after flexing my ctrl+h muscles, here's the CGB word cloud. Pretty cool, no?
CGB: what are your key takeways?