Installing matplotlib and pandas on Amazon EC2

I’m trying to add analytics to my first Twitter bot, and to do that I need the pandas and matplotlib packages in Python. Trying to install these on my Amazon EC2 instance revealed an unpleasant thing about the Amazon Linux package manager: it’s updated about as often as the decor in my local bodega, which is to say hardly ever. […]

My First Twitter Bot

Inspired by the legion of Twitter bots created by Allison Parrish to disseminate all kinds of lovely linguistic mayhem, I decided to build my own bot. While I love what she’s put together (my personal favorites are The Ephemerides and Deep Question Bot), I knew that generating text algorithmically wouldn’t be my thing. I’d played with Citibike […]

You going to enumerate that?

Following my last post on OrderedDict in Python, I came across another useful technique for working with data in Python: the enumerate function. In C++, using an index to reference data is second nature, but in Python you usually loop over the items directly, as in a for loop or list comprehension. For example:

    mylist = ['foo', 'bar', 'boo']
    for i in mylist:
        print(i)

    foo
    bar
    boo

This returns the […]
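For readers who haven't met enumerate yet, here is a minimal sketch of what it adds to the loop in the excerpt: each item comes paired with its index, so there is no need for a manual counter. The helper name indexed_lines and the "index: item" output format are my own illustration, not something from the original post.

```python
mylist = ['foo', 'bar', 'boo']


def indexed_lines(items, start=0):
    """Return 'index: item' strings, numbering items from `start`."""
    # enumerate yields (index, item) pairs; the second argument
    # sets the first index (0 by default).
    return ['{}: {}'.format(i, item) for i, item in enumerate(items, start)]


for line in indexed_lines(mylist):
    print(line)
# 0: foo
# 1: bar
# 2: boo
```

Passing a start value, for example indexed_lines(mylist, 1), numbers the items from 1 instead of 0, which is handy for human-readable output.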

Hadoop Lessons

I recently completed a very interesting class focusing mainly on learning the Hadoop framework, as well as general principles of distributed algorithm design and analysis. It was a nice balance between theory and practice, although most of the theory was in the classroom and the practice was on our own for the assignments. By the […]