Real Time Video Analytics Using Hadoop and HBase

25 Jun

I recently attended the HBaseCon 2013 conference and talked about building a real-time video analytics platform using Hadoop and HBase.

My presentation is embedded below:

 

 

More information about HBaseCon 2013 is available here: http://blog.cloudera.com/blog/2013/06/the-hbasecon-2013-afterglow/

Ubuntu 12.04 on a Nano PC nT-1200

9 May

I recently set up a fully working Ubuntu 12.04.2 32-bit installation (yes, with a working display / video card!) on a Foxconn nT-1200 Nano PC (Intel Atom D2500 with Intel GMA 3600 video card), fitted with an OCZ 120 GB SSD and 4 GB of G-Skill RAM.

Continue reading

PHP: Compare time for summation and sorting across large runs

7 May

A little while back, I wrote a PHP script that generates a random set of large arrays and then both sums them across one dimension and sorts them.

Continue reading

s3cmd timeout problems moving large files on S3 (> 250MB)

19 Mar

Recently we had a problem with s3cmd giving errors while moving or copying large files across buckets in S3. This had worked perfectly before, but our file sizes had been increasing.

Continue reading

ServerDensity: creating a new device and getting its access key via the API

31 Jan

I’ve recently been setting up devices that need to have ServerDensity monitoring automatically added to them when I start them up. Setting up the ServerDensity agent itself can be automated through shell commands, but you also need to set the API key in the config. I use a Python file to create a new device in ServerDensity and output the API/agent key for the newly created device.

Continue reading

Gentle Introduction to HBase II – Amazon Elastic Map Reduce (EMR) HBase Cluster

26 Nov

Below is a WYSIWYG tutorial on how to set up an HBase cluster (and Thrift) on Amazon’s Elastic Map Reduce.

Continue reading

Gentle Introduction to HBase Part I – Data Structure

13 Nov

In this post (hopefully the first of several), I hope to provide a gentle introduction to HBase, since I never had one myself! This post focuses on HBase’s data structure, but I hope to write more posts introducing HBase programming using a combination of HBase on Amazon’s Elastic Map Reduce (I like to call it Amazon’s HAAS, or HBase-as-a-Service) and Python with the HappyBase library, which offers a really easy way to work with the HBase Thrift interface.

Continue reading

Using Amazon/AWS Glacier with Python boto

16 Oct

Amazon’s Glacier is a fairly new AWS service for low-cost storage and backups, but there is little documentation on how to actually use it.

Continue reading

s3cmd IAM problem and solving it

19 Sep

I had a problem with the popular s3cmd application and IAM permissions, and was able to solve it.

Continue reading

Disabling author name on WPTouch / WordPress

24 Jul

WPTouch lets you disable/hide the author name on the main index page through its options, but how do you do so on the post pages?

Continue reading