Posts Tagged 'xml'

Flickr Popular Tags page parsing using Java – update

In my last post on parsing Flickr's popular tags, I described converting the Flickr page's HTML to XML (using Tidy on the command line) and then using the Java XML APIs to build a Document that I could traverse.
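For reference, the second half of that earlier approach (loading already-tidied XML into a Document and walking it) can be sketched with the standard JAXP classes alone. This is a minimal illustration, not the code from the previous post: the class name `TagPageSketch`, the helper names, and the sample markup are all made up for the example, and the real Flickr page structure is not assumed here.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;

// Hypothetical sketch: parse well-formed XML (such as Tidy's output) and list link texts.
class TagPageSketch {

    // Build a DOM Document from an XML string using the standard JAXP parser.
    static Document parse(String xml) {
        try {
            DocumentBuilder builder =
                    DocumentBuilderFactory.newInstance().newDocumentBuilder();
            return builder.parse(
                    new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
        } catch (Exception e) {
            // Wrap checked parser/IO exceptions so callers get one unchecked failure type.
            throw new IllegalStateException("Could not parse input as XML", e);
        }
    }

    // Collect the trimmed text content of every <a> element in document order.
    static List<String> linkTexts(Document doc) {
        NodeList anchors = doc.getElementsByTagName("a");
        List<String> texts = new ArrayList<>();
        for (int i = 0; i < anchors.getLength(); i++) {
            texts.add(anchors.item(i).getTextContent().trim());
        }
        return texts;
    }

    public static void main(String[] args) {
        // Invented sample markup standing in for a tidied page fragment.
        String xml = "<html><body><p>"
                + "<a href=\"/photos/tags/sunset/\">sunset</a> "
                + "<a href=\"/photos/tags/beach/\">beach</a>"
                + "</p></body></html>";
        System.out.println(linkTexts(parse(xml))); // prints [sunset, beach]
    }
}
```

The sketch only works on input that is already well-formed XML, which is why the HTML had to be run through Tidy first.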

I have since found sample code showing how to use JTidy, the Java port of Tidy, to parse HTML and return a Document directly, so I have changed my code as follows…




My Work Blog

This is my code blog, where I post any interesting code I develop during the course of my work. My research blog is on Blogger, and that is where I document the progress of my research work at Columbia University. Blogger's source code formatting is not up to par, but WordPress's is excellent, which is why my code is posted on this blog.