05 Jan 24
30 Jul 19
Big PDF of useful things about data engineering, data mining, workflows, etc.
11 Oct 12
19 May 11
Examples of using Elasticsearch and Protovis to produce useful, good-looking visualizations.
20 Mar 11
An overview of different techniques to extract actual content from web pages.
16 Mar 11
01 Dec 10
17 Sep 10
R is an open source implementation of the S language (S-PLUS is the commercial implementation). It is freely available and licensed under the GPL. The language simplifies many statistical computations and can be a powerful tool.
02 Sep 10
Datasets about almost everything!
05 Apr 10
A visualization of data gathered on 210 million public Facebook profiles that shows the information by location, with connections drawn between places that share friends. For example, a lot of people in LA have friends in San Francisco, so there’s a line between them.