Browsing All Posts filed under »Datasets«

Featured Dataset: British National Bibliography

September 1, 2011 by


Today, I’ll be featuring a dataset encompassing a complete catalog of publishing activity in the UK: the British National Bibliography. The national bibliography records the publishing activity of the United Kingdom and the Republic of Ireland and as such is a measure of their intellectual output. This has traditionally included printed publications and more recently […]

Featured Dataset: ChEMBL-RDF, with Egon Willighagen

August 23, 2011 by


Published in Kasabi by Egon Willighagen, ChEMBL is a conversion into Linked Data of an important dataset of chemicals from the European Bioinformatics Institute. I had a conversation with Egon—who is involved in postdoctoral research associate at the Karolinska Institutet in Stockholm—about this set, and asked him what it contains: The database consist basically of […]

Featured Dataset: Prelinger Archives

August 9, 2011 by


Following on a bit from the Lego featured last week, I wanted to take a look at some media data. So, today I’d like to feature a very interesting set of data called the Prelinger Archives. Rick Prelinger, back in 1982, began a collection of rather diverse films under the broad category of being “ephemeral”. […]

Featured Dataset: Bricklink

August 4, 2011 by


Featured datasets usually end with an emphasis on what you can build on top of them, so I’m irretrievably drawn to feature the Bricklink set today—a set you can build on top of, and features data built for building… er… with Lego. Bricklink is a Lego marketplace. Essentially it is the EBay for Lego, where […]

Featured Dataset: BBC Music

August 2, 2011 by


Featured Dataset: BBC Music Our next developer event for Kasabi will be based on culture, so I thought it’d be a good idea to feature a musical dataset today. BBC Music is a set based on the BBC Music website, and contains a comprehensive collection of information about musical talent covered by BBC programs—from radio […]

Featured Dataset: IATI

July 29, 2011 by


As mentioned earlier, we held our first themed hackday this week, and I’m happy to be able to feature one of the datasets published by a hacker during the event. I’m very excited about this dataset. For a start, it was published during a hack event, and the time-scale for getting them out is pretty […]

Featured Datasets: NHS Organizations and Performance Data

July 26, 2011 by


Tomorrow we’re hosting our first themed hack day on the topic of Open Government Data. For the event I wanted to gather some new datasets that could be used to create some new interesting applications. I decided to focus on the NHS and have just published three new datasets: NHS Organization — a dataset containing the […]

Featured Dataset: MTA New York City Transit

July 22, 2011 by


Today’s featured dataset is one full of transit data from New York city, which is handy, because the Metropolitan Transportation Agency is looking for apps to be built on their data! Haveing a read of Programmable Web’s blog post, it seems that the MTA is offering a $15k prize for the best app, and they have […]

Featured Dataset: Foodista

July 14, 2011 by


It’s virtually impossible to feature a dataset about food without immediately thinking of applications (and puns involving forking, mashups of spuds etc…). Today’s ‘set is Foodista, which takes data from the community wiki as a crawl of food, tools and cooking techniques from the site. From the Developer’s Docs, the purpose of this dataset is […]

Featured Dataset: English Heritage

July 12, 2011 by


The dataset English Heritage is full of information covering one of my favourite topics: historic locations. The set comprises metadata for around 400,000 places the UK Government’s adviser on the historic environment (English Heritage) records. Ian Davis published this conversion of data available directly from English Heritage, interlinked with location information from the Ordnance Survey. […]


Get every new post delivered to your Inbox.