A Closer Look At The Data

The data comes from a scrape of all the latest vacancies on the jobseekers.direct.gov.uk website, made by Martin Sullivan of Zois Limited.

Read more about his motivations for making this available on his page, The Unofficial National Jobcentre Plus Mirror.

You are free to get the entire csv file of all job vacancies in the UK from his ftp site.

Martin announced the availability of this data on the Apps section of the data.gov.uk website on 12th July 2010.

So, let me spell it out again for you: this raw data stream is totally unofficial. It comes with no warranty of fitness for purpose whatsoever, and you should bear in mind that it can be withdrawn by the data owners at any time.

The data is a csv file containing 15 columns. They are:

title,reference,location,hours,wage,work_pattern,employer,employer_ref,
pension,duration,closing_date,description,apply,added,office_code

For a fuller explanation of the contents of the original csv file, grab Martin's readme.txt.
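As a rough illustration of working with this layout, here is a minimal Python sketch that reads rows into dictionaries keyed by the column names listed above. It assumes the columns appear in exactly that order and that the file may or may not start with a header row (check the readme.txt). The function name read_vacancies and the sample row are made up for the example.

```python
import csv
import io

# Column names as listed in the text; the order is assumed to match the file.
COLUMNS = [
    "title", "reference", "location", "hours", "wage", "work_pattern",
    "employer", "employer_ref", "pension", "duration", "closing_date",
    "description", "apply", "added", "office_code",
]


def read_vacancies(stream):
    """Yield one dict per vacancy, keyed by column name (hypothetical helper)."""
    reader = csv.DictReader(stream, fieldnames=COLUMNS)
    for row in reader:
        # Skip a header row if the file turns out to have one.
        if row["title"] == "title":
            continue
        yield row


# An invented sample row, not real data from the feed.
sample = io.StringIO(
    'Cleaner,ABC/123,Guildford,16,"Meets the national minimum wage",'
    'Mon-Fri,Example Ltd,E1,,Permanent,2010-08-01,"Cleaning, light duties",'
    'Call the office,2010-07-12,GUF\n'
)

vacancies = list(read_vacancies(sample))
print(vacancies[0]["title"], vacancies[0]["office_code"])
```

For the real file you would pass an open file object instead of the sample, for instance open(path, newline=""), choosing whatever text encoding the download actually uses.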

Even a quick rummage through the raw data will show that you will likely want to do some post-processing to make it palatable for the public:

  • some fields contain a mixture of upper and lower case text; this seems to depend upon the whims of local operators
  • some fields are left empty
  • wage, for example, can contain hourly, weekly, monthly and annual rates as well as comments such as "Meets the national minimum wage"
  • dates are in the database-friendly format YYYY-MM-DD, which will likely confuse the public
  • office_code is in the format GUF for Guildford; you will need to dereference this so that the public know which office is advertising the vacancy
  • the list of JCP offices is another csv file you can download from Martin's ftp site, although you can also look them up in a Google Fusion Table or pick them from a map
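The clean-up steps listed above could be sketched in Python roughly as follows. This is only an illustration, not the code behind any particular page. The helper names, the "Not specified" placeholder and the one-entry OFFICES table are assumptions; only the GUF to Guildford mapping comes from the text, and a real table would be loaded from the JCP offices csv file.

```python
import string
from datetime import datetime

MONTHS = (
    "January", "February", "March", "April", "May", "June", "July",
    "August", "September", "October", "November", "December",
)

# Hypothetical lookup table; in practice, build it from the JCP offices file.
OFFICES = {"GUF": "Guildford"}


def tidy_case(text):
    """Soften text typed entirely in capitals; leave mixed case alone."""
    if text.isupper():
        return string.capwords(text)
    return text


def friendly_date(iso_date):
    """Turn a YYYY-MM-DD date into a form such as 12 July 2010."""
    d = datetime.strptime(iso_date, "%Y-%m-%d")
    return f"{d.day} {MONTHS[d.month - 1]} {d.year}"


def office_name(code):
    """Dereference an office code, falling back to the raw code if unknown."""
    return OFFICES.get(code, code)


def or_placeholder(text, placeholder="Not specified"):
    """Replace an empty field with a placeholder the public can read."""
    return text.strip() or placeholder


print(tidy_case("SALES ASSISTANT"))
print(friendly_date("2010-07-12"))
print(office_name("GUF"))
print(or_placeholder(""))
```

The wage field is deliberately left untouched here: because it mixes rates and free-text comments, it is probably safest to display it as written rather than try to parse it.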

It is possible to do some simple post-processing, and here is a page that shows what can be achieved. This example also demonstrates how the JCP offices are dereferenced and how they can be mashed up with Google static maps along with their contact details.
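For the map part, a static map is just an image URL with query parameters, so one way to build it is sketched below. This is an assumption about how such a page could be put together, not a description of how the example page does it. It uses the current Maps Static API endpoint, which requires an API key; the office address and the YOUR_API_KEY value are placeholders.

```python
from urllib.parse import urlencode

# Current Maps Static API endpoint; an API key is required these days.
STATIC_MAP_BASE = "https://maps.googleapis.com/maps/api/staticmap"


def static_map_url(address, api_key, zoom=14, size="400x300"):
    """Build an image URL for a small map with a marker on the office."""
    params = {
        "center": address,
        "zoom": zoom,
        "size": size,
        "markers": address,
        "key": api_key,
    }
    return STATIC_MAP_BASE + "?" + urlencode(params)


# Placeholder address and key, for illustration only.
url = static_map_url("Guildford, UK", "YOUR_API_KEY")
print(url)
```

The resulting URL can be dropped straight into the src attribute of an img tag next to the office's contact details.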

www.godalming-tc.gov.uk/localjobs.htm

If you want me to prepare a similar page-stream which you can just slot into your website daily, then get in touch on Twitter @paulgeraghty or email me directly: the name is foofoonet and my provider is gmail.com (so you should be able to put the @ in the right place, yeah?)