You are reading a single comment by @chez_jay and its replies. Click here to read the full conversation.
  • I received an email today which included a number of the usual clichés, but threw in a number of gems I had not had inflicted on me before: "data lake", "next generation influencing", "the bringing together of separately siloed expertise to drive greater impact", "leveraging our digital touch points".
    I feel quite sick.

  • Sometimes novel concepts do need a name and perhaps 'data lake' is one of these. It means a great big... er... pool of all an organisation's data, but the important bit is that it contains unstructured data as well as structured data.

    As opposed to 'data warehouse', or even 'great big database' which imply an overall structure. I'm reminded of EU food surpluses being described as 'butter mountains' and 'milk lakes'.

    Reading Wikipedia to check where it originated, I'm amused by:

    A data swamp is a deteriorated data lake, that is inaccessible to its intended users and provides little value.

  • Having just edited a 15,000-word document on data standards and schemas, I'm thinking that I should go back and drop these in.

  • We use the term like that as it is important to differentiate between data in a native format and data we have integrated. you could of course call it raw vs standardised tho.

  • I had to google that a few months ago.

    Big data team >>>

About

Avatar for chez_jay @chez_jay started