Short Course Slides
Download file 14 MB
12 years delivering excellence
Join a global community
Toolkits, content & more
Big Data describes the huge volume of data – both structured and unstructured – that inundates a business on a day-to-day basis. It involves datasets that are too large to store on one machine and it requires multiple computers to work together to process the volume of data. In turn, we can use the data to predict better results and forecasts than would be possible with smaller data sets.
Now, let’s think about where Big Data has come from, and some of the key associated technologies around it. We can now mesh different datasets to create core insights and understanding about our customers and environment better than we’ve could before - it’s the coming together of a number of different tools and techniques. Some of the key technologies in this area are around predictive analytics - being able to use the data to understand how things may happen in the future.
Some technologies associated with Big Data include:
Predictive analytics is software and/or hardware solutions that enable firms to discover, evaluate, optimize, and deploy predictive models by analyzing Big Data sources to improve business performance or mitigate risk. Effectively, this is about making core predictions based on previous behavior to evaluate what we think is the likely outcome for the future.
This is very important when thinking about buyer behavior, for example, in a marketing world. We need to be intuitive about what exactly is going to happen, and understand the variables that have changed from the past in order to make certain assumptions about what might happen in the future. It’s also worthwhile using common sense and intuition when examining the data to add to the benefit of your experience of the metrics.
You can think of Big Data technologies as being like a jigsaw puzzle. It’s all about meshing technologies together. There’s an element of overlap, as well as linkages, because all these technologies need to come together to create a robust Big Data strategy. And if you don’t have these linkages, your picture might not be complete.
[18.104.22.168] When thinking about Big Data, you can use a number of different approaches. One very useful framework involves the four Vs:
Big Data, by its very nature, is a larger number of data sources, and can provide greater volumes of insights on individual customers. It’s the aggregating of the information that creates the richness within the dataset. In other words, it’s not just a single dataset. It’s about meshing a number of different data sources.
Big Data includes a wider variety of data types and sources. For example, the data could be a rich combination of structured, semi-structured, and unstructured data.
Veracity refers to the need to have robust data. Essentially, you must have some form of integrity within your data. Make it trusted, make it clean, and make it de-duplicated, because any anomalies that you have within it will eventually lead to inaccuracies that you might not notice later on. Remember, robust inputs lead to robust outputs, and this applies to Big Data as well.
This is the frequency of new data being entered into the set. It’s always better to work with fresh data and data sets with high levels of data velocity.Back to Top
Jack Preston is a Data Scientist working within marketing analytics, with a particular focus on strategic customer loyalty. Jack has experience working in both small-scale startups and large corporates, including dunnhumby and Notonthehighstreet. He also holds an MSc in Business Analytics from UCL where he graduated with distinction.
ABOUT THIS DIGITAL MARKETING MODULE
This short course covers the principles of analytics and demonstrates techniques and useful tools that you can use to develop and refine your knowledge of data analytics.
You will learn:
Approximate learning time: 3 hours