In this episode of Future-Proof Your Career, we speak to Caroline Keep, a data scientist, a teacher, a maker, and a researcher in machine learning. She is the recipient of multiple awards, including the Times Educational Supplement teacher award, and a founder of Liverpool Makerfest.
We spoke to Caroline about how you extract meaning from data, and how we can all be more engaged in the effort to decipher the world around us.
Here’s what we learned.
Data is the real world, quantified
Don’t think of data as just endless spreadsheets and numbers. It’s a representation of the real world and the things that matter. Understanding the data is a way to understand the world.
Understanding data is a process
Caroline talked about multiple steps in the ‘data cycle’:
- Start with discovery: play with the data at your disposal to get a feel for it
- Create a hypothesis: what are you trying to test?
- Discuss your idea with other people: gather perspectives and check your reasoning
- Clean your data: the real world is messy and full of bias and noise
- Test your idea: does your hypothesis hold true?
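As a rough illustration of the discovery, cleaning and testing steps, here is a minimal Python sketch. Everything in it is an assumption made for the example: the room names, the readings, the plausible temperature range and the helper functions are all hypothetical, and comparing two averages is a stand-in for a proper statistical test rather than a method Caroline described.

```python
from statistics import mean, median

# Hypothetical temperature readings in °C from two rooms. None marks a
# missed reading and 85.0 is an implausible sensor glitch.
readings = {
    "kitchen": [21.5, 22.0, None, 23.1, 85.0, 22.4],
    "bedroom": [18.9, 19.2, 19.0, None, 18.7, 19.4],
}


def discover(values):
    """Discovery: a quick look at count, missing values and range."""
    present = [v for v in values if v is not None]
    return {
        "count": len(values),
        "missing": len(values) - len(present),
        "min": min(present),
        "max": max(present),
        "median": median(present),
    }


def clean(values, low=-10.0, high=50.0):
    """Cleaning: drop missing readings and values outside an assumed plausible range."""
    return [v for v in values if v is not None and low <= v <= high]


def hypothesis_holds(a, b):
    """Test: is the average of a higher than the average of b?"""
    return mean(a) > mean(b)


summary = {room: discover(vals) for room, vals in readings.items()}
cleaned = {room: clean(vals) for room, vals in readings.items()}
kitchen_warmer = hypothesis_holds(cleaned["kitchen"], cleaned["bedroom"])

print(summary)
print("Kitchen warmer than bedroom on average:", kitchen_warmer)
```

Choosing the plausible range in `clean` is a judgement call about what counts as a believable temperature, so it depends on knowing the setting the readings came from.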
Build domain knowledge
Understanding the space you’re exploring is critical because it gives you a reference point. Otherwise you won’t know if the results you find are nonsense!
If the data you want doesn’t exist, you can get it
There are lots of sources of interesting data, but the Internet of Things makes it cheaper and easier than ever to collect data that doesn’t yet exist. Whether you want to track temperature, movement, light, pollution or anything else, simple sensors and cheap computers like the Raspberry Pi allow anyone to experiment (see links below).
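To give a feel for how little code a simple data logger needs, here is a minimal Python sketch that writes timestamped readings to CSV. It makes several assumptions: `read_temperature` is a hypothetical stand-in that returns made-up values rather than talking to real hardware, the column names are invented for the example, and the output goes to an in-memory buffer so the sketch runs anywhere.

```python
import csv
import io
import random
from datetime import datetime, timezone


def read_temperature():
    """Hypothetical stand-in for a real sensor read; returns a made-up value in °C."""
    return round(random.uniform(18.0, 24.0), 1)


def log_readings(read_sensor, out, samples):
    """Write `samples` timestamped readings to `out` as CSV rows."""
    writer = csv.writer(out)
    writer.writerow(["timestamp", "temperature_c"])
    for _ in range(samples):
        writer.writerow([datetime.now(timezone.utc).isoformat(), read_sensor()])


# In-memory buffer for the example; on a device this could be a file
# opened with open("readings.csv", "w", newline="").
buffer = io.StringIO()
log_readings(read_temperature, buffer, samples=3)
print(buffer.getvalue())
```

On a real device you would replace `read_temperature` with a call to whatever library your sensor uses, and probably pause between readings rather than taking them back to back.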
Caroline referenced some great resources and projects, including:
- Kaggle: a data science community - https://www.kaggle.com/
- Node-RED: a drag-and-drop IoT platform - https://nodered.org/
- Kettle Companion: A connected kettle that helps carers keep an eye on vulnerable people - https://kettlecompanion.com/
- RStudio: software for data science - https://posit.co/products/open-source/rstudio/
- Python: a powerful but accessible programming language - https://www.python.org/
- Jupyter Notebook: an interactive, browser-based notebook for code and data - https://jupyter.org/
Future-Proof Your Career
Welcome to Future-Proof Your Career, your guide to the most important skills for a long, successful working life. This is a special season of the Talk About Tomorrow podcast, exploring in depth the idea of the Three Cs: three skill groups that are critical to success, in a business or as an entrepreneur. They are the ability to curate information, create new things, and communicate ideas. In each episode we explore a facet of one of these skills, alongside a guest.
My name is Tom Cheesewright. I’m an applied futurist advising organisations around the globe on how to see and prepare for the future. Alongside me is my co-host Katharine McNamara, communications expert extraordinaire.