Category Archives: Uncategorized

Data science on the command line

There are just a few linux command line tools that I use many times a day! less for checking the contents of files, verifying you’ve got the right output or input format, quickly examining data grep for searching within files, especially … Continue reading

Posted in Uncategorized | Leave a comment

Best books for pre-school girls

My 3.5 year old daughter loves reading books, but it can sometimes be hard to find great stories. Traditional fairytales are full of wicked stepmothers, spineless fathers and helpless princesses, while other stories are full of brave boys and girls who dream … Continue reading

Posted in Uncategorized | Tagged , , | 1 Comment

Visualising waveforms with Python and Bokeh

I’ve recently been playing with the Bokeh Python library for visualisation. One thing I end up trying to do more often than I should is trying to draw waveforms for talks and presentations. Turns out that Bokeh is great for this! Here’s a … Continue reading

Posted in Uncategorized | 1 Comment

Language Models

Language models assign probability to sequences of words. They have many applications, including machine translation, smartphone typing, information retrieval, though I’m familiar with them through speech recognition. For many years, the probabilities of N-Grams – that’s words or sequences of … Continue reading

Posted in Uncategorized | Leave a comment

Busy busy busy!

Back to work after maternity leave doesn’t leave me much time to keep the blog up to date! But, I’ve also been busy on a couple of other articles. The first, over at Statistics Views, is an introduction to the role … Continue reading

Posted in Uncategorized | Leave a comment

Why all the hype about big data?

The amount of data in the world is doubling every 2 years, but we are still at the very beginning of a huge explosion in information.  Research firm ICD forecast that by 2020, 40 zettabytes of data will have been … Continue reading

Posted in Uncategorized | Tagged , , , , | Leave a comment

My ‘to-read’ pile

My to-read pile is very short at the moment, but I’m slowly working my way through these books: The Information, James Gleick A history of information and the way it’s changed our lives. Black Swan, Nassim Taleb Nothing to do … Continue reading

Posted in Uncategorized | Tagged | 1 Comment