One workshop that data scientists don't want you to attend...


By Oliver Laslett & Andraz Hribernik

Filmed at PyData London 2017

Description

With this one weird trick you can build a text-processing pipeline!

We've all fallen for clickbait articles online. They pollute our news feeds and make it harder to filter out valuable information. In this workshop we'll stream news articles in real time and detect clickbait using simple machine learning techniques. You won't believe what happened next...

Abstract

By the end of the workshop you'll have your very own Python app for streaming real-time news and detecting clickbait. In the workshop we'll cover:

  • Streaming data from a REST API
  • Preprocessing textual data
  • Training a simple machine learning classifier for clickbait
  • Putting everything together in a scikit-learn pipeline
  • Analysing our results (which news source is the most clickbaity?)
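
As a rough illustration of the last three points, here is a minimal sketch of a clickbait classifier built as a scikit-learn pipeline. It is not the workshop's actual code: the toy headlines, the source names, the choice of TF-IDF features with logistic regression, and the helper names `build_pipeline` and `clickbait_rate_by_source` are all assumptions made for this example. The hard-coded `ARTICLES` list stands in for data that would be streamed from a REST API.

```python
from collections import defaultdict

from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline

# Hypothetical toy training data: (headline, label), where 1 = clickbait.
TRAIN = [
    ("You won't believe what happened next", 1),
    ("This one weird trick will change your life", 1),
    ("10 things only coffee lovers will understand", 1),
    ("What this dog did next will melt your heart", 1),
    ("Central bank raises interest rates by a quarter point", 0),
    ("Parliament votes on new transport budget", 0),
    ("Scientists publish results of climate study", 0),
    ("Local council approves plans for new school", 0),
]

# Hypothetical (source, headline) pairs standing in for streamed articles.
ARTICLES = [
    ("example-daily", "You won't believe this one weird budget trick"),
    ("example-daily", "Council votes on new school budget"),
    ("example-buzz", "10 things that will melt your heart"),
]


def build_pipeline():
    """Chain text preprocessing and a classifier into one estimator."""
    return Pipeline([
        # Lowercase the text and weight unigrams and bigrams by TF-IDF.
        ("tfidf", TfidfVectorizer(lowercase=True, ngram_range=(1, 2))),
        # A simple linear classifier; any scikit-learn classifier would fit here.
        ("clf", LogisticRegression()),
    ])


def clickbait_rate_by_source(model, articles):
    """Return the fraction of headlines flagged as clickbait per source."""
    counts = defaultdict(lambda: [0, 0])  # source -> [flagged, total]
    for source, headline in articles:
        flagged = int(model.predict([headline])[0])
        counts[source][0] += flagged
        counts[source][1] += 1
    return {source: flagged / total for source, (flagged, total) in counts.items()}


headlines, labels = zip(*TRAIN)
model = build_pipeline().fit(list(headlines), list(labels))
rates = clickbait_rate_by_source(model, ARTICLES)
print(rates)
```

With only eight training headlines the predictions are not meaningful; the point is the shape of the workflow. Because preprocessing and classification live in one `Pipeline` object, the same `fit` and `predict` calls work on raw headline strings, which makes it straightforward to apply the model to each article as it arrives.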

