Natural Language Processing from Scratch


We introduce the fundamental techniques of natural language processing using Python and OpenNasa datasets. In particular:

  • bag of words models
  • stop words
  • tf-idf
  • basic topic modeling
  • word clouds
  • language detection
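
As a rough illustration of the first three items, the sketch below builds a bag of words, filters stop words, and computes tf-idf scores using only the Python standard library. The toy corpus, the stop-word list, and the names `tokenize`, `bag_of_words`, and `tf_idf` are hypothetical and are not taken from the talk's repository. The weighting shown (term frequency normalized by document length, multiplied by log(N/df)) is one common variant among several.

```python
import math
import re
from collections import Counter

# Hypothetical toy corpus and stop-word list, for illustration only.
DOCS = [
    "The rover landed on the surface of Mars",
    "The telescope observed a distant galaxy",
    "The rover sent images of the surface",
]
STOP_WORDS = {"the", "on", "of", "a"}


def tokenize(text):
    """Lowercase the text, split it into alphabetic tokens, drop stop words."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOP_WORDS]


def bag_of_words(text):
    """Map each remaining token to the number of times it occurs."""
    return Counter(tokenize(text))


def tf_idf(docs):
    """Return one {term: score} dict per document."""
    bags = [bag_of_words(d) for d in docs]
    n_docs = len(bags)
    # Document frequency: in how many documents does each term appear?
    df = Counter(term for bag in bags for term in bag)
    scores = []
    for bag in bags:
        total = sum(bag.values())
        scores.append(
            {t: (count / total) * math.log(n_docs / df[t]) for t, count in bag.items()}
        )
    return scores


BAGS = [bag_of_words(d) for d in DOCS]
SCORES = tf_idf(DOCS)
```

In this toy corpus, "galaxy" appears in only one document and so scores higher than "rover", which appears in two.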

A GitHub repository will be made available with all the code and slides used during the talk.

