Workshop: [SOLD OUT] Machine Learning and Natural Language Processing on Social Media Data at Scale

Location: O’Neill, 4th fl.

Duration: 1:00pm - 4:00pm

Day of week: Thursday

Level: Intermediate

Key Takeaways

  • Running Spark and Hadoop clusters on Google Cloud Platform
  • Using Spark Spark MLlib on Google Cloud Platform
  • Using AutoML for Custom Deep Learning models
  • Using Natural Language APIs for Natural Language Processing


Familiarity with GCP, Spark and/or Machine Learning is helpful but not necessary.    

This workshop is mostly hands-on. Please bring a laptop. No pre-installs necessary.

Social media data is a mirror for our collective thoughts and views on the world, but working with these massive datasets is challenging. In this workshop, we’ll cover how Spark and Hadoop on Google Cloud can help us prepare datasets in parallel, at scale. Then we’ll use this data to build machine learning models with Spark MLlib, Google Cloud Natural Language API and Google Cloud AutoML. You'll walk away from this workshop with a better understanding of how to do natural language processing on massive troves of text data.

Speaker: Brad Miro

Machine Learning Engineer @Google

Brad is passionate about educating the world about artificial intelligence both by empowering developers and improving societal understanding. He is currently a Developer Programs Engineer at Google where he specializes in machine learning and big data solutions. Outside of work, Brad can be found singing, climbing, playing board games and locating the best restaurants in NYC.

Find Brad Miro at

Speaker: Dale Markowitz

Software Engineer @Google Research

Dale Markowitz is a Developer Advocate for Machine Learning on Google Cloud. Formerly she worked as a software engineer for Google Research and the online dating site OkCupid.

Find Dale Markowitz at


This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.