PyDSLA: Erin LeDell Presents Intro to Machine Learning with H2O and Python

This workshop will provide an overview of how to use H2O, the scalable open source machine learning library, from Python. The core algorithms of H2O are implemented in Java, but fully featured APIs are available in R, Python, and Scala, as well as through a web interface. The focus of this hands-on workshop will be the “h2o” Python module.

All of H2O’s algorithm implementations are distributed, which allows the software to scale to large datasets that may not fit into RAM on a single machine. H2O currently features distributed implementations of Generalized Linear Models, Gradient Boosting Machines, Random Forest, and Deep Neural Nets. In this workshop, attendees will learn how to train machine learning models, perform grid search and cross-validation, and evaluate model performance using the H2O Python API.


Erin LeDell is a Statistician and Machine Learning Scientist. She is the main author of H2O Ensemble. Previously, she was the Principal Data Scientist at Marvin Mobile Security (acquired by Veracode in 2012) and the founder of DataScientific, Inc. Erin received her Ph.D. in Biostatistics with a Designated Emphasis in Computational Science and Engineering from the University of California, Berkeley. Her research focuses on ensemble machine learning, learning from imbalanced binary-outcome data, influence-curve-based variance estimation, and statistical computing. She also holds a B.S. and an M.A. in Mathematics.

Date: January 19th, 2016 (Tuesday)

– 6:30pm food/bev & networking
– 7:30pm talk starts promptly

You must have a confirmed RSVP, and please arrive by 6:55pm at the latest. Please RSVP for this meetup here on Eventbrite now, as space is limited by the capacity of the room.

Venue: Venice Arts, 1702 Lincoln Blvd, Venice, CA 90291

