(Posting the slides and video recording from this event is long overdue – our apologies for the delay.)
IRIS.TV is a video recommendation engine, and in their excellent talk Tom & Joel shared with us their approach to personalization and how they use machine learning to achieve their goals. They also shared the architecture of their systems, including how data is flowing between the various parts (i.e. the ETL pipeline). One thing that popped up as a question and I found interesting is that the usage of R and Python in their company is split 50-50. In fact, while Tom is using mainly R, Joel is using mainly Python. While this is perhaps unusual at a relatively small startup, our survey on software tools used for data science revealed that most data scientists in LA (at least those attending the meetups) are using R or Python, with a roughly equal split/small advantage for R. It just happens that for obvious reasons (such as network effects) that one tool dominates in most companies.
The talk touched on several other issues, and you can view the slides here:
I’d like to thank Cross Campus for hosting and Factual for sponsoring and providing pizza. (I would also like to personally thank Aaron Crow for not only being a great software engineer and organizer of several meetups, but also for making sure we have great pizza).
Finally, below are a few pictures from this meetup (Thanks, Jeff Weakley!) and more meetup pics are available in this flickr album. If you are interested, there are even more pictures from our previous meetups available in our flickr account here.