What can we learn from 750 billion GitHub events and 42 TB of code?
"Data gives us insights into how people build software, and the activities of open source communities on GitHub represent one of the richest datasets ever created of people working together at scale." --GitHub Universe 2016
We are going to analyze, live on stage, 5 years of GitHub metadata and 42 TB of code stored in it to answer questions like:
- How is this run?
- Design decisions based on actual usage of APIs
- The most effective phrasing to request changes
- Effects of social media
- Beyond project stars
- Community health
- Static code analysis
- Tabs or spaces?
In 2011 Felipe Hoffa moved from Chile to San Francisco to join Google as a Software Engineer. Since 2013 he has been a Developer Advocate for big data, inspiring developers around the world to use the Google Cloud Platform tools to analyze and understand their data in ways they never could before. You can find him in YouTube videos, in blog posts, and at conferences around the world.