Personal Information
Organization / Workplace
San Francisco Bay Area United States
Occupation
Principal Staff Software Engineer at LinkedIn
Industry
Technology / Software / Internet
About
I am a technical lead in the data infrastructure team at LinkedIn. I like solving large scale challenges in distributed data systems. I've incubated and launched several key data infrastructure projects at LinkedIn, some of which are open source: Apache Helix, Espresso and Databus.
I'm currently involved in building some new projects aimed at simplifying the big data analytics space: Cubert, Gobblin, Pinot and WhereHows.
Tags
big data
hadoop
strata
metadata
governance
data science
kafka
analytics
etl
privacy
apache
streaming
gobblin
linkedin
business intelligence
reporting
"big data" "data warehouse" pinot gobblin
databus linkedin socc cdc
See more
- Presentations
- Documents
- Infographics