LinkedIn said it will open source an internal application called WhereHows, which is a data mining portal for enterprise information.
Technically, LinkedIn calls WhereHows "a data discovery lineage portal." From a business perspective, WhereHows is designed to surface data from multiple stores via metadata.
According to LinkedIn, WhereHows has captured the status of 50,000 datasets, 14,000 comments and 35 million job executions good for a storage footprint topping 15 petabytes.
In a blog post, LinkedIn outlined the reasons it built WhereHows--its big data ecosystem was too diversified with multiple applications designed to do one specific job. As a result, LinkedIn has everything from Informatica to Spark to Hive to Oracle to Hadoop to Teradata as well as a bevy of schedulers.;
Chief Analytics Officer Europe
15% off with code 7WDCAO17
Chief Analytics Officer Spring 2017
15% off with code MP15
Big Data and Analytics for Healthcare Philadelphia
$200 off with code DATA200
10% off with code 7WDATASMX
Data Science Congress 2017
20% off with code 7wdata_DSC2017