google-servers-datacenter

Google Launches Cloud Dataproc, A Managed Spark And Hadoop Big Data Service

Google Launches Cloud Dataproc, A Managed Spark And Hadoop Big Data Service

Google is adding another product in its range of big data services on the Google Cloud Platform today. The new Google Cloud Dataproc service, which is now in beta, sits between managing the Spark data processing engine or Hadoop framework directly on virtual machines and a fully managed service like Cloud Dataflow, which lets you orchestrate your data pipelines on Google’s platform.

Greg DeMichillie, director of product management for Google Cloud Platform, says Dataproc users will be able to spin up a Hadoop cluster in under 90 seconds — significantly faster than other services — and Google will only charge 1 cent per virtual CPU/hour in the cluster. That’s on top of the usual cost of running virtual machines and data storage, but as DeMichillie noted, you can add Google’s cheaper preemptible instances to your cluster to save a bit on compute costs. Billing is per-minute, with a 10-minute minimum.

Because Dataproc can spin up clusters this fast, users will be able to set up ad-hoc clusters when needed and because it is managed, Google will handle the administration for them.

Read Also:
This New Mapping Tool is a Data Lover's Dream

Because the service uses the standard Spark and Hadoop distributions (with a few tweaks), it’s compatible with virtually all existing Hadoop-based products, and users should be able to easily port their existing workloads over to Google’s new service.



Chief Analytics Officer Europe

25
Apr
2017
Chief Analytics Officer Europe

15% off with code 7WDCAO17

Read Also:
New Platform Makes Videos As Searchable As Text

Chief Analytics Officer Spring 2017

2
May
2017
Chief Analytics Officer Spring 2017

15% off with code MP15

Read Also:
This New Mapping Tool is a Data Lover's Dream

Big Data and Analytics for Healthcare Philadelphia

17
May
2017
Big Data and Analytics for Healthcare Philadelphia

$200 off with code DATA200

Read Also:
Big Data to accelerate Tour de France coverage

SMX London

23
May
2017
SMX London

10% off with code 7WDATASMX

Read Also:
Top 28 Cheat Sheets for Machine Learning, Data Science, Probability, SQL & Big Data

Data Science Congress 2017

5
Jun
2017
Data Science Congress 2017

20% off with code 7wdata_DSC2017

Read Also:
The Enterprise of Things: It's the back end that counts
Read Also:
Why the Internet of Things won't be about the 'things'

Leave a Reply

Your email address will not be published. Required fields are marked *