Google Launches Cloud Dataproc, A Managed Spark And Hadoop Big Data Service

Google is adding another product to its range of big data services on the Google Cloud Platform today. The new Google Cloud Dataproc service, now in beta, sits between managing the Spark data processing engine or Hadoop framework directly on virtual machines and a fully managed service like Cloud Dataflow, which lets you orchestrate your data pipelines on Google’s platform.

Greg DeMichillie, director of product management for Google Cloud Platform, says Dataproc users will be able to spin up a Hadoop cluster in under 90 seconds (significantly faster than other services), and Google will charge only 1 cent per virtual CPU per hour in the cluster. That’s on top of the usual cost of running virtual machines and data storage, but as DeMichillie noted, you can add Google’s cheaper preemptible instances to your cluster to save a bit on compute costs. Billing is per-minute, with a 10-minute minimum.
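
To make the pricing concrete, here is a rough Python sketch of the Dataproc surcharge as described above. The function name and parameters are hypothetical, and rounding partial minutes up to the next whole minute is an assumption; the article only says that billing is per-minute with a 10-minute minimum. The result covers the Dataproc fee alone, not the underlying virtual machine and storage costs.

```python
import math


def dataproc_fee_usd(vcpus, minutes, rate_per_vcpu_hour=0.01, minimum_minutes=10):
    """Estimate the Dataproc surcharge in US dollars (hypothetical helper).

    Assumes partial minutes are rounded up, which the article does not confirm.
    Excludes the cost of the virtual machines and data storage themselves.
    """
    # Apply per-minute billing with the 10-minute minimum.
    billed_minutes = max(math.ceil(minutes), minimum_minutes)
    # Convert the hourly per-vCPU rate to the billed duration.
    return vcpus * rate_per_vcpu_hour * billed_minutes / 60


# A 24-vCPU cluster that runs for 45 minutes.
print(f"{dataproc_fee_usd(24, 45):.2f}")  # prints 0.18

# The same cluster torn down after 4 minutes is still billed for 10.
print(f"{dataproc_fee_usd(24, 4):.2f}")  # prints 0.04
```

Under these assumptions, the surcharge for short-lived clusters stays at a few cents, which is what makes the ad-hoc usage pattern plausible.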

Because Dataproc can spin up clusters this fast, users will be able to set up ad-hoc clusters when needed, and because the service is managed, Google will handle the administration for them.

Because the service uses the standard Spark and Hadoop distributions (with a few tweaks), it’s compatible with virtually all existing Hadoop-based products, and users should be able to easily port their existing workloads over to Google’s new service.


