New big data tools for machine learning spring from home of Spark and Mesos

New big data tools for machine learning spring from home of Spark and Mesos

New big data tools for machine learning spring from home of Spark and Mesos

If the University of California, Berkeley's AMPLab doesn't ring bells, perhaps some of its projects will: Spark and Mesos.

AMPLab was planned all along as a five-year computer science research initiative, and it closed down as of last November after running its course. But a new lab is opening in its wake: RISELab, another five-year project at UC Berkeley with major financial backing and the stated goal of "focus[ing] intensely for five years on systems that provide Real-time Intelligence with Secure Execution [RISE]."

AMPLab was created with "a vision of understanding how machines and people could come together to process or to address problems in data -- to use data to train rich models, to clean data, and to scale these things," said Joseph E. Gonzalez, Assistant Professor in the Department of Electrical Engineering and Computer Science at UC Berkeley.

RISELab's web page describes the group's mission as "a proactive step to move beyond big data analytics into a more immersive world," where "sensors are everywhere, AI is real, and the world is programmable." One example cited: Managing the data infrastructure around "small, autonomous aerial vehicles," whether unmanned drones or flying cars, where the data has to be processed securely at high speed.

Read Also:
Blockchain, IoT, Artificial Intelligence Poised to Shake Up Healthcare

Other big challenges Gonzalez singled out include security, but not the conventional focus on access controls. Rather, it involves concepts like "homomorphic" encryption, where encrypted data can be worked without first having to decrypt it. "How can we make predictions on data in the cloud," said Gonzalez, "without the cloud understanding what it is it's making predictions about?"

Though the lab is in its early days, a few projects have already started to emerge:

Machine learning involves two basic kinds of work: Creating models from which predictions can be derived and serving up those predictions from the models. Clipper focuses on the second task and is described as a "general-purpose low-latency prediction serving system" that takes predictions from machine learning frameworks and serves them up with minimal latency.

Clipper has three aims that ought to draw the attention of anyone working with machine learning: One, it accelerates serving up predictions from a trained model. Two, it provides an abstraction layer across multiple machine learning frameworks, so a developer only has to program to a single API.

Read Also:
Preventing sports injuries with wearable tech and data analysis


Data Innovation Summit 2017

30
Mar
2017
Data Innovation Summit 2017

30% off with code 7wData

Read Also:
Converging IoT, Cloud, and Big Data Technologies to Revolutionize the World

Big Data Innovation Summit London

30
Mar
2017
Big Data Innovation Summit London

$200 off with code DATA200

Read Also:
Converging IoT, Cloud, and Big Data Technologies to Revolutionize the World

Enterprise Data World 2017

2
Apr
2017
Enterprise Data World 2017

$200 off with code 7WDATA

Read Also:
How Predictive AI Will Change Shopping

Data Visualisation Summit San Francisco

19
Apr
2017
Data Visualisation Summit San Francisco

$200 off with code DATA200

Read Also:
Quick Guide to Data Journalism

Chief Analytics Officer Europe

25
Apr
2017
Chief Analytics Officer Europe

15% off with code 7WDCAO17

Read Also:
Salesforce Einstein deploys AI across the cloud suite

Leave a Reply

Your email address will not be published. Required fields are marked *