New big data tools for machine learning spring from home of Spark and Mesos
- by 7wData
If the University of California, Berkeley's AMPLab doesn't ring bells, perhaps some of its projects will: Spark and Mesos.
AMPLab was planned all along as a five-year Computer science research initiative, and it closed down as of last November after running its course. But a new lab is opening in its wake: RISELab, another five-year project at UC Berkeley with major financial backing and the stated goal of "focus[ing] intensely for five years on systems that provide Real-time Intelligence with Secure Execution [RISE]."
AMPLab was created with "a vision of understanding how machines and people could come together to process or to address problems in data -- to use data to train rich models, to clean data, and to scale these things," said Joseph E. Gonzalez, Assistant Professor in the Department of Electrical Engineering and Computer Science at UC Berkeley.
RISELab's web page describes the group's mission as "a proactive step to move beyond big data analytics into a more immersive world," where "sensors are everywhere, AI is real, and the world is programmable." One example cited: Managing the data infrastructure around "small, autonomous aerial vehicles," whether unmanned drones or flying cars, where the data has to be processed securely at high speed.
Other big challenges Gonzalez singled out include security, but not the conventional focus on access controls. Rather, it involves concepts like "homomorphic" encryption, where encrypted data can be worked without first having to decrypt it. "How can we make predictions on data in the cloud," said Gonzalez, "without the cloud understanding what it is it's making predictions about?"
Though the lab is in its early days, a few projects have already started to emerge:
Machine learning involves two basic kinds of work: Creating models from which predictions can be derived and serving up those predictions from the models. Clipper focuses on the second task and is described as a "general-purpose low-latency prediction serving system" that takes predictions from machine learning frameworks and serves them up with minimal latency.
Clipper has three aims that ought to draw the attention of anyone working with machine learning: One, it accelerates serving up predictions from a trained model. Two, it provides an abstraction layer across multiple machine learning frameworks, so a developer only has to program to a single API.
[Social9_Share class=”s9-widget-wrapper”]
Upcoming Events
Shift Difficult Problems Left with Graph Analysis on Streaming Data
29 April 2024
12 PM ET – 1 PM ET
Read MoreCategories
You Might Be Interested In
DataOps: The New DevOps of Analytics
31 Mar, 2019According to Gartner’s report, Innovation Insight for DataOps, 27 December 2018, “DataOps is a collaborative data management practice focused on …
Tips to plan storage elements of artificial intelligence
4 Aug, 2022Organizations create more data than ever before. Newer, faster technologies and storage systems have risen to the challenge, but the …
5 Things to Think About When Considering Monetizing Data
27 Mar, 2018Trey Stephens is the Director of Audience Monetization at Acxiom, where he specializes in connecting offline with online data and …
Recent Jobs
Do You Want to Share Your Story?
Bring your insights on Data, Visualization, Innovation or Business Agility to our community. Let them learn from your experience.