A growing number of data experts are confirming that 2016 is the year of action when it comes to data analytics. As a result, data professionals have moved beyond general discussions of what data analytics can do for their organizations, and want real use cases that they can learn from and copy.
Information Management spoke with Ravi Dharnikota, head of enterprise architecture at SnapLogic, about his observations at the recent Strata & Hadoop World conference in San Jose, CA. Dharnikota said attendees he spoke with were most interested in streaming data, data lakes, and in Apache Spark as an analytics platform of choice.
Information Management: What are the most common themes that you heard among conference attendees and how do those themes align with what you expected?
Ravi Dharnikota: Compared to the 2015 event, this year shifted a bit away from academic discussions of the latest Apache project and towards real use cases. This year I heard quite a bit about:
Data lake — How to do the lake right; ingestion; governance; data prep.
Spark — A huge shift towards support for technologies to run on Spark as a platform.
IM: What are the most common data challenges that attendees are facing?
RD: One of the most common challenges with data management is simply its pervasiveness. It’s everywhere in the organization. They need some way of bringing it all together in one place, making data searchable and consumable by everyone, with “guardrails” in place.
The other challenge is that the big data ecosystem is both constantly changing and can be quite noisy with overlapping messages from vendors and open source die-hards. Organizations that just want to get stuff done to drive business practices need help from end-end frameworks.