With Forrester projecting that “100% of all large enterprises will adopt [Hadoop and related technologies such as Spark] for big data analytics within the next two years,” the chances are pretty high that your enterprise is in the midst of a decision, or has already made it: Which Hadoop vendor do I pick? Though this will change over time, “currently there is no absolute winner in the market,” Forrester pointed out, and it’s easy to get confused trying to parse differences between the different stacks.
The Hadoop vendors themselves, however, give us clues as to who they think is winning, as Ovum analyst Tony Baer highlighted. All you have to do is look at who they position themselves against in their marketing literature.
Picking apart a variety of “objective” benchmarking studies, Baer rejected these studies as “self-serving exercises that vendors typically stack in their own favor.” He’s right, but he’s also correct to suggest that, though the data in these surveys is not to be relied upon, there is “metadata” that tells much:
These are the frontrunners, according to Forrester’s methodology. Judging from Hortonworks’ marketing literature, the real frontrunner between the two pure-play Hadoop vendors must be Cloudera, with Forrester acknowledging that “Cloudera’s scope and pace of innovation is astounding,” while “Hortonworks is a rock when it comes to its promise to offer a 100% open source distribution.