Twitter Poll Results: Is Apache Hadoop Running in Production?

By May 22, 2017Blog

Following the publication of our White Paper, 2017 Preview: The Year of Enterprise-wide Production Hadoop, we ran a series of Twitter polls to get a rough sense of where the market is on the following 4 questions:

  1. Does your company use Hadoop in production?
  2. What stage are most of your Hadoop deployments (lab, PoC, Pilot, Enterprise-wide production)
  3. When will you have Hadoop in Enterprise-wide produciton use?
  4. What challenges did you encounter while expanding Hadoop use?

We started with the basics, asking first:

The split between production and non-production use is in line with what we hear from our community.

As we discuss at length in the white paper, this concept of “production” Hadoop can be misleading. For instance, pilot deployments and enterprise-wide deployments are both considered “production,” but they are vastly different in terms of DataOps, as table 1 below illustrates.

Table 1: DataOps Considerations from Lab to Enterprise-wide Production

In the next poll, we learned that 72% of Hadoop deployments are stacked up in the pre-enterprise wide stages.

One of the other diagrams you’ll find in our white paper is the Enterprise Hadoop Deployment Continuum. In the version below, I have added the percentages from the Twitter poll in each stage.

Figure 1: Most Hadoop deployments are in pre and limited production.

With this established, we then asked the Twitterverse when they expect to be enterprise-wide with Hadoop? Reassuringly, the same 28% that told us they were enterprise-wide in poll #2 reiterated this in poll #3.

Less reassuring, however, is that only 9% of those that are presently pre-enterprise wide have concrete plans to get into enterprise-wide in the next 12 months, and even fewer have such plans in the next 24 months.

An eyebrow-raising 55% said that they’re not sure when they will reach enterprise-wide deployment.

And when asked about the challenges big data pros faced increasing their use of Hadoop, responses were very evenly distributed across the four big areas we hear from the ODPi community.

ODPi is here to remove risk and uncertainty from Hadoop and Big Data. We do this through comprehensive testing suites that improve predictability and through compliance programs to ensure interoperability. In other words, ODPi is here to smooth and illuminate the path to enterprise-wide production use of Hadoop for the 55% of respondents that don’t know when (if?) they will get there.

And the ODPi Special Interest Groups, or SIGs, were set up to address the widespread challenges that poll #4 surfaced.

Like all the technical work at ODPi, SIGs are wide open for all to participate in.

Join us and help drive toward solutions in these areas.