Hadoop

Cloudera CDH on the Bigstep Metal Cloud

CDH provides a single-tenant and high-performance Hadoop environment which can be deployed and scaled at the click of a button. CDH is the only Hadoop solution to offer unified batch processing, interactive SQL, interactive search, and role-based access controls. CDH has been downloaded by more enterprises than all other similar distributions combined.

Test Cloudera CDH on the Bigstep Metal Cloud

MapR on the Bigstep Metal Cloud

MapR on the Bigstep Metal Cloud is the most extensive high-availability (HA) Hadoop distribution on the market. Its unique architecture advantage makes it not only the most reliable solution for big data, but also the world’s fastest solution to date.

Test MapR on the Bigstep Metal Cloud

Hadoop Then & Now: Doug Cutting’s Take on the Past, the Present, and the Future of This Game-Changing Technology

Not only is Doug Cutting the brains behind the most successful big data platform, Hadoop, he is also one of the reasons why the open source community was finally able to push its way into the enterprise, alongside the proprietary software giants who dominated those arenas for decades. Now on board with Cloudera, Cutting’s heart is still well rooted in his brainchild Hadoop. Named for a yellow toy elephant that lives in his sock drawer, the name Hadoop was actually created by his son. Cutting wanted a name that didn’t mean anything, so that the product wouldn’t be limited by its name as it evolved. He actually held onto the name idea for years before the opportunity presented itself. Aside from the unique (and undoubtedly cute) name, what else does Cutting have to say about his open source prodigy? Continue Reading

3 Predictions for Hadoop & Big Data Between Now and 2021

The past few years have brought some growing pains for big data and Hadoop. They had to fight the reputation that they were just buzzwords and had nothing really new to offer the business world. Then they had to overcome the worries over security, which most new technologies have to face in today’s cyber environment (ahem, cloud, mobile, and social). Next, they had to become a bit more user-friendly. But big data and Hadoop have now proven their worth and hurdled those obstacles — what’s next? Here’s a peek at the most prominent business technologies as we bound toward 2021. Continue Reading

Why Spark Isn’t Going to Dethrone Hadoop Anytime Soon

Once upon a time, Yahoo! needed a way to scale out storage, along with the ability to parallelize tasks, without emptying the king’s coffers. The king’s men ended up developing HDFS and MapReduce. This was the beginning of Hadoop, and for quite some time, it appeared as if Hadoop would be THE platform for managing big data. Vendors came to pay homage to the king. Open source projects ran wild. The future was good. Continue Reading

5 Incredible Ways to Use Your Hadoop-Based Data Lake

When a new concept comes along, the first thing it has to prove to the marketplace is why it is a valuable thing and what it has to offer that previous ideas can’t. Such is the case with both Hadoop and the data lake. Neither is easy to master, and both come with considerable investments of time and expense. What does your organization stand to gain for these investments? What can you actually get done with a data lake built on Hadoop? Continue Reading

Big Data’s ‘Elephant in the Room’: The Issue Nobody Wants to Talk About

It’s been a hushed murmur in LinkedIn discussions and blog posts. It’s a silent scream in many businesses. But until now, it hasn’t been spoken aloud. Intel changed all that recently by coming out and publicly saying, “This is the dirty little secret about big data: No one actually knows what to do with it,” stated Jason Waxman, an Intel vice president and general manager of the Intel cloud platforms team, “They think they know what to do with it, and they know they have to collect it, because you have to have a big data strategy. But deriving the insights from big data is a little harder to do,” he said. Continue Reading