| By Maureen O'Gara | Article Rating: |
|
| April 13, 2009 05:45 AM EDT | Reads: |
9,621 |
Cloudera, the start-up that going to commercialize Hadoop, the Google-inspired, Apache-fostered open source software that powers the data processing engines behind some of the biggest and most popular web sites - sites like Yahoo, Facebook, Amazon and Google itself - even Microsoft - pulled in a $5 million first round led by Accel Partners.
Ah, but the private investors going in with Accel constitute a veritable cavalcade of industry glitterati that you practically have to put sunglasses on just to read the list.
It includes VMware co-founder and former CEO Diane Greene and her husband, VMware's other co-founder, Mendel Rosenblum, Flickr co-founder Caterina Fake, Microsoft's online chief and former Yahoo EVP Qi Lu, former MySQL CEO Marten Mickos, LinkedIn president Jeff Weiner, Loudcloud founder In Sik Rhee, Illustra CEO Dick Williams, Facebook CFO Gideon Yu, Palm SVP Mike Abbott and early Google employee David desJardins.
Goodness me, what validation!
The operation got started last October and just announced the general availability of its free Distribution for Hadoop, a pre-packaged RPM bundle for Red Hat Linux systems or an Amazon EC2 image licensed under an Apache 2 license.
The web site widgetry, written in Java, stores and processes big data, petabytes of information often distributed across thousands of servers, and Cloudera means to bring its data analysis skills to enterprise data center by making it easier to install, configure and manage, according to co-founder Christophe Bisciglia, the former manager of Google's Hadoop cluster.
Cloudera's other founders include CEO Mike Olsen, the guy who sold Sleepycat Software to Oracle, Amr Awadallah, Yahoo's former VP of engineering, and Jeff Hammerbacher, creator of the Hive project and conveniently enough entrepreneur-in-residence at Accel Partners.
Hadoop's creators Doug Cutting and Mike Cafarella, who reverse engineered the open source project from a Google research paper, are advisors.
Cloudera's distribution, based on the stable Hadoop 0.18.3, includes the Hadoop Distributed File System (HDFS), which runs on commodity hardware and supports tens of millions of files in a single instance; the Google-conceived MapReduce, which divides applications into small blocks of work for automatic parallelization and execution on large clusters; Hive, the data warehousing infrastructure built on top of Hadoop; and Pig, the platform for analyzing large data sets in Hadoop using the high-level language for expressing data analysis programs called logically enough PigLatin.
Cloudera has launched a portal at http://my.cloudera.com where people can use a free web-based configuration tool to create custom packages. Settings for the clusters can be saved on the portal to enable automatic updates.
It's also got a free pre-configured VMware image available for evaluation and use in equally free online training (http://www.clodera.com/hadopp-training). It'll run on Linux, Mac or Windows desktops.
The company expects to make money on support and consulting. It plans on chasing biotechs, the oil and gas cartel, insurance companies and retail establishments.
Hadoop, by the way, is named for a stuffed elephant that belonged to Cutting's son.
Published April 13, 2009 Reads 9,621
Copyright © 2009 SYS-CON Media, Inc. — All Rights Reserved.
Syndicated stories and blog feeds, all rights reserved by the author.
More Stories By Maureen O'Gara
Maureen O'Gara the most read technology reporter for the past 20 years, is the Cloud Computing and Virtualization News Desk editor of SYS-CON Media. She is the publisher of famous "Billygrams" and the editor-in-chief of "Client/Server News" for more than a decade. One of the most respected technology reporters in the business, Maureen can be reached by email at maureen(at)sys-con.com or paperboy(at)g2news.com, and by phone at 516 759-7025. Twitter: @MaureenOGara
- Cloud Expo New York Speaker Profile: Jill T. Singer – Federal CIO Emeritus
- Cloud Expo New York: API Security, Does My Business Need an OAuth Server?
- Session Topics: 12th Cloud Expo / Cloud Expo New York
- Cloud Expo New York: Aligning Your Cloud Security with the Business
- Cloud Expo NY: Best Practices for Architecting Your Cloud Infrastructure
- The Rise of the Thin Client
- Patterns to Bring Enterprise and Social Identity to the Cloud
- NIST to Sponsor FFRDC Widespread Adoption of Integrated CyberSecurity
- Lunch Keynote at Cloud Expo New York | CIOs Are Transforming the Cloud
- Logicworks to Exhibit at Cloud Expo New York
- Cloud Expo NY: Virtualization, Compliance, and Healthcare in the Cloud
- Is Cloud Safer Than Your Traditional Datacenter?
- Cloud Expo New York Speaker Profile: Jill T. Singer – Federal CIO Emeritus
- Cloud Expo New York: API Security, Does My Business Need an OAuth Server?
- Session Topics: 12th Cloud Expo / Cloud Expo New York
- Cloud Expo New York | Danger Ahead: Why File Sync Is NOT Endpoint Backup
- Cloud Expo New York: Aligning Your Cloud Security with the Business
- Cloud Expo NY: Best Practices for Architecting Your Cloud Infrastructure
- Overview of the OpenStack Cloud
- The Rise of the Thin Client
- Cloud Expo New York: Managing Legal Risks in Cloud Computing
- Patterns to Bring Enterprise and Social Identity to the Cloud
- NIST to Sponsor FFRDC Widespread Adoption of Integrated CyberSecurity
- Lunch Keynote at Cloud Expo New York | CIOs Are Transforming the Cloud
- Effective Page Authorization In JavaServer Faces
- The Top 250 Players in the Cloud Computing Ecosystem
- Cloud Expo New York Call for Papers Now Open
- SOA Focus - Web Services Security in Java EE
- IBM Security Report Predicts Mobile/Satellite Attacks in 2005
- Industry Experts Discuss the State of Cloud Computing
- The Cloud Computing Kettle Heats Right Up
- The Top 100 Bloggers on Cloud Computing
- The Next Chapter in the Virtualization Story Begins
- Java Application Security in the Corporate World
- ColdFusion Security Best Practices
- Cloud Expo 2011 East To Attract 10,000 Delegates and 200 Exhibitors























