Hadoop Primer

Hadoop - A Primer

by Steve Staso
July, 2008

Hadoop is a distributed computing platform written in Java. It incorporates features similar to those of the
Google File System and of MapReduce to process vast amounts of data

"Hadoop is a Free Java software framework that supports data intensive distributed applications running on large clusters of commodity computers. It enables applications to easily scale out to thousands of nodes and petabytes of data" (Wikipedia)

  • What platform does Hadoop run on?
  • Java 1.5.x or higher, preferably from Sun
  • Linux
  • Windows for development
  • Solaris, but not documented - yet

Labels

storage storage Delete
cluster cluster Delete
web web Delete
hpc hpc Delete
other other Delete
Enter labels to add to this page:
Please wait 
Looking for a label? Just start typing.

Sign up or Log in to add a comment or watch this page.


The individuals who post here are part of the extended Sun Microsystems community and they might not be employed or in any way formally affiliated with Sun Microsystems. The opinions expressed here are their own, are not necessarily reviewed in advance by anyone but the individual authors, and neither Sun nor any other party necessarily agrees with them.

© 2010, Oracle Corporation and/or its affiliates
Powered by Atlassian Confluence
Oracle Social Media Participation Policy Privacy Policy Terms of Use Trademarks Site Map Employment Investor Relations Contact