Practical MapReduce
[Pages:18]Practical MapReduce Top ten tips
Tom White, Cloudera Hadoop User Group UK, London 14 April 2009
About me
Apache Hadoop Committer, PMC Member, Apache Member
Employed by Cloudera Writing a book on Hadoop for O'Reilly
What is MapReduce?
Another way of looking at it
Tips
1. Use the right MapReduce language
Structured Dynamic System
Pig, Hive Streaming, Dumbo
Java
2. Consider your input data "chunk" size
Hadoop dislikes lots of small files Namenode eats memory MapReduce produces too many trivial maps
CombineFileInputFormat Packs multiple files into one split Considers locality
Large unsplittable files aren't great either
But see LzoTextInputFormat
................
................
In order to avoid copyright disputes, this page is only a partial summary.
To fulfill the demand for quickly locating and searching documents.
It is intelligent file search solution for home and business.
Related download
- cbs news new york times poll iraq and the war against
- cbs news face the nation
- america s healthiest buys beauty and home products to find
- practical mapreduce
- cbs news poll
- cbs news poll the fall campaign begins september 6 8 2004
- juggling college admissions issues and once the letters
- travel and entertainment cbs you
Related searches
- definition of practical significance
- practical significance definition
- what is practical significance
- practical significance definition psychology
- practical significance vs statistical
- practical significance statistics definition
- define practical significance
- difference between statistical and practical significance
- practical significance example
- what is practical significance in statistics
- easy statistical and practical significance
- statistical vs practical significance example