hadoop - Manually splitting and compressing input for Amazon EMR instead of using hadoop-lzo
Instead of using hadoop-lzo to index the LZO input file, I decided to split the input into chunks myself and compress each chunk with LZO so that every compressed file comes out close to 128 MB (since that is the default block size on the Amazon distribution [1]).
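For concreteness, here is a minimal sketch of what that pre-splitting step could look like. It assumes the input is newline-delimited text and the lzop command-line tool is available; ESTIMATED_RATIO, split_and_compress, and the file paths are all illustrative guesses of mine, not something from the post, and the ratio should be measured on a sample of the real data first.

    #!/usr/bin/env python3
    # Sketch of the manual pre-splitting step described above.
    # Assumptions (mine): newline-delimited input, `lzop` installed,
    # ESTIMATED_RATIO is a guess at LZO's compressed/uncompressed
    # ratio for this data. Paths and names are hypothetical.
    import os
    import subprocess

    BLOCK_SIZE = 128 * 1024 * 1024        # target compressed size (~1 HDFS block)
    ESTIMATED_RATIO = 0.5                 # assumed compression ratio for the data
    CHUNK_BYTES = int(BLOCK_SIZE / ESTIMATED_RATIO)  # uncompressed bytes per chunk

    def compress(path):
        # lzop writes <path>.lzo and keeps the original, so drop the plain chunk.
        subprocess.run(["lzop", path], check=True)
        os.remove(path)

    def split_and_compress(src, out_dir):
        os.makedirs(out_dir, exist_ok=True)
        part, written = 0, 0
        out = open(os.path.join(out_dir, "part-%05d" % part), "wb")
        with open(src, "rb") as f:
            for line in f:                # split only on record boundaries
                if written > 0 and written + len(line) > CHUNK_BYTES:
                    out.close()
                    compress(out.name)
                    part, written = part + 1, 0
                    out = open(os.path.join(out_dir, "part-%05d" % part), "wb")
                out.write(line)
                written += len(line)
        out.close()
        compress(out.name)

    if __name__ == "__main__":
        split_and_compress("input.txt", "lzo_chunks")

Note that without a hadoop-lzo index each whole .lzo file is non-splittable and is consumed by a single mapper, which is why sizing the compressed files near one HDFS block matters here.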
Is there anything wrong, from a cluster performance perspective, with providing the input pre-split into compressed files whose size is close to the default HDFS block size?