fast way to uncompress file inside hadoop HDFS


If I ignore GET | UNZIP | PUT method which basically reads the file from HDFS into local machine, then the easiest way is to use PIG script

a = load '/user/jiri/file.gz' using PigStorage();
store a into '/user/jiri/file' using PigStorage();

it cannot be easier than that

Advertisements

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

%d bloggers like this: