1) If we create a table (in both Hive and Impala) and only specify STORED AS PARQUET, will it be Snappy compressed by default in CDH?

Currently the default compression for Impala tables is Snappy.

2) If not, how do I tell a Parquet table with Snappy compression from a Parquet table without it?

Note: the table description will always show the compression as NO, because the compression format is not stored in the table metadata. The best way is to run dfs -ls -R on the table location and check the file format of the data files for compression.

3) How do I specify Snappy compression at table level when creating a table, and also at global level, so that all tables stored as Parquet are Snappy compressed even if nobody specified it at table level?

CREATE TABLE external_parquet (c1 INT, c2 STRING)

Globally, i.e. through a file that is executed when you launch the Hive shell: put the above in the location in CDH /etc/hive/1. If you don't find one, you can always create it. Please refer to this link for more Create Table properties.

1) Since Snappy is not too good at compression (disk), what would be the difference in disk space for a 1 TB table stored as Parquet only versus Parquet with Snappy compression?

I created three tables with different scenarios.
TABLE 2: text format with default compression Snappy
TABLE 3: Parquet with compression enabled as Snappy

2) Is it possible to compress a non-compressed Parquet table later with Snappy?

Alter table is a logical operation that updates the table metadata in the metastore database.
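The table-level and session-level settings discussed above could look roughly like the following HiveQL. This is a minimal sketch, not the thread's exact statements: the table names demo_parquet_snappy and demo_parquet_plain are hypothetical, and it assumes your Hive version honors the parquet.compression property. Since altering a table only changes metadata, the last statement assumes the usual workaround of rewriting the data so that new files are written with the codec.

```sql
-- Table level: request Snappy for Parquet files written to this table.
-- demo_parquet_snappy is a hypothetical name used only for illustration.
CREATE TABLE demo_parquet_snappy (c1 INT, c2 STRING)
STORED AS PARQUET
TBLPROPERTIES ("parquet.compression"="SNAPPY");

-- Session level (Hive): affects Parquet files written later in this session.
-- In impala-shell the corresponding query option is: SET COMPRESSION_CODEC=snappy;
SET parquet.compression=SNAPPY;

-- Existing uncompressed files are not changed by metadata updates.
-- Rewriting the rows produces new files that use the codec set above.
-- demo_parquet_plain is a hypothetical uncompressed source table.
INSERT OVERWRITE TABLE demo_parquet_snappy
SELECT c1, c2 FROM demo_parquet_plain;
```

After the rewrite, listing the table location with dfs -ls -R and comparing file sizes against the original table is one way to see the effect on disk.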