GET_COMPLIANCE_STATUS ==> Raw Data Size is displaying more than disk usage
PavanVejju
Vertica Customer ✭
Command: SELECT GET_COMPLIANCE_STATUS();
Current raw data size is 2TB
Disk usage is 1.5 TB
Why it is showing more than disk usage
Tagged:
0
Answers
License usage represents actual data size stored by Vertica.
Disk usage shows compressed data size recorded on disk. Disk usage is less than license due to compression and encoding.
SELECT * FROM DELETE_VECTORS;
For above command, it shows few records.
so if we execute purge(), will it minimise actual data size?
or
any alternative to minimise actual data size
Will projections storage is also consider as license usage
Removing deleted data with PURGE() will reduce data size and license usage.
Projection storage does not count as license usage, as it is duplicate data.
To minimize actual db size on disk, check you current status first.
Then drop unnecessary projections.
Try to choose better encoding and sort order on Correlated Columns for projections with weak compression.
Better encoding can improve your compression ratio and might effect queries performance, so it is advised first to check its size and performance on data sample.