Database Space
Hi,
as in vertica documentation , it has mentioned -
HP Vertica excludes,
will HP Vertica excludes this new projection?
as in vertica documentation , it has mentioned -
HP Vertica excludes,
- Multiple projections (underlying physical copies) of data from a logical database entity (table). Data appearing in multiple projections of the same table is counted only once.
will HP Vertica excludes this new projection?
0
Comments
When you are talking about the space used by the projection created in you database, is good to know they will occupy (extra - beside the actual super projection)space.
Here is a demo example:
Now when talking about the Raw Vertica License - the data once dumped into Vertica it is calculated(to the license total) once. Once inside the database you can create as many projecitons(buddy of segmented) as you.
Also the compression(benefits/gains) is not applied to the raw license data calculation.
This one is fine.
But i am talking about something like,
create table table1(x varchar primary key not null,y varchar).
create table table2(x varchar primary key not null,y varchar)
create projection table1_proj_1 as select x from table1;
create projection table1_proj_2 as select y from table1;
i know HP Vertica excludes, these projections as well as buddy projections while auditing database size.
but if i do
create projection proj1_2 as select a.x,b.y from table1 a join table2 b on (a.x=b.x);
will it be incliuded as raw data size ?
Thanks,
The projection proj1_2 size will not be counted in raw data size.
As in Vertica Raw data size means whatever data you have loaded/dumped into vertica.
Creating a projections manually will not increase the raw data size, because it is created internally by you and not loaded/dumped, but it will increase the compressed data size of database.
The used_bytes column in the projection_storage table shows the compressed data size of the projections.
Hope this helps
No License data will be used by internal copy of the data.
Even because Vertica will still be using less space internally then your loaded RAW file size. Of course this depends of the data type and the used encoding
Take this scenario :
You have a raw file of 728M(25 millions rows) holding a table called personal_data with columns(name,gender,country).
After you load this one on you db the file size will be 90% smaller because of you RLE encoding applied on you gender and country fields.
So now you have a table that has 0,2M
Se example: See RLE encoding gains (is like 99% saving in space)
as