Marge two Vertica clusters
Hi all,
We have two environments of Vertica 7.1.1-10, 10 node each, database of 40TB, K-safety 1.
One as the production and the other is DR.
We want to close the DR site and merge the two clusters to one 20 nodes production cluster.
Several questions:
1. Do I need to uninstall Vertica from the DR site before I'm adding the nodes to the production site? or I need only to drop the existing database on the DR site?
2. How it's work with the downtime when I'm adding the nodes? There will be downtime? Can I still work with the production environment while I'm adding the nodes (at the re-balancing time)?
3. Can I add 10 nodes together? or I need to add them in batches, 5 nodes each time for example?
4. What is the overhead on the system?
5. How can I determent the re-balancing time?
If you have any more insights and things that I need to check or to give more attention... I'd love to hear
Thanks
Chen
Comments
So you wana decomision on Vertica Cluster and reuse the hosts in ass new nodes for the left cluster ?
Before you add any new nodes you need to
2. Purge your data.(less work to be done during the rebalance).
--look for delete vectors
3. Drop any unused projections(less work to be done during the rebalance).
- i use this sql to look for such
4. Optimize your REFRESH resource pool for the rebalance task.
- see if you can increase the PLANNEDCONCURRENCY value for this pool, will improve prerformance.
and after rebalance you can put it back to default.
To add a new node to an existing Vertica Cluster :
1- Make sure you have a host with the same operation system installed as the running Vertica O.S.
2- Prepare the box to receive Vertica software and create the same O.S. layout(partitions).
3- Setup the sshd password-less for root user between the two hosts(running and the host that will be added)
3- Choose the way you want to add the new host to the cluster
A – Using Vertica Management Console.
B – Using the update_vertica utility.
-- run this from one of the nodes that are part of the receveing cluster (as root user)
Once done - list avilable nodes in the cluster
- his command will list all hosts and not all nodes.
This will show you all nodes the run the db
Add the hosts to your db
Rebalance the data to apply to your required K-safety
1. Do I need to uninstall Vertica from the DR site before I'm adding the nodes to the production site? or I need only to drop the existing database on the DR site?
- i would do a clean install
2. How it's work with the downtime when I'm adding the nodes? There will be downtime? Can I still work with the production environment while I'm adding the nodes (at the re-balancing time)?
- no downtime is required(some lock might happen on the tables), it`s just resource intensive so ideal is to do it when you less have access to the db.
3. Can I add 10 nodes together? or I need to add them in batches, 5 nodes each time for example?
- i sudgesst doing in two nodes at a time. Or actually i don`t think i know the right answer, never added more then 2 at a time in the past so i don`t actually know !
4. What is the overhead on the system?
- it is resoirce intensive as you basecly replicate the objects across the cluster.(network perf will impact the times if network is slow)
vioperf: Measures the speed and consistency of your hard drives.
vnetperf: Measures the latency and throughput of your network between nodes.
5. How can I determent the re-balancing time?
What do you mean ?
How long is going to take ? (it all depends on size/number of projecitons/Disk speed/Network latency)
or
When should i do it ? (after you add or remove a node)
hope this small walk thru helped you !
Does anyone know if I can stop the "add nodes" command while it running?
for example, if it takes to much time and it start to overhead on the system and I want to stop the process... can it be done?
thanks