Question About COPY command in 3 nodes Cluster vs 1 standalone node
I have 3 nodes cluster and a standalone node too and want to test load data with COPY command in each of them ...
I have 10 files with 1 million rows and put them into first node of cluster and also in standalone node .
COPY command in 3 node cluster and standalone for same file, takes 50 second ...
so I don't see any optimization in 3 node Cluster
what is the problem ?
0
Comments
It seems that you need to parallelize loading for 3 nodes cluster explicitly.
Please go through below page
https://my.vertica.com/kb/Data-Loading-in-Vertica-Using-COPY/Content/BestPractices/Data-Loading-in-Vertica-Using-COPY.htm
and check
Troubleshooting Load Scenarios ->Loading Large Files ->Use Case: Large files require a long time to load
You can also try using parallel load streams as stated below
https://my.vertica.com/docs/7.1.x/HTML/Content/Authoring/AdministratorsGuide/BulkLoadCOPY/UsingParallelLoadStreams.htm
Let us know if that helps.