If loading data in through a secondary cluster is less performant, should ETL be done through the primary cluster?@skeswani , @skamat Sizing and Configuring Vertica in Eon Mode for Different Use Cases
yes, it desirable (but not required) required to do ETL on primary subcluster.
The Tuple Mover runs on the primary subcluster. the benefit of doing ETL there is the TM will always find the data in the depot and will save a full round trip to S3. so its efficient , performant and economical.
However doing ETL on secondary subcluster will also work just fine. its merely not the most efficient , performant and economical way of doing it.
Can't find what you're looking for? Search the Vertica Documentation, Knowledge Base, or Blog for more information.