Please take this survey to help us learn more about how you use third party tools. Your input is greatly appreciated!
Should ETL be done through the primary cluster?

If loading data in through a secondary cluster is less performant, should ETL be done through the primary cluster?
@skeswani , @skamat
Sizing and Configuring Vertica in Eon Mode for Different Use Cases
Tagged:
0
Leave a Comment
Can't find what you're looking for? Search the Vertica Documentation, Knowledge Base, or Blog for more information.
Answers
yes, it desirable (but not required) required to do ETL on primary subcluster.
The Tuple Mover runs on the primary subcluster. the benefit of doing ETL there is the TM will always find the data in the depot and will save a full round trip to S3. so its efficient , performant and economical.
However doing ETL on secondary subcluster will also work just fine. its merely not the most efficient , performant and economical way of doing it.