Please take this survey to help us learn more about how you use third party tools. Your input is greatly appreciated!

Should ETL be done through the primary cluster?

Deb_ChatignyDeb_Chatigny Administrator
edited April 2020 in General Discussion

If loading data in through a secondary cluster is less performant, should ETL be done through the primary cluster?
@skeswani , @skamat
Sizing and Configuring Vertica in Eon Mode for Different Use Cases

Answers

  • skeswaniskeswani Employee

    yes, it desirable (but not required) required to do ETL on primary subcluster.
    The Tuple Mover runs on the primary subcluster. the benefit of doing ETL there is the TM will always find the data in the depot and will save a full round trip to S3. so its efficient , performant and economical.

    However doing ETL on secondary subcluster will also work just fine. its merely not the most efficient , performant and economical way of doing it.

Leave a Comment

BoldItalicStrikethroughOrdered listUnordered list
Emoji
Image
Align leftAlign centerAlign rightToggle HTML viewToggle full pageToggle lights
Drop image/file

Can't find what you're looking for? Search the Vertica Documentation, Knowledge Base, or Blog for more information.