Vertica File Format?

Are there any resources out there that explain how a Vertica ROS container is structured?  Something like this resource about how ORC files are structured in HDFS.

https://cwiki.apache.org/confluence/display/Hive/LanguageManual+ORC

Thanks!

Comments

  • Hi!

          Can I ask what the intention is this information?  Accessing data directly from the ROS containers is really not advisable.  Let me see if I can find you a resource.

     

    -Chris

  • Really just for understanding and curiousity.  Every system I've worked with I learned about how writes happen at the file/block level and thought it'd be interesting to know.

     

    Thanks for the help!

  • Try to find "Materialization Strategies in the Vertica Analytic Database: Lessons Learned"

    It explains a lot. The Best Vertica archictural document I ever read!

  • Wow, thanks for that!  It was a very good read and very informational.

     

    I have a follow up question about container storage.  So I know data is stored in pairs of files per column, in multiple containers. But what determines the maximum number of tuples per container?  

  • Have you read about TP strata algorithm?

  • Hi ,
    Their is no such setting for max tuples per contener.

     

    The setting is on disk storage size level , meaning when container reach certain size threshold ,  vertica will no longer try to merge to it new data. at  this point vertica will create new container.


    This concept is implemented inside their internal strata algorithm
    I hope you will find it useful


    Thanks

Leave a Comment

BoldItalicStrikethroughOrdered listUnordered list
Emoji
Image
Align leftAlign centerAlign rightToggle HTML viewToggle full pageToggle lights
Drop image/file