We're Moving!

The Vertica Forum is moving to a new OpenText Analytics Database (Vertica) Community.

Join us there to post discussion topics, learn about

product releases, share tips, access the blog, and much more.

Create My New Community Account Now


CSV vs ORC vs Parquet — Vertica Forum

CSV vs ORC vs Parquet

adriantarauadriantarau Vertica Customer

Which format in Vertica is better for loading a lot of data (>100 billion rows per day, average file size ~20MB, 50-100 columns)?

Regardless of how long it takes the client to write the file, I'm interested in knowing the fastest way to get the data.

Tagged:

Answers

  • Parquet. It's easy to use, and faster than Orc. I'm not sure I've seen comparisons between Parquet and CSV, but it's easier to load Parquet generally.

Leave a Comment

BoldItalicStrikethroughOrdered listUnordered list
Emoji
Image
Align leftAlign centerAlign rightToggle HTML viewToggle full pageToggle lights
Drop image/file