vdf.describe() really slow — Vertica Forum

BHIFP Vertica Customer



The SUMMARIZE_NUMCOL call in the 3rd SQL statement is really slow, taking more than 10 minutes to finish. Is there a way to speed it up?

Answers

  • Well, yeah, you're running a statistical summary across 17B rows. I'd expect that to take at least a little while...

    It's meant as an exploratory function - not something you'd actually bake into a production process. Also, if you don't need all those statistics, you could just call specific things like stddev() or whatever, if that's what you're actually interested in.
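
    To illustrate the "just call specific things" suggestion, here is a rough sketch that builds a query for only the aggregates you care about instead of the full summary. The helper name build_targeted_stats_query and the table/column names (public.events, amount) are made up for the example; the aggregates are standard SQL functions such as AVG and STDDEV. It only produces the SQL text, so you'd still run it through whatever connection you already use.

```python
def build_targeted_stats_query(table, column, stats=("AVG", "STDDEV")):
    """Build a SELECT that computes only the requested aggregates for one column.

    Hypothetical helper for illustration; not part of VerticaPy.
    """
    allowed = {"AVG", "STDDEV", "MIN", "MAX", "COUNT"}
    parts = []
    for stat in stats:
        name = stat.upper()
        if name not in allowed:
            raise ValueError(f"unsupported aggregate: {stat}")
        # Quote the column name and give each aggregate a readable alias.
        parts.append(f'{name}("{column}") AS {name.lower()}_{column}')
    return f"SELECT {', '.join(parts)} FROM {table};"


# Hypothetical table and column names.
query = build_targeted_stats_query("public.events", "amount")
print(query)
```

    Whether this is noticeably faster on a table that size depends on your data and projections, but it at least avoids computing statistics you don't plan to look at.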
