vdf.describe() really slow

BHIFP Vertica Customer



SUMMARIZE_NUMCOL in the 3rd SQL statement is really slow, taking more than 10 minutes to finish. Is there a way to speed it up?

Answers

  • Well, yeah, you're running a statistical summary across 17B rows. I'd expect that to take at least a little while...

    It's meant as an exploratory function, not something you'd actually bake into a production process. Also, if you don't need all of those statistics, you could call just the specific aggregates you're actually interested in, such as stddev().
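
As a rough sketch of the "call only what you need" suggestion, the snippet below builds a single SQL statement containing just the requested aggregates rather than a full summary. The helper `build_aggregate_query`, the table name `my_schema.big_table`, and the column `price` are hypothetical names used for illustration; none of them come from the original post. The snippet only builds the query text, so you would run it through whatever client you already use.

```python
# Hypothetical helper (not part of VerticaPy): builds one SQL statement that
# computes only the requested aggregates for the given columns.
ALLOWED_AGGREGATES = {"AVG", "STDDEV", "MIN", "MAX", "COUNT"}


def build_aggregate_query(table, columns, aggregates):
    """Return a SELECT that computes each aggregate for each column."""
    exprs = []
    for col in columns:
        for agg in aggregates:
            agg = agg.upper()
            # Reject anything outside the small whitelist above.
            if agg not in ALLOWED_AGGREGATES:
                raise ValueError(f"unsupported aggregate: {agg}")
            exprs.append(f'{agg}("{col}") AS "{agg.lower()}_{col}"')
    return f"SELECT {', '.join(exprs)} FROM {table}"


# Assumed table and column names, for illustration only.
query = build_aggregate_query("my_schema.big_table", ["price"], ["stddev", "avg"])
print(query)
```

Whether this is noticeably faster on 17B rows depends on which statistics you drop; the scan over the table still has to happen, so treat it as something to measure rather than a guaranteed fix.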
