vdf.describe() really slow

BHIFP (Vertica Customer)

SUMMARIZE_NUMCOL in the 3rd SQL statement is really slow; it takes more than 10 minutes to finish. Is there a way to speed it up?



    Well, yeah, you're running a statistical summary across 17B rows. I'd expect that to take at least a little while...

    It's meant as an exploratory function - not something you'd actually bake into a production process. Also, if you don't need all those statistics, you could just call specific things like stddev() or whatever, if that's what you're actually interested in.
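    To illustrate the "just call the specific things" idea, here's a rough sketch that builds a single SELECT with only the aggregates you ask for, rather than the full summary. The helper name `build_targeted_stats_sql`, the table `public.events`, and the column `amount` are all made up for the example; it only builds the SQL string, and you'd run it through whatever connection you already have.

```python
# Hypothetical helper: builds one SELECT with only the requested aggregates.
# Table and column names are placeholders, not taken from the original post.
ALLOWED_FUNCS = {"AVG", "STDDEV", "MIN", "MAX", "COUNT"}


def build_targeted_stats_sql(table, columns, funcs=("AVG", "STDDEV")):
    """Return a SQL string computing each function in funcs for each column."""
    exprs = []
    for col in columns:
        for fn in funcs:
            fn_upper = fn.upper()
            if fn_upper not in ALLOWED_FUNCS:
                raise ValueError("unsupported aggregate: " + fn)
            alias = col + "_" + fn_upper.lower()
            exprs.append('{}("{}") AS "{}"'.format(fn_upper, col, alias))
    if not exprs:
        raise ValueError("need at least one column and one function")
    return "SELECT " + ", ".join(exprs) + " FROM " + table


if __name__ == "__main__":
    # Only the standard deviation and mean of one column.
    print(build_targeted_stats_sql("public.events", ["amount"]))
```

    On 17B rows this still has to scan the column, so don't expect it to be instant. It just skips the statistics you weren't going to look at anyway.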
