We're Moving!

The Vertica Forum is moving to a new OpenText Analytics Database (Vertica) Community.

Join us there to post discussion topics, learn about

product releases, share tips, access the blog, and much more.

Create My New Community Account Now


Best practice for Data Aging/ Deleting old data — Vertica Forum

Best practice for Data Aging/ Deleting old data

rajatpaliwal86rajatpaliwal86 Vertica Customer

What is the best practice to clean up old data automatically? In our design, we have defined large facts tables partitioned by a day id. Then we have scheduled a cron job that is scheduled to run once a day - to delete the old partition's data that are being aged out. However, at the same time, we also want to monitor the disk storage and drop partition accordingly, what could be the best strategy to do it?
Can we make use of DISK_STORAGE table to achieve the same? https://www.vertica.com/docs/9.2.x/HTML/Content/Authoring/SQLReferenceManual/SystemTables/MONITOR/DISK_STORAGE.htm

Answers

  • moshegmosheg Vertica Employee Administrator

    The best practice is to schedule via cron the use of DROP_PARTITIONS function to drop old partitions as you do.
    There might be a gap between the time you drop partitions for this to be reflected in DISK_STORAGE.
    For up-to-date status check of disk usage of files and directories use the Linux commands df -h
    or du -sh /file path

Leave a Comment

BoldItalicStrikethroughOrdered listUnordered list
Emoji
Image
Align leftAlign centerAlign rightToggle HTML viewToggle full pageToggle lights
Drop image/file