Automated profiling tools to sift through the pre-made scripts for object migration
Min_Yun_Chan
Vertica Employee Employee
Which are the automated profiling tools to sift through the pre-made scripts for object migration. Can you please share the links? Is there any documentation on this?
Question from The shortest path to Vertica: Best practices for data warehouse migration & ETL
@marcothesane @Maurizio
Tagged:
0
Answers
Hi @Min_Yun_Chan -
d2l - which reads one csv file and generates a matching CREATE TABLE statement, can be found here:
https://github.com/marco-the-sane/d2l
And I have a combination of two pretty complex scripts, SQL generating SQL; they scan the "v_catalog.columns" system table to create matching profiling queries, and generate runnable SQL code out of those scans.
genrecast.sqlscans the string representation of all strings in the table, and creates a CREATE VIEW that creates a view with better high-level data types, like NUMERIC, TIMESTAMP, or, in fact , strings.genpf2ddl.sqlscans the views created in the previous step, to discernDATEs fromTIMESTAMPs,INTEGERs fromNUMERICs, etc. to deduct ideal and performant data types, creating aCREATE TABLEstatement, and anINSERT ... SELECTstatement selecting from the view from the previous step and writing into the target table.genrecast.sqlis a 99-line script;genpf2ddl.sqlis a270-line script. They are subject to changes at any time, as they are maturing along with our presales organisation's work on real assignments. So they're not yet on GitHub. On request, with a heavy caveat on its un-foolproofness and volatility, I might change my mind, though ...Good luck ...