Hive to Vertica data export with Unix named pipe
Hi,
Can someone please help me do a large, fast export from Hive to Vertica without using a Hadoop connector?
Currently I am exporting the data through a Unix named pipe, but the performance is not good.
With about 5 parallel threads loading the data into Vertica, the export takes approximately 230 minutes for 1.6 billion records.
Can someone please help me improve this performance and tell me whether the export can be optimized?
Thank you
kosmiktechnologies.com
Comments
Why not just read directly from HDFS?
See:
https://my.vertica.com/docs/9.0.x/HTML/index.htm#Authoring/HadoopIntegrationGuide/libhdfs/ReadingDirectlyFromHDFS.htm
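To make that concrete, here is a rough sketch of the kind of COPY statement this would involve, built as a string in Python so it can be passed to whatever client you already use. The table name and HDFS path are made up for illustration, and it assumes Vertica is already configured to reach your HDFS cluster. The default branch assumes Hive's default text format, which separates fields with Ctrl-A (octal 001); if your tables are stored as ORC or Parquet, the other branch applies.

```python
def build_hdfs_copy(table, hdfs_glob, file_format="delimited", delimiter="\x01"):
    """Return a Vertica COPY statement that reads files directly from HDFS.

    table and hdfs_glob are hypothetical placeholders supplied by the caller.
    """
    source = f"COPY {table} FROM '{hdfs_glob}'"
    if file_format in ("orc", "parquet"):
        # Columnar files are read with the matching parser keyword.
        return f"{source} {file_format.upper()};"
    # Delimited text: emit the delimiter as an octal escape, e.g. E'\001'.
    return f"{source} DELIMITER E'\\{ord(delimiter):03o}';"


if __name__ == "__main__":
    # Hypothetical table and path, for illustration only.
    print(build_hdfs_copy("sales", "hdfs:///user/hive/warehouse/sales/*"))
    print(build_hdfs_copy("sales", "hdfs:///user/hive/warehouse/sales/*", "orc"))
```

Reading the warehouse files this way skips the Hive query and the named pipe entirely, so it is worth trying before tuning the pipe-based export.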
Hi!
Can you specify where the bottleneck is (reading data from Hive or writing data to Vertica)? Is it an IO problem or a network problem? Is it a CPU bottleneck (for example, some custom parser)?
Your post doesn't say anything about this.
PS: too many questions.
You need to specify where the problem is (it may be a hardware problem, and your disks simply can't deliver better IO).
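One way to narrow it down, as a minimal sketch: drain the export stream without loading it anywhere and see how fast the Hive side alone can produce data. If that rate is close to what you get end to end, the reader is the bottleneck; if it is much higher, look at the Vertica load or the network. The pipe path in the comment is hypothetical, and the helper names are mine, not part of any tool.

```python
import io
import time


def measure_read_rate(stream, chunk_size=1 << 20):
    """Read a binary stream to the end, discarding the data.

    Returns (bytes_read, elapsed_seconds).
    """
    total = 0
    start = time.perf_counter()
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:  # empty bytes means end of stream
            break
        total += len(chunk)
    return total, time.perf_counter() - start


def rows_per_second(rows, minutes):
    """Average row rate for a load that moved `rows` in `minutes`."""
    return rows / (minutes * 60)


if __name__ == "__main__":
    # In-memory stand-in; for a real test open the pipe instead, e.g.
    # open("/tmp/hive_export.pipe", "rb")  (hypothetical path).
    total, secs = measure_read_rate(io.BytesIO(b"x" * 10_000_000))
    print(f"read {total} bytes in {secs:.4f} s")
    print(f"current load: {rows_per_second(1.6e9, 230):,.0f} rows/s")
```

For the numbers in the question, 1.6 billion records in 230 minutes works out to about 116,000 rows per second overall, or roughly 23,000 per thread with 5 threads. That gives a baseline to compare the read-only rate against.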