We're Moving!

The Vertica Forum is moving to a new OpenText Analytics Database (Vertica) Community.

Join us there to post discussion topics, learn about

product releases, share tips, access the blog, and much more.

Create My New Community Account Now


Too many ROS containers loading data from HDFS to Vertica — Vertica Forum

Too many ROS containers loading data from HDFS to Vertica

I would like to load data from HDFS to Vertica. I'm testing with pig-connector. It works well when loading smaller size data, like 1G. But it would have "Too many ROS containers" problem when loading larger-size data, like 25G, to Vertica. The only config parameter I've changed is "pig.maxCombinedSplitSize". I modified it to reduce number of mappers. 

 

I actually had the same ROS container problem when loading data from HDFS to Vertica using Sqoop Export. 

 

Does anyone know how to fix this problem? And is there "direct" mode for pig-connector as "COPY", to load directly to ROS? Thank you!

Leave a Comment

BoldItalicStrikethroughOrdered listUnordered list
Emoji
Image
Align leftAlign centerAlign rightToggle HTML viewToggle full pageToggle lights
Drop image/file