Error while writing to Vertica from Spark using the Vertica Spark connector

Exception in thread "main" java.lang.Exception: S2V: FATAL ERROR for job S2V_job1251056748245412653. Job status information is available in the Vertica table wcc.S2V_JOB_STATUS_USER_SAMPLE. Unable to create/insert into target table wcc.T_SAMPLE_TABLE with SaveMode: Append. ERROR MESSAGE: ERROR: java.sql.SQLException: [Vertica]VJDBC INTERNAL: VIAssert((size_t)pio[i].attnum <= loadStep->queryCols.size()) failed
[Vertica][VJDBC]Detail: /scratch_a/release/svrtar14870/vbuild/vertica/Optimizer/Path/BulkLoad.cpp: 1710
at com.vertica.spark.s2v.S2V.do2Stage(S2V.scala:339)
at com.vertica.spark.datasource.DefaultSource.createRelation(VerticaSource.scala:88)
at org.apache.spark.sql.execution.command.ExecutedCommandExec.sideEffectResult$lzycompute(commands.scala:70)
at org.apache.spark.sql.execution.command.ExecutedCommandExec.sideEffectResult(commands.scala:68)
at org.apache.spark.sql.execution.command.ExecutedCommandExec.doExecute(commands.scala:86)
at org.apache.spark.sql.execution.SparkPlan$$anonfun$execute$1.apply(SparkPlan.scala:131)
at org.apache.spark.sql.execution.SparkPlan$$anonfun$execute$1.apply(SparkPlan.scala:127)
at org.apache.spark.sql.execution.SparkPlan$$anonfun$executeQuery$1.apply(SparkPlan.scala:155)
at org.apache.spark.rdd.RDDOperationScope$.withScope(RDDOperationScope.scala:151)
at org.apache.spark.sql.execution.SparkPlan.executeQuery(SparkPlan.scala:152)
at org.apache.spark.sql.execution.SparkPlan.execute(SparkPlan.scala:127)
at org.apache.spark.sql.execution.QueryExecution.toRdd$lzycompute(QueryExecution.scala:80)
at org.apache.spark.sql.execution.QueryExecution.toRdd(QueryExecution.scala:80)
at org.apache.spark.sql.DataFrameWriter$$anonfun$runCommand$1.apply(DataFrameWriter.scala:668)
at org.apache.spark.sql.DataFrameWriter$$anonfun$runCommand$1.apply(DataFrameWriter.scala:668)
at org.apache.spark.sql.execution.SQLExecution$$anonfun$withNewExecutionId$1.apply(SQLExecution.scala:78)
at org.apache.spark.sql.execution.SQLExecution$.withSQLConfPropagated(SQLExecution.scala:125)
at org.apache.spark.sql.execution.SQLExecution$.withNewExecutionId(SQLExecution.scala:73)
at org.apache.spark.sql.DataFrameWriter.runCommand(DataFrameWriter.scala:668)
at org.apache.spark.sql.DataFrameWriter.saveToV1Source(DataFrameWriter.scala:276)
at actualLoad.gspTranLoad$.main(gspTranLoad.scala:37)
at actualLoad.gspTranLoad.main(gspTranLoad.scala)


  • Can someone please help me? I searched for this issue but could not find a single answer.

  • SruthiA (Employee)

    It looks like a bug. Please open a support case and share the logs.

  • That is a very odd error. Can you provide the Spark code you are running when you get this error? What is the structure of the DataFrame? What is the structure of your target table in Vertica? What parameters are you using? Which versions of the connector, Spark, Vertica, and the JDBC driver are you using?
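
    To make the structure comparison asked about above concrete, here is a minimal sketch in plain Scala. It assumes (this is a guess, not a confirmed diagnosis) that the failed assertion may be related to the DataFrame and the target table disagreeing on columns. `SchemaCheck`, `Diff`, and `compare` are hypothetical names used only for illustration; they are not part of the Vertica connector. Names are compared case-insensitively as a simplification.

    ```scala
    // Hypothetical helper, not part of the Vertica Spark connector.
    // Compares DataFrame column names against target-table column names.
    object SchemaCheck {
      final case class Diff(
        missingInTable: Seq[String], // columns in the DataFrame but not in the table
        missingInDf: Seq[String],    // columns in the table but not in the DataFrame
        sameCount: Boolean           // true when both sides have the same number of columns
      )

      def compare(dfCols: Seq[String], tableCols: Seq[String]): Diff = {
        // Simplification: treat column names as case-insensitive.
        val dfNorm = dfCols.map(_.toLowerCase)
        val tblNorm = tableCols.map(_.toLowerCase)
        Diff(
          missingInTable = dfNorm.filterNot(tblNorm.contains),
          missingInDf = tblNorm.filterNot(dfNorm.contains),
          sameCount = dfNorm.size == tblNorm.size
        )
      }
    }
    ```

    In a Spark job the first argument could be `df.columns.toSeq`, and the second the column list taken from the target table's definition. A non-empty `missingInTable` or a false `sameCount` would be worth including in the support case.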

  • Hi, thank you very much for the response. Please see the details below.

    This is the command I am using for the load:

    val opts: Map[String, String] = Map(
    "table" -> "T_SAMPLE_TABLE",
    "dbschema" -> "wcc",
    "db" -> "dbname",
    "user" -> "user",
    "password" -> "password",
    "host" -> "",
    "hdfs_url" -> "hdfs://",
    "web_hdfs_url" -> "webhdfs://hdfs://"
    )

  • Please note that I am selecting the data from a Hive table in the above-mentioned DF structure.

  • The JAR version is vertica-9.0.1_spark2.1_scala2.11.jar

  • SruthiA (Employee)

    What is your Vertica version?

  • The Vertica version is 9.1.1-5.

