Hi Team As part of a POC We are trying to load pinot table d Apache Pinot #general

suraj kamath

04/11/2022, 7:42 AM

Hi Team, as part of a POC, we are trying to load Pinot table data into a Spark DataFrame using the Spark JDBC option. However, when we try, we see the following error:

Exception in thread "main" java.sql.SQLFeatureNotSupportedException
	at org.apache.pinot.client.base.AbstractBaseStatement.setQueryTimeout(AbstractBaseStatement.java:167)
	at org.apache.spark.sql.execution.datasources.jdbc.JDBCRDD$.resolveTable(JDBCRDD.scala:60)
	at org.apache.spark.sql.execution.datasources.jdbc.JDBCRelation$.getSchema(JDBCRelation.scala:226)
	at org.apache.spark.sql.execution.datasources.jdbc.JdbcRelationProvider.createRelation(JdbcRelationProvider.scala:35)
	at org.apache.spark.sql.execution.datasources.DataSource.resolveRelation(DataSource.scala:355)
	at org.apache.spark.sql.DataFrameReader.loadV1Source(DataFrameReader.scala:325)
	at org.apache.spark.sql.DataFrameReader.$anonfun$load$3(DataFrameReader.scala:307)
	at scala.Option.getOrElse(Option.scala:189)
	at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:307)
	at org.apache.spark.sql.DataFrameReader.load(DataFrameReader.scala:225)
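
For context, a read like the one described above might look roughly like this PySpark sketch. The controller address, table name, and both helper functions are assumptions for illustration and do not come from the thread; the `jdbc:pinot://` URL scheme and the `org.apache.pinot.client.PinotDriver` class are those of the Pinot JDBC client. The trace shows `setQueryTimeout` being called from `JDBCRDD.resolveTable`, so a read configured this way would be expected to fail during schema resolution, before any rows are fetched.

```python
def pinot_jdbc_options(controller, table):
    """Build Spark JDBC reader options for a Pinot table (hypothetical helper)."""
    return {
        # Assumed controller address, e.g. "localhost:9000".
        "url": f"jdbc:pinot://{controller}",
        "driver": "org.apache.pinot.client.PinotDriver",
        "dbtable": table,
    }


def read_pinot_table(spark, controller, table):
    """Attempt the JDBC read; `spark` is an existing SparkSession."""
    reader = spark.read.format("jdbc")
    for key, value in pinot_jdbc_options(controller, table).items():
        reader = reader.option(key, value)
    # Schema resolution happens inside load(), which is where the
    # SQLFeatureNotSupportedException in the trace above is raised.
    return reader.load()
```

Calling `read_pinot_table(spark, "localhost:9000", "myTable")` with the Pinot JDBC client jar on the Spark classpath would be one way to try reproducing the error; both argument values are placeholders.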

Kartik Khare

04/11/2022, 7:44 AM

Hi, not all features of the JDBC driver are supported currently, e.g. the setQueryTimeout method here. Could you share your use case so that I can suggest some alternatives?

suraj kamath

04/11/2022, 8:02 AM

We are exploring the possibility of querying Apache Pinot using Spark JDBC to gather distinct column values from a table.

Kartik Khare

04/11/2022, 8:03 AM

Why not get distinct values directly from Pinot?

Kartik Khare

04/11/2022, 8:03 AM

It will be much more efficient
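
One way to read "directly from Pinot" is to send a `SELECT DISTINCT` query to the broker's SQL endpoint and skip Spark for this step. The sketch below uses only the Python standard library. The broker URL, table name, column name, and all three function names are assumptions for illustration, not details from the thread.

```python
import json
import urllib.request


def distinct_query(table, column, limit=1000):
    """Build a Pinot SQL query for the distinct values of one column."""
    # An explicit LIMIT is included because Pinot applies a small
    # default limit when none is given.
    return f"SELECT DISTINCT {column} FROM {table} LIMIT {int(limit)}"


def build_request(broker_url, sql):
    """Build a POST request for the broker's /query/sql endpoint."""
    body = json.dumps({"sql": sql}).encode("utf-8")
    return urllib.request.Request(
        f"{broker_url}/query/sql",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def fetch_distinct(broker_url, table, column, limit=1000):
    """Run the query and return the distinct values as a flat list."""
    request = build_request(broker_url, distinct_query(table, column, limit))
    with urllib.request.urlopen(request) as response:
        payload = json.load(response)
    # The broker returns rows under resultTable.rows, one list per row.
    return [row[0] for row in payload["resultTable"]["rows"]]
```

For example, `fetch_distinct("http://localhost:8099", "myTable", "country")` would return the distinct values of a hypothetical `country` column, which could then be passed to a later step. The table and column names are interpolated unescaped, so this sketch is only suitable for trusted input.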

suraj kamath

04/11/2022, 10:02 AM

Hi Kartik, below is the flow: Pinot table --> Spark (read, filter, transform) --> use column data to fetch data from Postgres

Kartik Khare

04/12/2022, 7:22 PM

Got it. We haven't verified our JDBC driver's compatibility with Spark yet. I will need to check which methods need to be implemented here.
