Spark SQL cannot read a Hive table in Parquet format

When reading and writing Parquet tables registered in the Hive metastore, Spark SQL uses its own Parquet SerDe (SerDe: short for Serializer/Deserializer, the component that handles serialization and deserialization) instead of Hive's SerDe, because the built-in one performs better. This behavior is controlled by the configuration parameter spark.sql.hive.convertMetastoreParquet, which is enabled by default.
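You can confirm the current value of this flag from a running session. A minimal sketch in Scala, assuming a SparkSession built with Hive support:

import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder()
  .appName("parquet-serde-check")
  .enableHiveSupport()
  .getOrCreate()

// "true" by default: Spark uses its built-in Parquet reader/writer
// for metastore Parquet tables instead of Hive's SerDe
println(spark.conf.get("spark.sql.hive.convertMetastoreParquet"))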

Sometimes, however, Spark's built-in deserializer cannot parse the Parquet data written by Hive, and the table cannot be read. In that case, set this parameter to false so that Spark falls back to Hive's SerDe:

SET spark.sql.hive.convertMetastoreParquet = false;
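The same switch can be flipped programmatically before querying the table. A hedged sketch in Scala, reusing the spark session from above (the table name hive_db.parquet_table is a placeholder):

// Fall back to Hive's SerDe for metastore Parquet tables
spark.conf.set("spark.sql.hive.convertMetastoreParquet", "false")

// Subsequent reads of the Hive table now go through Hive's SerDe
val df = spark.sql("SELECT * FROM hive_db.parquet_table")
df.show()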

 

Origin: blog.csdn.net/x950913/article/details/106211587