spark-thrift-server 报错 Wrong FS

文章目录

@[toc]
具体报错
实际原因
查看 hive 元数据
修改 spark-thrift-server 配置
修改 hive 元数据

具体报错

spark-thrift-server 执行删表语句，出现如下报错

Error: org.apache.hive.service.cli.HiveSQLException: Error running query: org.apache.spark.sql.AnalysisException: org.apache.hadoop.hive.ql.metadata.HiveException: MetaException(message:java.lang.IllegalArgumentException: Wrong FS: hdfs://RMSS02ETL:9000/user/hive/warehouse/meta_data.db/dt_segment, expected: hdfs://hadoopmaster) at org.apache.spark.sql.hive.thriftserver.SparkExecuteStatementOperation.org$apache$spark$sql$hive$thriftserver$SparkExecuteStatementOperation$$execute(SparkExecuteStatementOperation.scala:361) at org.apache.spark.sql.hive.thriftserver.SparkExecuteStatementOperation$$anon$2$$anon$3.$anonfun$run$2(SparkExecuteStatementOperation.scala:263) at scala.runtime.java8.JFunction0$mcV$sp.apply(JFunction0$mcV$sp.java:23)at org.apache.spark.sql.hive.thriftserver.SparkOperation.withLocalProperties(SparkOperation.scala:78) at org.apache.spark.sql.hive.thriftserver.SparkOperation.withLocalProperties$(SparkOperation.scala:62) at org.apache.spark.sql.hive.thriftserver.SparkExecuteStatementOperation.withLocalProperties(SparkExecuteStatementOperation.scala:43) at org.apache.spark.sql.hive.thriftserver.SparkExecuteStatementOperation$$anon$2$$anon$3.run(SparkExecuteStatementOperation.scala:263) at org.apache.spark.sql.hive.thriftserver.SparkExecuteStatementOperation$$anon$2$$anon$3.run(SparkExecuteStatementOperation.scala:258) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:422) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1746) at org.apache.spark.sql.hive.thriftserver.SparkExecuteStatementOperation$$anon$2.run(SparkExecuteStatementOperation.scala:272) at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) at java.util.concurrent.FutureTask.run(FutureTask.java:266)at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) at java.lang.Thread.run(Thread.java:748)
Caused by: org.apache.spark.sql.AnalysisException: org.apache.hadoop.hive.ql.metadata.HiveException: MetaException(message:java.lang.IllegalArgumentException: Wrong FS: hdfs://RMSS02ETL:9000/user/hive/warehouse/meta_data.db/dt_segment, expected: hdfs://hadoopmaster) at org.apache.spark.sql.hive.HiveExternalCatalog.withClient(HiveExternalCatalog.scala:112) at org.apache.spark.sql.hive.HiveExternalCatalog.dropTable(HiveExternalCatalog.scala:517) at org.apache.spark.sql.catalyst.catalog.ExternalCatalogWithListener.dropTable(ExternalCatalogWithListener.scala:104) at org.apache.spark.sql.catalyst.catalog.SessionCatalog.dropTable(SessionCatalog.scala:778) at org.apache.spark.sql.execution.command.DropTableCommand.run(ddl.scala:248) at org.apache.spark.sql.execution.command.ExecutedCommandExec.sideEffectResult$lzycompute(commands.scala:70) at org.apache.spark.sql.execution.command.ExecutedCommandExec.sideEffectResult(commands.scala:68) at org.apache.spark.sql.execution.command.ExecutedCommandExec.executeCollect(commands.scala:79) at org.apache.spark.sql.Dataset.$anonfun$logicalPlan$1(Dataset.scala:228) at org.apache.spark.sql.Dataset.$anonfun$withAction$1(Dataset.scala:3687) at org.apache.spark.sql.execution.SQLExecution$.$anonfun$withNewExecutionId$5(SQLExecution.scala:103) at org.apache.spark.sql.execution.SQLExecution$.withSQLConfPropagated(SQLExecution.scala:163) at org.apache.spark.sql.execution.SQLExecution$.$anonfun$withNewExecutionId$1(SQLExecution.scala:90)at org.apache.spark.sql.SparkSession.withActive(SparkSession.scala:772) at org.apache.spark.sql.execution.SQLExecution$.withNewExecutionId(SQLExecution.scala:64) at org.apache.spark.sql.Dataset.withAction(Dataset.scala:3685) at org.apache.spark.sql.Dataset.<init>(Dataset.scala:228) at org.apache.spark.sql.Dataset$.$anonfun$ofRows$2(Dataset.scala:99) at org.apache.spark.sql.SparkSession.withActive(SparkSession.scala:772) at org.apache.spark.sql.Dataset$.ofRows(Dataset.scala:96) at org.apache.spark.sql.SparkSession.$anonfun$sql$1(SparkSession.scala:615) at org.apache.spark.sql.SparkSession.withActive(SparkSession.scala:772) at org.apache.spark.sql.SparkSession.sql(SparkSession.scala:610) at org.apache.spark.sql.SQLContext.sql(SQLContext.scala:650) at org.apache.spark.sql.hive.thriftserver.SparkExecuteStatementOperation.org$apache$spark$sql$hive$thriftserver$SparkExecuteStatementOperation$$execute(SparkExecuteStatementOperation.scala:325) ... 16 more

实际原因

hadoop 使用了 ha 模式，有双 namenode，spark-thrift-server 配置的 --conf spark.sql.warehouse.dir 地址是其中一个 namenode 地址，需要修改成 nameservice 的地址
原因是 hive-metastore 配置的地址是 nameservice 地址，hive 元数据有问题，所以可以建库建表，可以查询，但是不能删表

查看 hive 元数据

hive.dbs - hive 库元数据信息
hive.sds - hive 表元数据信息

查看默认的 hdfs 路径

 select * from hive.dbs where NAME='default';

默认的 hdfs 地址是走的 nameservice

+-------+-----------------------+-----------------------------------------+---------+------------+------------+ 
| DB_ID | DESC                  | DB_LOCATION_URI                         | NAME    | OWNER_NAME | OWNER_TYPE |
+-------+-----------------------+-----------------------------------------+---------+------------+------------+ 
| 1     | Default Hive database | hdfs://hadoopmaster/user/hive/warehouse | default | public     | ROLE       | 
+-------+-----------------------+-----------------------------------------+---------+------------+------------+

查看错误的 hdfs 地址

select * from hive.dbs where DB_LOCATION_URI like '%RMSS02ETL%';

错误的 hdfs 地址走的是 namenode 地址

+-------+------+--------------------------------------------------------+-----------+------------+------------+ 
| DB_ID | DESC | DB_LOCATION_URI                                        | NAME      | OWNER_NAME | OWNER_TYPE |
+-------+------+--------------------------------------------------------+-----------+------------+------------+ 
|    12 |      | hdfs://RMSS02ETL:9000/user/hive/warehouse/meta_data.db | meta_data | hive       |     USER   |
+-------+------+--------------------------------------------------------+-----------+------------+------------+

查看 hive 表元数据数据

select * from hive.sds where LOCATION like '%RMSS02ETL%' \G;

LOCATION 处的地址也是 namenode 的地址

                    SD_ID: 3768CD_ID: 378INPUT_FORMAT: org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormatIS_COMPRESSED:
IS_STOREDASSUBDIRECTORIES:LOCATION: hdfs://RMSS02ETL:9000/user/hive/warehouse/meta_data.db/dt_segmentNUM_BUCKETS: -1OUTPUT_FORMAT: org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormatSERDE_ID: 3768

修改 spark-thrift-server 配置

--conf spark.sql.warehouse.dir 参数修改成 nameservice 的地址，重启 spark-thrift-server 使配置生效

修改 hive 元数据

修改 hive 库元数据

update hive.dbs set DB_LOCATION_URI=REPLACE(DB_LOCATION_URI,'RMSS02ETL:9000','hadoopmaster');

修改 hive 表元数据

update hive.sds set LOCATION=REPLACE(LOCATION,'RMSS02ETL:9000','hadoopmaster');

最后重新尝试删表，可以成功

本文来自互联网用户投稿，该文观点仅代表作者本人，不代表本站立场。本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如若转载，请注明出处：http://www.mzph.cn/news/237859.shtml

如若内容造成侵权/违法违规/事实不符，请联系多彩编程网进行投诉反馈email:809451989@qq.com，一经查实，立即删除！

spark-thrift-server 报错 Wrong FS

文章目录

@[toc]
具体报错
实际原因
查看 hive 元数据
修改 spark-thrift-server 配置
修改 hive 元数据

文章目录

具体报错

实际原因

查看 hive 元数据

修改 spark-thrift-server 配置

修改 hive 元数据

相关文章

kibana-7.15.2 一分钟下载、安装、部署 linux

HTML美化网页

面试题：JVM 对锁都进行了哪些优化？

易点易动设备管理系统：解决企业设备管理难题的利器

C语言数组与指针的关系，使用指针访问数组元素方法

[vue]Echart使用手册

Pycharm解释器的配置： System Intgerpreter 、Pipenv Environment、Virtualenv Environment

程序员福利：好用的第三方api接口

计算机网络个人小结

Centos9(Stream）配置Let‘s Encrypt （免费https证书）

Ubuntu搭建Nodejs服务器

基于Java SSM框架实现人事员工考勤签到请假管理系统项目【项目源码+论文说明】

css图片属性，图片自适应

制作系统安装盘教程——烧录Windows原版镜像

【JavaWeb学习笔记】13 - JSP浏览器渲染技术

【设计模式-2.5】创建型——建造者模式

Spring 5.x较上一版本的主要特性

LLaMA-Factory如何对Tokenization步骤提速

c/c++实现隐藏密码

Lua脚本在Redis中的高效应用

spark-thrift-server 报错 Wrong FS

文章目录 @[toc]具体报错实际原因查看 hive 元数据修改 spark-thrift-server 配置修改 hive 元数据

文章目录

具体报错

实际原因

查看 hive 元数据

修改 spark-thrift-server 配置

修改 hive 元数据

相关文章

文章目录

@[toc]
具体报错
实际原因
查看 hive 元数据
修改 spark-thrift-server 配置
修改 hive 元数据