paimon表读优化-Read-optimized Table

目录

  • 概述
  • 实践
    • 文档
    • 测试
  • 结束

概述

paimon 版本 : 0.7

测试目标: 类似 hudi ro

实践

文档

Read-optimized Table

在这里插入图片描述

测试

0: jdbc:hive2://10.32.36.142:10009/> select  * from `trace_log_refdes_hive_ro$ro` limit 10;24/02/28 14:24:33 INFO ExecuteStatement: Execute in full collect mode
24/02/28 14:24:33 INFO V2ScanRelationPushDown: 
Output: pcbid#139, rid#140, refdes#141, bm_circuit_no#142, timestamp#143, pickupstatus#144, serial_number#145, flag#146, kitid#147, id#148, createdate#149, etl#150, opt1#151, opt2#152, opt3#153, opt4#154, opt5#155, nozzleid#156, laneno#157, componentbarcode#158, pn#159, lotcode#160, datecode#161, verdor#162, workorder#163, dt#1642024-02-28 14:24:34.138 INFO KyuubiSessionManager-exec-pool: Thread-1643 org.apache.kyuubi.operation.ExecuteStatement: Processing root's query[defb80f5-4309-449d-b9a6-606fccf3fde8]: RUNNING_STATE -> FINISHED_STATE, time taken: 0.417 seconds
+-----------------+------------------------------+---------+----------------+----------------------+---------------+--------------------+-------+--------+-----------------------------------+----------------------+------+-------+-------+-------+-------+-------+-----------+---------+-------------------+-------+----------+-----------+---------+------------+-------------+
|      pcbid      |             xxx              | yyyyff  | zzzzzxxxxxxxx  |      timestamp       | gggggggfffff  |   ddddddddddddd    | cccc  | aaaaa  |                id                 |      createdate      | etl  | opt1  | opt2  | opt3  | opt4  | opt5  | nozzleid  | laneno  | componentbarcode  |  pn   | xyzzzzx  | xzzzzzzz  | txxxxx  | zxxxxxxxx  |     dt      |
+-----------------+------------------------------+---------+----------------+----------------------+---------------+--------------------+-------+--------+-----------------------------------+----------------------+------+-------+-------+-------+-------+-------+-----------+---------+-------------------+-------+----------+-----------+---------+------------+-------------+
| E23MPM42203175  | 514S00292-11420240109000060  | J0200   | 5              | 2024-02-23 18:16:42  | 0             | DLC4084004RPQVLAG  | 0     | NXT    | 11C0928D5A26E048E063AA2C200ABEF3  | 2024-02-23 18:14:50  | N    | NULL  | NULL  | NULL  | NULL  | NULL  | NULL      | NULL    | NULL              | NULL  | NULL     | NULL      | NULL    | NULL       | 2024-02-23  |
| E23MPM42203175  | 514S00292-11420240109000060  | J0200   | 4              | 2024-02-23 18:16:42  | 0             | DLC4084004SPQVLAF  | 0     | NXT    | 11C0928D5A27E048E063AA2C200ABEF3  | 2024-02-23 18:14:50  | N    | NULL  | NULL  | NULL  | NULL  | NULL  | NULL      | NULL    | NULL              | NULL  | NULL     | NULL      | NULL    | NULL       | 2024-02-23  |
| E23MPM42203175  | 514S00292-11420240117000057  | J0200   | 7              | 2024-02-23 18:16:42  | 0             | DLC4084004PPQVLAJ  | 0     | NXT    | 11C0928D5A28E048E063AA2C200ABEF3  | 2024-02-23 18:14:50  | N    | NULL  | NULL  | NULL  | NULL  | NULL  | NULL      | NULL    | NULL              | NULL  | NULL     | NULL      | NULL    | NULL       | 2024-02-23  |
| E23MPM42203175  | 514S00292-11420240117000057  | J0200   | 9              | 2024-02-23 18:16:42  | 0             | DLC4084004MPQVLAL  | 0     | NXT    | 11C0928D5A29E048E063AA2C200ABEF3  | 2024-02-23 18:14:50  | N    | NULL  | NULL  | NULL  | NULL  | NULL  | NULL      | NULL    | NULL              | NULL  | NULL     | NULL      | NULL    | NULL       | 2024-02-23  |
| E23MPM42203175  | 514S00292-11420240117000056  | J0200   | 12             | 2024-02-23 18:16:42  | 0             | DLC4084004VPQVLAC  | 0     | NXT    | 11C0928D5A2CE048E063AA2C200ABEF3  | 2024-02-23 18:14:50  | N    | NULL  | NULL  | NULL  | NULL  | NULL  | NULL      | NULL    | NULL              | NULL  | NULL     | NULL      | NULL    | NULL       | 2024-02-23  |
| E23MPM42203176  | 514S00292-11420240117000056  | J0200   | 1              | 2024-02-23 18:16:42  | 0             | DLC4084005NPQVLAG  | 0     | NXT    | 11C0928D5A31E048E063AA2C200ABEF3  | 2024-02-23 18:14:50  | N    | NULL  | NULL  | NULL  | NULL  | NULL  | NULL      | NULL    | NULL              | NULL  | NULL     | NULL      | NULL    | NULL       | 2024-02-23  |
| E23MPM42203176  | 514S00292-11420240117000059  | J0200   | 4              | 2024-02-23 18:16:42  | 0             | DLC4084005GPQVLAN  | 0     | NXT    | 11C0928D5A34E048E063AA2C200ABEF3  | 2024-02-23 18:14:50  | N    | NULL  | NULL  | NULL  | NULL  | NULL  | NULL      | NULL    | NULL              | NULL  | NULL     | NULL      | NULL    | NULL       | 2024-02-23  |
| E23MPM42203176  | 514S00292-11420240116000073  | J0200   | 10             | 2024-02-23 18:16:42  | 0             | DLC4084005MPQVLAH  | 0     | NXT    | 11C0928D5A3AE048E063AA2C200ABEF3  | 2024-02-23 18:14:50  | N    | NULL  | NULL  | NULL  | NULL  | NULL  | NULL      | NULL    | NULL              | NULL  | NULL     | NULL      | NULL    | NULL       | 2024-02-23  |
| E23MPM42201540  | 117S0158-A420240115001804    | R0514   | 7              | 2024-02-23 18:16:42  | 0             | DLC40860DTVPQVLAW  | 0     | NXT    | 11C0928D5A59E048E063AA2C200ABEF3  | 2024-02-23 18:14:51  | N    | NULL  | NULL  | NULL  | NULL  | NULL  | NULL      | NULL    | NULL              | NULL  | NULL     | NULL      | NULL    | NULL       | 2024-02-23  |
| E23MPM42201540  | 117S0158-A420240115001804    | R0401   | 1              | 2024-02-23 18:16:42  | 0             | DLC40860DU4PQVLAJ  | 0     | NXT    | 11C0928D5A5DE048E063AA2C200ABEF3  | 2024-02-23 18:14:51  | N    | NULL  | NULL  | NULL  | NULL  | NULL  | NULL      | NULL    | NULL              | NULL  | NULL     | NULL      | NULL    | NULL       | 2024-02-23  |
+-----------------+------------------------------+---------+----------------+----------------------+---------------+--------------------+-------+--------+-----------------------------------+----------------------+------+-------+-------+-------+-------+-------+-----------+---------+-------------------+-------+----------+-----------+---------+------------+-------------+
10 rows selected (0.435 seconds)
0: jdbc:hive2://10.32.36.142:10009/> select * from `trace_log_refdes_hive_ro$ro` where id='11C0928D5A2CE048E063AA2C200ABEF3';
2024-02-28 14:25:14.794 INFO KyuubiSessionManager-exec-pool: Thread-1646 org.apache.kyuubi.operation.ExecuteStatement: Processing root's query[7cf0bdca-b375-4287-8f3b-64968d09b69c]: PENDING_STATE -> RUNNING_STATE, statement:
select * from `trace_log_refdes_hive_ro$ro` where id='11C0928D5A2CE048E063AA2C200ABEF3'24/02/28 14:25:14 INFO V2ScanRelationPushDown: 
Pushing operators to trace_log_refdes_hive_ro$ro
Pushed Filters: IsNotNull(id), EqualTo(id,11C0928D5A2CE048E063AA2C200ABEF3)
Post-Scan Filters: isnotnull(id#253),(id#253 = 11C0928D5A2CE048E063AA2C200ABEF3)24/02/28 14:25:14 INFO V2ScanRelationPushDown: 
Output: pcbid#244, rid#245, refdes#246, bm_circuit_no#247, timestamp#248, pickupstatus#249, serial_number#250, flag#251, kitid#252, id#253, createdate#254, etl#255, opt1#256, opt2#257, opt3#258, opt4#259, opt5#260, nozzleid#261, laneno#262, componentbarcode#263, pn#264, lotcode#265, datecode#266, verdor#267, workorder#268, dt#2692024-02-28 14:25:16.622 INFO KyuubiSessionManager-exec-pool: Thread-1646 org.apache.kyuubi.operation.ExecuteStatement: Processing root's query[7cf0bdca-b375-4287-8f3b-64968d09b69c]: RUNNING_STATE -> FINISHED_STATE, time taken: 1.828 seconds
+-----------------+------------------------------+---------+----------------+----------------------+---------------+--------------------+-------+--------+-----------------------------------+----------------------+------+-------+-------+-------+-------+-------+-----------+---------+-------------------+-------+----------+-----------+---------+------------+-------------+
|      pcbid      |             xxx              | yyyyff  | zzzzzxxxxxxxx  |      timestamp       | gggggggfffff  |   ddddddddddddd    | cccc  | aaaaa  |                id                 |      createdate      | etl  | opt1  | opt2  | opt3  | opt4  | opt5  | nozzleid  | laneno  | componentbarcode  |  pn   | xyzzzzx  | xzzzzzzz  | txxxxx  | zxxxxxxxx  |     dt      |
+-----------------+------------------------------+---------+----------------+----------------------+---------------+--------------------+-------+--------+-----------------------------------+----------------------+------+-------+-------+-------+-------+-------+-----------+---------+-------------------+-------+----------+-----------+---------+------------+-------------+
| E23MPM42203175  | 514S00292-11420240117000056  | J0200   | 12             | 2024-02-23 18:16:42  | 0             | DLC4084004VPQVLAC  | 0     | NXT    | 11C0928D5A2CE048E063AA2C200ABEF3  | 2024-02-23 18:14:50  | N    | NULL  | NULL  | NULL  | NULL  | NULL  | NULL      | NULL    | NULL              | NULL  | NULL     | NULL      | NULL    | NULL       | 2024-02-23  |
+-----------------+------------------------------+---------+----------------+----------------------+---------------+--------------------+-------+--------+-----------------------------------+----------------------+------+-------+-------+-------+-------+-------+-----------+---------+-------------------+-------+----------+-----------+---------+------------+-------------+
1 row selected (1.839 seconds)
0: jdbc:hive2://10.32.36.142:10009/> select count(*) from  `trace_log_refdes_hive_ro`
. . . . . . . . . . . . . . . . . .> ;
2024-02-28 14:30:14.554 INFO KyuubiSessionManager-exec-pool: Thread-1649 org.apache.kyuubi.operation.ExecuteStatement: Processing root's query[fbf94a7a-ac81-4e08-9316-63179fab1658]: PENDING_STATE -> RUNNING_STATE, statement:
select count(*) from  `trace_log_refdes_hive_ro`
24/02/28 14:30:14 INFO ExecuteStatement: Processing root's query[fbf94a7a-ac81-4e08-9316-63179fab1658]: PENDING_STATE -> RUNNING_STATE, statement:
select count(*) from  `trace_log_refdes_hive_ro`
24/02/28 14:30:14 INFO ExecuteStatement: Spark application name: kyuubi_USER_SPARK_SQL_root_default_6fdc3b8a-61da-4d54-80bc-74a358c64f6eapplication ID: application_1708505130791_0031application web UI: http://hadoop01:8088/proxy/application_1708505130791_0031master: yarndeploy mode: clusterversion: 3.4.2Start time: 2024-02-28T14:23:25.247User: root
24/02/28 14:30:14 INFO ExecuteStatement: Execute in full collect mode
24/02/28 14:30:14 INFO V2ScanRelationPushDown: 
Output: 24/02/28 14:30:36 INFO AdaptiveSparkPlanExec: Final plan:
*(2) HashAggregate(keys=[], functions=[count(1)], output=[count(1)#376L])
+- ShuffleQueryStage 0+- Exchange SinglePartition, ENSURE_REQUIREMENTS, [plan_id=87]+- *(1) HashAggregate(keys=[], functions=[partial_count(1)], output=[count#379L])+- *(1) Project+- BatchScan trace_log_refdes_hive_ro[] PaimonScan: [trace_log_refdes_hive_ro] RuntimeFilters: []24/02/28 14:30:36 INFO CodeGenerator: Code generated in 6.663188 ms
24/02/28 14:30:36 INFO ExecuteStatement: Processing root's query[fbf94a7a-ac81-4e08-9316-63179fab1658]: RUNNING_STATE -> FINISHED_STATE, time taken: 21.644 seconds
2024-02-28 14:30:36.200 INFO KyuubiSessionManager-exec-pool: Thread-1649 org.apache.kyuubi.operation.ExecuteStatement: Query[fbf94a7a-ac81-4e08-9316-63179fab1658] in FINISHED_STATE
2024-02-28 14:30:36.200 INFO KyuubiSessionManager-exec-pool: Thread-1649 org.apache.kyuubi.operation.ExecuteStatement: Processing root's query[fbf94a7a-ac81-4e08-9316-63179fab1658]: RUNNING_STATE -> FINISHED_STATE, time taken: 21.646 seconds
+-----------+
| count(1)  |
+-----------+
| 77134590  |
+-----------+
1 row selected (21.667 seconds)
0: jdbc:hive2://10.32.36.142:10009/> select count(*) from  `trace_log_refdes_hive_ro$ro`
. . . . . . . . . . . . . . . . . .> ;24/02/28 14:30:45 INFO AdaptiveSparkPlanExec: Final plan:
*(2) HashAggregate(keys=[], functions=[count(1)], output=[count(1)#409L])
+- ShuffleQueryStage 0+- Exchange SinglePartition, ENSURE_REQUIREMENTS, [plan_id=122]+- *(1) HashAggregate(keys=[], functions=[partial_count(1)], output=[count#412L])+- *(1) Project+- BatchScan trace_log_refdes_hive_ro$ro[] PaimonScan: [trace_log_refdes_hive_ro$ro] RuntimeFilters: []24/02/28 14:30:45 INFO SQLOperationListener: Query [f50c8cb8-165a-4f9b-9c48-8a0149208754]: Job 7 succeeded, 0 active jobs running
+-----------+
| count(1)  |
+-----------+
| 73545958  |
+-----------+
1 row selected (4.424 seconds)
0: jdbc:hive2://10.32.36.142:10009/> select count(*) from  `trace_log_refdes_hive_ro$ro`;24/02/28 14:31:07 INFO StatsReportListener: task runtime:(count: 1, mean: 31.000000, stdev: 0.000000, max: 31.000000, min: 31.000000)
24/02/28 14:31:07 INFO DAGScheduler: Job 9 finished: collect at ExecuteStatement.scala:72, took 0.039294 s
24/02/28 14:31:07 INFO AdaptiveSparkPlanExec: Final plan:
*(2) HashAggregate(keys=[], functions=[count(1)], output=[count(1)#442L])
+- ShuffleQueryStage 0+- Exchange SinglePartition, ENSURE_REQUIREMENTS, [plan_id=157]+- *(1) HashAggregate(keys=[], functions=[partial_count(1)], output=[count#445L])+- *(1) Project+- BatchScan trace_log_refdes_hive_ro$ro[] PaimonScan: [trace_log_refdes_hive_ro$ro] RuntimeFilters: []defe-4000-bbb1-bcc3c08fec7f]: RUNNING_STATE -> FINISHED_STATE, time taken: 4.045 seconds
+-----------+
| count(1)  |
+-----------+
| 73545958  |
+-----------+
1 row selected (4.061 seconds)
0: jdbc:hive2://10.32.36.142:10009/> 

由上可以看出,在涉及需要数据合并时,性能提升巨大。

结束

paimon表读优化-Read-optimized Table 至此结束。

本文来自互联网用户投稿,该文观点仅代表作者本人,不代表本站立场。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如若转载,请注明出处:http://www.mzph.cn/news/708525.shtml

如若内容造成侵权/违法违规/事实不符,请联系多彩编程网进行投诉反馈email:809451989@qq.com,一经查实,立即删除!

相关文章

获取tensorflow lite模型指定中间层的输出

以mobilenet v2为例子,我已经训练好了模型,有tflite格式和onnx格式两种模型的权重文件,我想获取tflite模型在推理阶段neck部分的输出。 查看onnx模型得到neck最后一层位置 使用netron查看onnx模型结构 从name中能知道Reshape是neck的最后一…

微信小程序固定头部-CSS实现

效果图 代码逻辑:设置头部的高度,浮动固定后,再加个这个高度的大小的外边距 .weui-navigation-bar {position: fixed;top: 0px;left: 0px;right: 0px;height:90px; } .weui-navigation-bar_bottom{height:90px; }

SpringCloud 基本概念

开篇 学习springcloud的前提我已经认为你已经具备: 微服务的基本概念具备springboot的基本用法 eurake server:注册中心,对标zookeeper eurake client:服务,对标dubbo ribbon:负载均衡,对标nginx feign:与ribbon类似,目前项目没有使用,暂时就不写 hystrix:断路…

双指令集成一体控制比例放大器

双指令独立输入比例放大器是一种能够接收两个独立指令输入来控制两个比例电磁铁的比例阀放大器。 该类放大器可以同时控制两个单电磁铁比例阀,每一组都可以根据不同的指令输入进行独立操作。 它通常兼容多种类型的指令输入,如0-10V、0-5V以及4-20mA等&…

docker版本 jenkins配置gitlab自动部署

前端项目 Build steps pwd npm config set registry https://registry.npm.taobao.org npm -v node -v #npm install npm run build:prod tar -czvf QASystem.tar.gz distpwd cd /data/zhouxy37/vue_deploy tar -zxvf QASystem.tar.gz sudo mv dist QASystem cp -r QASyste…

机器人内部传感器阅读梳理及心得-速度传感器-模拟式速度传感器

速度传感器是机器人内部传感器之一,是闭环控制系统中不可缺少的重要组成部分,它用来测量机器人关节的运动速度。可以进行速度测量的传感器很多,如进行位置测量的传感器大多可同时获得速度的信息。但是应用最广泛、能直接得到代表转速的电压且…

fastjson序列化MessageExt对象问题(1.2.78之前版本)

前言 无论是kafka&#xff0c;还是RocketMq&#xff0c;消费者方法参数中的MessageExt对象不能被 fastjson默认的方式序列化。 一、查看代码 Override public ConsumeConcurrentlyStatus consumeMessage(List<MessageExt> msgs,ConsumeConcurrentlyContext context) {t…

江科大stm32学习笔记——【4-1】OLED

一.原理 1.调试方式 串口调试&#xff1a;通过串口通信&#xff0c;将调试信息发送到电脑端&#xff0c;电脑使用串口助手显示调试信息。 显示屏调试&#xff1a;直接将显示屏连接到单片机&#xff0c;将调试信息打印在显示屏上。 Keil调试模式&#xff1a;借助Keil软件的调…

resilience4j 2.0.0版本使用要求最低JDK17(使用踩坑记录)

文章目录 &#x1f50a;博主介绍&#x1f964;本文内容&#x1f4e2;文章总结&#x1f4e5;博主目标 &#x1f50a;博主介绍 &#x1f31f;我是廖志伟&#xff0c;一名Java开发工程师、Java领域优质创作者、CSDN博客专家、51CTO专家博主、阿里云专家博主、清华大学出版社签约作…

六自由度Stewart平台的matlab模拟与仿真

目录 1.课题概述 2.系统仿真结果 3.核心程序与模型 4.系统原理简介 4.1运动学原理 4.2 Stewart平台运动学方程 5.完整工程文件 1.课题概述 六自由度Stewart平台的matlab模拟与仿真&#xff0c;模拟六自由度Stewart平台的动态变化情况以及伺服角度。 2.系统仿真结果 3.核…

v-rep--websocket接口

websocket是什么 V-REP 中的 Web Socket 是一种用于在 V-REP 和外部应用程序之间进行通信的协议和技术。Web Socket 基于 TCP 连接&#xff0c;可以提供双向、实时的数据传输&#xff0c;适用于互动性或实时交互性应用。 (比如v-rep在云服务器上运行&#xff0c;通过websocke…

【国产MCU】-CH32V307-定时器同步模式

定时器同步模式 文章目录 定时器同步模式1、定时器同步模式介绍2、驱动API介绍3、定时器同步模式实例1、定时器同步模式介绍 CH32V307的定时器能够输出时钟脉冲(TRGO),也能接收其他定时器的输入(ITRx)。不同的定时器的ITRx的来源(别的定时器的TRGO)是不一样的。 通用定…

Covalent Network的长期数据可用性 获得了众多加密 KOL的肯定及支持

随着 Web3 生态系统的动态发展&#xff0c;Covalent Network&#xff08;CQT&#xff09;的关键性正在显现&#xff0c;通过提供分布式、加密安全的数据层&#xff0c;以解决长期数据可用性的问题。Covalent Network&#xff08;CQT&#xff09;不仅仅是一个工具&#xff0c;更…

企业内部文件资料如何进行加密 ——防止泄露?

企业内部文件资料的加密是防止数据泄露的关键措施之一。 www.weaem.com 以下是一些建议&#xff0c;用于在企业内部进行文件资料的加密&#xff0c;以防止数据泄露&#xff1a; 选择适合的加密技术&#xff1a; 透明加密&#xff1a;这种加密方式允许用户在不改变原有操作习惯的…

简单网站模板1(HTML)

想要拥有自己的网站&#xff0c;却不知该如何才能简约好看&#xff0c;接下来分享一种自己搭建的网站模板&#xff0c;希望大家喜欢。 展示图&#xff1a; CODE: <!DOCTYPE html> <html> <head><title>我的网站</title><style>body {fo…

eltable 合计行添加tooltip

eltable 合计行添加tooltip 问题描述&#xff1a; eltable 合计行单元格内容过长会换行&#xff0c;需求要求合计行数据超长显示 … &#xff0c;鼠标 hover 时显示提示信息。 解决方案&#xff1a;eltable合计行没有对外的修改接口&#xff0c;想法是 自己实现一个tooltip&a…

JVM对象创建与内存分配机制

JVM对象创建与内存分配机制 JVM对象创建与内存分配机制 JVM对象创建与内存分配机制对象的创建过程内存分配对象栈上分配对象逃逸分析标量替换 对象在Eden区分配大对象直接进入老年代长期存活的对象将进入老年代对象年龄动态判断老年代空间分配担保机制 对象头与指针压缩对象头利…

课时49:表达式_表达式进阶_集合基础

3.3.2 集合基础 学习目标 这一节&#xff0c;我们从 基础知识、简单实践、小结 三个方面来学习。 基础知识 简介 所谓的集合&#xff0c;主要是针对多个条件表达式组合后的结果&#xff0c;尤其是针对于逻辑场景的组合。初中数学的相关逻辑示意图&#xff1a;表现样式 两个…

将四个主页面配置为 tab-bar 的子路由

使用vant组件库 路由 {path: /, name: layout,component: () > import(/views/layout),children: [{path: , // 默认子路由name: home,component: () > import(/views/home)},{path: qa,name: qa,component: () > import(/views/qa)},{path: video,name: video,compo…

阿里云2核4G服务器租用价格85元一年,30元3个月

阿里云2核4G服务器多少钱一年&#xff1f;2核4G服务器1个月费用多少&#xff1f;2核4G服务器30元3个月、85元一年&#xff0c;轻量应用服务器2核4G4M带宽165元一年&#xff0c;本文阿里云服务器网整理的2核4G参加活动的主机是ECS经济型e实例和u1云服务器&#xff0c;阿里云服务…