DataSophon集成ApacheImpala的过程

注意: 本次安装操作系统环境为Anolis8.9(Centos7和Centos8应该也一样) 

        DataSophon版本为DDP-1.2.1

整合的安装包我放网盘了:

通过网盘分享的文件:impala-4.4.1.tar.gz等2个文件
链接: https://pan.baidu.com/s/18KfkO_BEFa5gVcc16I-Yew?pwd=za4k 提取码: za4k

1. Apache Impala的版本我选择的是目前Github上的最新版本4.4.1

Github上 Impala提供了rpm和deb两种安装方式,由于编译源码涉及到python包以及C++包机器难下载,所以我选择了Github提供的rpm包.

2. 首先下载apache-impala-4.4.1-RELEASE_hive-3.1.3-x86_64.el8.8.rpm

        将rpm包移动到/opt目录下,然年执行yum安装

wget https://github.com/apache/impala/releases/download/4.4.1/apache-impala-4.4.1-RELEASE_hive-3.1.3-x86_64.el8.8.rpm
mv apache-impala-4.4.1-RELEASE_hive-3.1.3-x86_64.el8.8.rpm /opt
yum install -y apache-impala-4.4.1-RELEASE_hive-3.1.3-x86_64.el8.8.rpm

3. 安装之后得到/opt/impala文件夹,重命名为/opt/impala-4.4.1

cd /opt
mv impala impala-4.4.1

4. 修改conf/impala-env.sh脚本命令,修改以下三个的值

: ${JAVA_HOME:=/usr/local/jdk}# Specify extra CLASSPATH.
: ${CLASSPATH:=${IMPALA_HOME}/conf/:${IMPALA_HOME}/lib/jars/*}# Specify extra LD_LIBRARY_PATH.
: ${LD_LIBRARY_PATH:=${IMPALA_HOME}/lib/native/:${JAVA_HOME}/jre/lib/amd64/server/}

5.  在conf下将Hadoop的core-site.xml hdfs-site.xml以及hive的core-site.xml拷贝进来

hive-site.xml内容我只放了这些

<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>
<configuration><property><name>hive.metastore.warehouse.dir</name><value>/user/hive/warehouse</value></property><property><name>hive.metastore.port</name><value>9083</value></property><property><name>hive.metastore.uris</name><value>thrift://node01:9083</value></property><property><name>hive.metastore.event.db.notification.api.auth</name><value>false</value></property><property><name>hive.metastore.dml.events</name><value>true</value></property><property><name>hive.metastore.transactional.event.listeners</name><value>org.apache.hive.hcatalog.listener.DbNotificationListener</value></property></configuration>

6. 打包tar.gz,并拷贝到DDP/packages目录下以及生成md5文件

cd /opt
# 打包tar.gz
tar -zcvf impala-4.4.1.tar.gz impala-4.4.1
# 将tar.gz包拷贝到DDP/packages目录下
cp impapa-4.4.1.tar.gz /opt/datasophon/DDP/packages
cd /opt/datasophon/DDP/packages
# 生成md5文件
java -jar file-md5-1.0-SNAPSHOT-jar-with-dependencies.jar impala-4.4.1.tar.gz

7. 编写IMPALA的service_ddl.json配置文件

cd /opt/datasophon/datasophon-manager-1.2.1/conf/meta/DDP-1.2.1/
mkdir IMPALA
cd IMPALA
# 创建json文件并写入内容
vi service_ddl.json
{"name": "IMPALA","label": "Impala","description": "MPP(大规模并行处理)SQL查询引擎","version": "4.4.1","sortNum": 22,"dependencies": ["HDFS", "HIVE"],"packageName": "impala-4.4.1.tar.gz","decompressPackageName": "impala-4.4.1","roles": [{"name": "StateStored","label": "StateStored","roleType": "master","runAs": {"user": "impala","group": "hadoop"},"cardinality": "1+","sortNum": 1,"logFile": "/var/log/impala/statestored.INFO","jmxPort": 2191,"startRunner": {"timeout": "60","program": "bin/impala.sh","args": ["start", "statestored"]},"stopRunner": {"timeout": "600","program": "bin/impala.sh","args": ["stop", "statestored"]},"statusRunner": {"timeout": "60","program": "bin/impala.sh","args": ["status", "statestored"]},"restartRunner": {"timeout": "60","program": "bin/impala.sh","args": ["restart", "statestored"]},"externalLink": {"name": "StateStored Ui","label": "StateStored Ui","url": "http://${host}:25010"}}, {"name": "Catalogd","label": "Catalogd","roleType": "master","runAs": {"user": "impala","group": "hadoop"},"cardinality": "1+","sortNum": 2,"logFile": "/var/log/impala/Catalogd.INFO","jmxPort": 2191,"startRunner": {"timeout": "60","program": "bin/impala.sh","args": ["start", "catalogd"]},"stopRunner": {"timeout": "600","program": "bin/impala.sh","args": ["stop", "catalogd"]},"statusRunner": {"timeout": "60","program": "bin/impala.sh","args": ["status", "catalogd"]},"restartRunner": {"timeout": "60","program": "bin/impala.sh","args": ["restart", "catalogd"]},"externalLink": {"name": "Catalogd Ui","label": "Catalogd Ui","url": "http://${host}:25020"}}, {"name": "Impalad","label": "Impalad","roleType": "worker","runAs": {"user": "impala","group": "hadoop"},"cardinality": "1+","sortNum": 3,"logFile": "/var/log/impala/Impalad.INFO","jmxPort": 2191,"startRunner": {"timeout": "60","program": "bin/impala.sh","args": ["start", "impalad", "--enable_legacy_avx_support"]},"stopRunner": {"timeout": "600","program": "bin/impala.sh","args": ["stop", "impalad"]},"statusRunner": {"timeout": "60","program": "bin/impala.sh","args": ["status", "impalad"]},"restartRunner": {"timeout": "60","program": "bin/impala.sh","args": ["restart", "impalad", "--enable_legacy_avx_support"]}}],"configWriter": {"generators": [{"filename": "statestored_flags","configFormat": "properties","outputDirectory": "conf","includeParams": ["-hostname", "-log_dir", "-minidump_path", "custom.statestored_flags"]}, {"filename": "catalogd_flags","configFormat": "properties","outputDirectory": "conf","includeParams": ["-hostname", "-state_store_host", "-log_dir", "-minidump_path", "custom.catalogd_flags"]}, {"filename": "impalad_flags","configFormat": "properties","outputDirectory": "conf","includeParams": ["-hostname", "-state_store_host", "-catalog_service_host", "-log_dir", "-minidump_path", "-mem_limit", "custom.impalad_flags"]}]},"parameters": [{"name": "-hostname","label": "impalad部署节点IP","description": "impalad部署节点IP","required": true,"type": "input","value": "${host}","configurableInWizard": true,"hidden": false,"defaultValue": "${host}"}, {"name": "-catalog_service_host","label": "catalog_service_host部署节点IP","description": "catalog_service_host部署节点IP","required": true,"type": "input","value": "node01","configurableInWizard": true,"hidden": false,"defaultValue": "${host}"}, {"name": "-state_store_host","label": "statestore部署节点IP","description": "statestore部署节点IP","required": true,"type": "input","value": "node01","configurableInWizard": true,"hidden": false,"defaultValue": "${host}"}, {"name": "-log_dir","label": "log_dir日志路径","description": "log_dir日志路径","required": true,"type": "input","value": "/var/log/impala","configurableInWizard": true,"hidden": false,"defaultValue": "/var/log/impala"}, {"name": "-minidump_path","label": "minidump_path路径","description": "minidump_path路径","required": true,"type": "input","value": "/var/log/impala/minidumps","configurableInWizard": true,"hidden": false,"defaultValue": "/var/log/impala/minidumps"}, {"name": "-mem_limit","label": "mem_limit","description": "mem_limit","required": true,"type": "input","value": "80%","configurableInWizard": true,"hidden": false,"defaultValue": "80%"}, {"name": "custom.statestored_flags","label": "自定义配置statestored_flags","description": "自定义配置","configType": "custom","required": true,"type": "multipleWithKey","value": [{"-v": "1"},{"-log_filename": "statestored"},{"-max_log_files": "10"},{"-max_log_size": "200"}],"configurableInWizard": true,"hidden": false,"defaultValue": ""}, {"name": "custom.catalogd_flags","label": "自定义配置catalogd_flags","description": "自定义配置","configType": "custom","required": true,"type": "multipleWithKey","value": [{"-v": "1"},{"-log_filename": "catalogd"},{"-max_log_files": "10"},{"-max_log_size": "200"}],"configurableInWizard": true,"hidden": false,"defaultValue": ""}, {"name": "custom.impalad_flags","label": "自定义配置impalad_flags","description": "自定义配置","configType": "custom","required": true,"type": "multipleWithKey","value": [{"-v": "1"},{"-log_filename": "impalad"},{"-max_log_files": "10"},{"-max_log_size": "200"},{"-scratch_dirs": "/data/impala/impalad"}],"configurableInWizard": true,"hidden": false,"defaultValue": ""}]
}

8. 重启datasophon的api服务

cd /opt/datasophon/datasophon-manager-1.2.1/
启动:sh bin/datasophon-api.sh start api
停止:sh bin/datasophon-api.sh stop api
重启:sh bin/datasophon-api.sh restart api

9. 回到网页中就可以进行服务的安装了(注意安装过程中修改catalogd以及satastored所在的服务器的ip或者host名称)

由于我的目录设置为/var/log/impala, impala并没有权限进行创建该目录,只能手动创建(这个问题我知道咋解决),这个步骤要在安装服务之前就做吧.

mkdir -p /var/log/impala/minidumps
chmod 777 /var/log/impala
chmod 777 var/log/impala/minidumps

如果有读者想自己打包安装可以参考以下博客

编译部署apache-impala | 子崖说

Datasophon集成impala | 子崖说

本文来自互联网用户投稿,该文观点仅代表作者本人,不代表本站立场。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如若转载,请注明出处:http://www.mzph.cn/news/883362.shtml

如若内容造成侵权/违法违规/事实不符,请联系多彩编程网进行投诉反馈email:809451989@qq.com,一经查实,立即删除!

相关文章

C#与C++交互开发系列(十二):托管和非托管内存管理策略

前言 在进行C#与C互操作开发时&#xff0c;内存管理是一个非常重要的环节。由于C#采用托管内存管理&#xff08;由垃圾回收机制GC控制&#xff09;&#xff0c;而C则使用手动内存管理&#xff08;需要开发者负责分配和释放内存&#xff09;&#xff0c;因此跨语言调用时&#…

光耦的应用

什么是光耦 光耦是一种实现信号隔离的元器件&#xff0c;通常用于各部分电路之间&#xff0c;使其不互相受到影响。 工作原理 光耦是由一个发光二极管和一个光敏三极管封装而成的。其使用原理为&#xff1a; 当发光二极管有信号输入时&#xff0c;则会被点亮&#xff0c;此时…

PHP-FPM 性能配置优化

4 核 8 G 服务器大约可以开启 500 个 PHP-FPM&#xff0c;极限吞吐量在 580 qps &#xff08;Query Per Second 每秒查询数&#xff09;左右。 Nginx php-fpm 是怎么工作的&#xff1f; php-fpm 全称是 PHP FastCGI Process Manager 的简称&#xff0c;从名字可得知&#xff…

OJ在线编程常见输入输出(java版本)

基础知识 BufferedReader 概述 BufferedReader是Java I/O中的一个类&#xff0c;它是一个带缓冲区的字符输入流&#xff0c;用于从字符输入流中读取字符。它提供了一种逐行读取文本文件的方法&#xff0c;可以轻松地读取大量文本数据&#xff0c;并且可以通过使用缓冲区来提…

基于SSM的“众优”大学生家教平台的设计与实现

前言 对于当今社会的人们来说&#xff0c;互联网技术是必不可少的&#xff0c;随着经济和技术的不断发展&#xff0c;计算机已经深入到各个领域。“众优”大学生家教平台将人们的时间需求与计算机技术结合起来&#xff0c;架起一座桥梁&#xff0c;使用在线查看“众优”大学生…

设计模式讲解

设计原则 单一职责原则 > 一个对象应该只包含单一的职责&#xff0c;并且该职责被完整地封装在一个类中 > //一个人类 public class People {/*** 人类会编程*/public void coding(){System.out.println("int mian() {");System.out.println(" print…

CI/CD 流水线系统-开源框架Tekton

文章目录 CI/CD 流水线系统-开源框架Tekton什么是TektonTekton优点Tekton 组件介绍Tekton 概念术语 CI/CD 流水线系统-开源框架Tekton 什么是Tekton 官网&#xff1a;https://tekton.dev/ Tekton 是一个强大、灵活的构建 CI/CD 流水线系统的开源框架,允许开发者构建、测试和…

【牛客算法】某司面试算法题:循环右移二叉树

一、算法题描述 1.1 算法描述 现有一棵n个节点构成的二叉树&#xff0c;请你将每一层的节点向右循环位移k位。某层向右位移一位(即k1)的含义为&#xff1a; 若当前节点为左孩子节点&#xff0c;会变成当前节点的双亲节点的右孩子节点。 若当前节点为右儿子&#xff0c;会变成…

直播系统源码技术搭建部署流程及配置步骤

系统环境要求 PHP版本&#xff1a;5.6、7.3 Mysql版本&#xff1a;5.6&#xff0c;5.7需要关闭严格模式 Nginx&#xff1a;任何版本 Redis&#xff1a;需要给所有PHP版本安装Redis扩展&#xff0c;不需要设置Redis密码 最好使用面板安装&#xff1a;宝塔面板 - 简单好用的…

springboot中redis并发锁的等待时间设置长短

问题&#xff1a;等待时间我设置的大一点&#xff0c;还是立刻抛出错误不等待&#xff1b;个人是觉得要立刻抛出错误&#xff0c;首先如果同一把锁并发量太大&#xff0c;没有限流&#xff0c;会造成服务器连接数立刻达到最大连接数&#xff0c;服务不可用&#xff1b;其次等待…

Android——事件冲突处理

当我们给列表的item设置了点击事件后&#xff0c;又给item中的按钮设置了点击事件&#xff0c;此时item的点击事件会失效。 解决 给item的布局xml中设置以下属性 android:descendantFocusability"blocksDescendants"<LinearLayout xmlns:android"http://sc…

HT7181 16.8V,14A高效升压转换器

1、特征 输入电压范围:2.7V-16V 输出电压范围:最高16.8V 固定开关频率:360kHz 可编程峰值电流:14A 高转换效率: 94% (VIN 7.2V, VOUT9.3V, IOUT1.5A) 90% (VIN 7.2V, VOUT9.3V, IOUT 7A) 93% (VIN 7.2V, VOUT12V, IOUT 1.5A) 90% (VIN 7.2V, VOUT12V, IOUT 5.5A) 90% (VIN …

电脑磁盘再次分区

如何把c盘变成一个大的盘&#xff1f; 给电脑的C盘扩容通常涉及到调整硬盘分区的大小&#xff0c;这可以通过多种方法实现&#xff0c;以下是一些常见的步骤&#xff1a; 1. **使用磁盘管理工具**&#xff1a; - 在Windows系统中&#xff0c;可以使用内置的磁盘管理工具来调整…

220V降12V1A恒流点灯WT5112

220V降12V1A恒流点灯WT5112 芯片特点 高精度恒流输出&#xff1a;WT5112 是一款适用于非隔离降压型恒流 LED 驱动芯片。在 220V 降 12V、1A 恒流点灯应用中&#xff0c;它能够提供高精度的恒流输出。其恒流精度通常可以达到 3% - 5% 左右&#xff0c;这对于 LED 灯的稳定发光非…

从Docker容器中备份整个PostgreSQL

问题 现在需要从Docker容器中备份整个PostgreSQL后&#xff0c;然后&#xff0c;使用备份文件在另外一个pg的docker容器中恢复过来。 步骤 备份旧容器中的PG # 登录到旧的PG容器中 docker exec -it postgres bash # 备份数据库 pg_dumpall -c -U postgres > dump_date %…

安卓基础001

前言 也是好久没有更新博客了,最近实习也是需要学习一些知识哈哈哈哈哈哈为了更好的发展嘛,咱们从客户端开始,过程可能有点像写前端,不喜勿喷,希望在学习的过程中也可以给大家带来一些简单得帮助吧....... tips:这里跳过安卓studio安装,大家可自行寻找教程 写的不详细,只是为了…

Zookeeper面试整理-源码及实现细节

Zookeeper 的源码和实现细节是理解其高可用性、分布式一致性和高效性的关键。以下是 Zookeeper 的核心实现细节,涵盖其架构、关键模块、以及具体的代码实现机制。 1. Zookeeper 架构 Zookeeper 采用 Leader-Follower 架构,集群由多个节点组成。主要分为三种角色:Leader、Fol…

web相关知识学习笔记

一&#xff0c; web安全属于网络信息安全的一个分支&#xff0c;www即全球广域网&#xff0c;也叫万维网&#xff0c;是一个分布式图形信息系统 二&#xff0c; 1.①安全领域&#xff0c;通常将用户端&#xff08;浏览器端&#xff09;称为前端&#xff0c;服务器端称为后端 ②…

【SAP Abap】X-DOC:SAP ABAP系统变量SY-INDEX与SY-TABIX差异点

【SAP Abap】X-DOC&#xff1a;SAP ABAP系统变量SY-INDEX与SY-TABIX差异点 SAP ABAP系统变量SY-INDEX与SY-TABIX差异点 ABAP系统变量SY-INDEX与SY-TABIX都是用来记录循环的次数。 SY-INDEX在do…endo和while…endwhile中有效&#xff0c; SY-TABIX在对内表的循环查询中有效&…

从“摸黑”到“透视”:AORO A23热成像防爆手机如何改变工业检测?

在工业检测领域&#xff0c;传统的检测手段常因效率低下、精度不足和潜在的安全风险而受到诟病。随着科技的不断进步&#xff0c;一种新兴的检测技术——红外热成像技术&#xff0c;正逐渐在该领域崭露头角。近期&#xff0c;小编对一款集成红外热成像技术的AORO A23防爆手机进…