plsql中导入csvs_在命令行中使用sql分析csvs

plsql中导入csvs

If you are familiar with coding in SQL, there is a strong chance you do it in PgAdmin, MySQL, BigQuery, SQL Server, etc. But there are times you just want to use your SQL skills for quick analysis on a small/medium sized dataset.

如果您熟悉SQL编码,则很有可能在PgAdmin , MySQL , BigQuery , SQL Server等中进行编码。但是有时您只想使用SQL技能来对中小型大小进行快速分析。数据集。

With csvkit you can run any SQL on your CSV files right in your command line.

使用csvkit您可以在命令行中直接在CSV文件上运行任何SQL。

csvkit is a suite of command-line tools for converting to and working with CSV, the king of tabular file formats. Once you have csvkit installed you can use csvsql to run your SQL commands.

csvkit是一套命令行工具,用于转换为表格格式文件之王CSV并与其一起使用。 一旦你有csvkit安装就可以使用csvsql来运行SQL命令。

1.安装 (1. Installation)

If you don’t have csvkit installed, head over here and follow the installation instructions or if you’re familiar with pip you can do the following.

如果您没有安装csvkit ,请csvkit 此处并按照安装说明进行操作,或者如果您熟悉pip ,则可以执行以下操作。

pip install csvkit

You can view the csvkit documentation using below.

您可以使用以下方法查看csvkit文档。

csvsql -h

2.语法 (2. Syntax)

Now that you are all set up, you can follow this simple structure to run your queries. It is essential to note the SQL query must be written in quotation marks and must be in a single line. No line breaks.

现在您已经完成了所有设置,可以按照以下简单结构运行查询。 请务必注意,SQL查询必须用引号引起来并且必须在一行中。 没有换行符。

csvsql --query "ENTER YOUR SQL QUERY HERE"
FILE_NAME.csv

That’s it! Follow this basic code skeleton, and you are good to go.

而已! 遵循此基本代码框架,您就可以开始工作了。

Make sure you are in the same working directory as where the CSV file is located.

确保您与CSV文件位于同一工作目录中。

3.例子 (3. Example)

Below is an example of setting the directory and getting our first SQL command up and running in.

以下是设置目录并启动并运行我们的第一个SQL命令的示例。

检查目录 (Check Directory)

pwd

设置工作目录 (Set Working Directory)

Make sure the file you plan to use is in the same directory. My CSV file is in the /Documents folder.

确保计划使用的文件位于同一目录中。 我的CSV文件位于/Documents文件夹中。

cd ~/Documents

运行查询 (Run Query)

Next, we can run the query usingcsvsql

接下来,我们可以使用csvsql运行查询

Image for post
code
Image for post
output
输出

使用csvlook格式化查询输出 (Format Query Output with csvlook)

Piping with | csvlook can improve how your outputted query format.

| csvlook | csvlook可以改善输出查询格式的方式。

Image for post
code
Image for post
output
输出

将查询输出保存到新的CSV (Save Query Output to a New CSV)

Using redirection with > you can send you query output to a new file/location. Note running the code below will not output anything, since we are saving the output to a new file. The new query will save the output to the new csv file store_sales.csv

通过>使用重定向,您可以将查询输出发送到新文件/位置。 请注意,由于我们将输出保存到新文件中,因此运行下面的代码不会输出任何内容。 新查询会将输出保存到新的csv文件store_sales.csv

Image for post

You are all set! Now you can run SQL on your CSV files for quick insights without the need to go through a database.

你们都准备好了! 现在,您可以在CSV文件上运行SQL,以快速了解情况,而无需通过数据库。

If you are looking to learn more about SQL, check out my other articles.

如果您想了解有关SQL的更多信息,请查看我的其他文章。

  • SQL Cheatsheet

    SQL备忘单

  • Date/Time Functions in SQL

    SQL中的日期/时间函数

  • Using CTEs in SQL

    在SQL中使用CTE

  • Introduction to Window Functions

    窗口功能介绍

翻译自: https://towardsdatascience.com/analyze-csvs-with-sql-in-command-line-233202dc1241

plsql中导入csvs

本文来自互联网用户投稿,该文观点仅代表作者本人,不代表本站立场。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如若转载,请注明出处:http://www.mzph.cn/news/389413.shtml

如若内容造成侵权/违法违规/事实不符,请联系多彩编程网进行投诉反馈email:809451989@qq.com,一经查实,立即删除!

相关文章

第十八篇 Linux环境下常用软件安装和使用指南

提醒:如果之后要安装virtualenvwrapper的话,可以直接跳到安装virtualenvwrapper的方法,而不需要先安装好virtualenv安装virtualenv和生成虚拟环境安装virtualenv:yum -y install python-virtualenv生成虚拟环境:先切换…

莫烦Matplotlib可视化第三章画图种类代码学习

3.1散点图 import matplotlib.pyplot as plt import numpy as npn 1024 X np.random.normal(0,1,n) Y np.random.normal(0,1,n) T np.arctan2(Y,X) #用于计算颜色plt.scatter(X,Y,s75,cT,alpha0.5)#alpha是透明度 #plt.scatter(np.arange(5),np.arange(5)) #一条线的散点…

计算机科学必读书籍_5篇关于数据科学家的产品分类必读文章

计算机科学必读书籍Product categorization/product classification is the organization of products into their respective departments or categories. As well, a large part of the process is the design of the product taxonomy as a whole.产品分类/产品分类是将产品…

es6解决回调地狱问题

本文摘抄自阮一峰老师的 http://es6.ruanyifeng.com/#docs/generator-async 异步 所谓"异步",简单说就是一个任务不是连续完成的,可以理解成该任务被人为分成两段,先执行第一段,然后转而执行其他任务,等做好…

交替最小二乘矩阵分解_使用交替最小二乘矩阵分解与pyspark建立推荐系统

交替最小二乘矩阵分解pyspark上的动手推荐系统 (Hands-on recommender system on pyspark) Recommender System is an information filtering tool that seeks to predict which product a user will like, and based on that, recommends a few products to the users. For ex…

莫烦Matplotlib可视化第四章多图合并显示代码学习

4.1Subplot多合一显示 import matplotlib.pyplot as plt import numpy as npplt.figure() """ 每个图占一个位置 """ # plt.subplot(2,2,1) #将画板分成两行两列,选取第一个位置,可以去掉逗号 # plt.plot([0,1],[0,1]) # # plt.su…

python 网页编程_通过Python编程检索网页

python 网页编程The internet and the World Wide Web (WWW), is probably the most prominent source of information today. Most of that information is retrievable through HTTP. HTTP was invented originally to share pages of hypertext (hence the name Hypertext T…

Python+Selenium自动化篇-5-获取页面信息

1.获取页面title title:获取当前页面的标题显示的字段from selenium import webdriver import time browser webdriver.Chrome() browser.get(https://www.baidu.com) #打印网页标题 print(browser.title) #输出内容:百度一下,你就知道 2.…

火种 ctf_分析我的火种数据

火种 ctfOriginally published at https://www.linkedin.com on March 27, 2020 (data up to date as of March 20, 2020).最初于 2020年3月27日 在 https://www.linkedin.com 上 发布 (数据截至2020年3月20日)。 Day 3 of social distancing.社会疏离的第三天。 As I sit on…

莫烦Matplotlib可视化第五章动画代码学习

5.1 Animation 动画 import numpy as np import matplotlib.pyplot as plt from matplotlib import animationfig,ax plt.subplots()x np.arange(0,2*np.pi,0.01) line, ax.plot(x,np.sin(x))def animate(i):line.set_ydata(np.sin(xi/10))return line,def init():line.set…

data studio_面向营销人员的Data Studio —报表指南

data studioIn this guide, we describe both the theoretical and practical sides of reporting with Google Data Studio. You can use this guide as a comprehensive cheat sheet in your everyday marketing.在本指南中,我们描述了使用Google Data Studio进行…

人流量统计系统介绍_统计介绍

人流量统计系统介绍Its very important to know about statistics . May you be a from a finance background, may you be data scientist or a data analyst, life is all about mathematics. As per the wiki definition “Statistics is the discipline that concerns the …

pyhive 连接 Hive 时错误

一、User: xx is not allowed to impersonate xxx 解决办法&#xff1a;修改 core-site.xml 文件&#xff0c;加入下面的内容后重启 hadoop。 <property><name>hadoop.proxyuser.xx.hosts</name><value>*</value> </property><property…

乐高ev3 读取外部数据_数据就是新乐高

乐高ev3 读取外部数据When I was a kid, I used to love playing with Lego. My brother and I built almost all kinds of stuff with Lego — animals, cars, houses, and even spaceships. As time went on, our creations became more ambitious and realistic. There were…

图像灰度化与二值化

图像灰度化 什么是图像灰度化&#xff1f; 图像灰度化并不是将单纯的图像变成灰色&#xff0c;而是将图片的BGR各通道以某种规律综合起来&#xff0c;使图片显示位灰色。 规律如下&#xff1a; 手动实现灰度化 首先我们采用手动灰度化的方式&#xff1a; 其思想就是&#…

分析citibike数据eda

数据科学 (Data Science) CitiBike is New York City’s famous bike rental company and the largest in the USA. CitiBike launched in May 2013 and has become an essential part of the transportation network. They make commute fun, efficient, and affordable — no…

jvm感知docker容器参数

docker中的jvm检测到的是宿主机的内存信息&#xff0c;它无法感知容器的资源上限&#xff0c;这样可能会导致意外的情况。 -m参数用于限制容器使用内存的大小&#xff0c;超过大小时会被OOMKilled。 -Xmx: 默认为物理内存的1/4。 4核CPU16G内存的宿主机 java 7 docker run -m …

Flask之flask-script 指定端口

简介 Flask-Scropt插件为在Flask里编写额外的脚本提供了支持。这包括运行一个开发服务器&#xff0c;一个定制的Python命令行&#xff0c;用于执行初始化数据库、定时任务和其他属于web应用之外的命令行任务的脚本。 安装 用命令pip和easy_install安装&#xff1a; pip install…

上采样(放大图像)和下采样(缩小图像)(最邻近插值和双线性插值的理解和实现)

上采样和下采样 什么是上采样和下采样&#xff1f; • 缩小图像&#xff08;或称为下采样&#xff08;subsampled&#xff09;或降采样&#xff08;downsampled&#xff09;&#xff09;的主要目的有 两个&#xff1a;1、使得图像符合显示区域的大小&#xff1b;2、生成对应图…

r语言绘制雷达图_用r绘制雷达蜘蛛图

r语言绘制雷达图I’ve tried several different types of NBA analytical articles within my readership who are a group of true fans of basketball. I found that the most popular articles are not those with state-of-the-art machine learning technologies, but tho…