dsq(DataStation Query)是用來查詢資料檔案的SQL工具,使用的是SQLite的SQL語法,支援下列檔案格式:
- CSV
- JSON
- Excel (XLSX與XLS)
- Apache log (error and access): 尚不完整
- Nginx Access Logs: 尚不完整
- ...
[!INFO] 支援格式
https://github.com/multiprocessio/dsq#supported-data-types
1. 安裝
直接到GutHub的Releases下載執行檔,不用安裝。
2. 使用範例
dsq covid19.json "select * from {}"
dsq --pretty covid19.json "select * from {}" | more
dsq --pretty covid19.json "select count(1) from {}"
dsq --pretty covid19.json "select * from {} where Cast(deaths as number) > 100000 order by deaths"
rem csv轉JSON
dsq covid19_global_cases_and_deaths.csv > cases.json
rem Excel
dsq Report2.xlsx "SELECT * FROM {\"Sheet2\"}"
rem 多檔操作 (join)
dsq testdata/join/users.csv testdata/join/ages.json "select {0}.name, {1}.age from {0} join {1} on {0}.id = {1}.id"
[!WARNING] 注意
- 欄位要用小寫
- from {\"工作表\"}: Windows的SQL敘述裡用到雙引號時要加反斜線
3. 相關鏈接
- multiprocessio/dsq: Commandline tool for running SQL queries against JSON, CSV, Excel, Parquet, and more.
- multiprocessio/datastation: Easily query, script, and visualize data from every database, file, and API.
4. 教學影片
##
