← 返回命令列表

Linux command

parquet 命令

文件

复制后可按需替换文件名、目录或参数。

常用示例

Show file schema

parquet-tools schema [file.parquet]

Show metadata

parquet-tools meta [file.parquet]

Show first rows

parquet-tools head [file.parquet]

Convert to JSON

parquet-tools cat --json [file.parquet]

Show row count

parquet-tools rowcount [file.parquet]

Merge files

parquet-tools merge [file1.parquet] [file2.parquet] [output.parquet]

说明

Parquet is a columnar storage format for big data. parquet-tools (or parquet-cli) inspects and manipulates Parquet files, showing schema, metadata, and contents. Parquet provides efficient compression and encoding for analytics workloads.

参数

schema
Show schema.
meta
Show metadata.
head
Show first rows.
cat
Output all rows.
rowcount
Count rows.
merge
Merge files.
--json
JSON output.
-n _num_
Number of rows.

FAQ

What is the parquet command used for?

Parquet is a columnar storage format for big data. parquet-tools (or parquet-cli) inspects and manipulates Parquet files, showing schema, metadata, and contents. Parquet provides efficient compression and encoding for analytics workloads.

How do I run a basic parquet example?

Run `parquet-tools schema [file.parquet]` in a terminal, then adjust file names, paths, flags, or remote targets for your system.

What does schema do in parquet?

Show schema.