Linux command
parquet command
Files
Copy a command, then replace file names, directories, or parameters as needed.
Common examples
Show file schema
parquet-tools schema [file.parquet]
Show metadata
parquet-tools meta [file.parquet]
Show first rows
parquet-tools head [file.parquet]
Convert to JSON
parquet-tools cat --json [file.parquet]
Show row count
parquet-tools rowcount [file.parquet]
Merge files
parquet-tools merge [file1.parquet] [file2.parquet] [output.parquet]
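The single-file commands above can be looped over a directory. The following is a minimal sketch, not part of parquet-tools itself: the function name `parquet_rowcounts` and the `PARQUET_TOOL` override variable are assumptions introduced for illustration, and it relies only on the `rowcount` subcommand shown above.

```shell
#!/bin/sh
# Hypothetical helper: print the row count of every .parquet file in a directory.
# PARQUET_TOOL is an assumed override variable; it defaults to parquet-tools.
PARQUET_TOOL="${PARQUET_TOOL:-parquet-tools}"

parquet_rowcounts() {
    dir="${1:-.}"                      # directory to scan, current directory by default
    for f in "$dir"/*.parquet; do
        # Skip the unexpanded pattern when the directory has no matching files.
        [ -e "$f" ] || continue
        printf '%s: ' "$f"             # prefix each result with the file name
        "$PARQUET_TOOL" rowcount "$f"
    done
}
```

Source the script and call `parquet_rowcounts ./data` (the directory name is a placeholder). Keeping the tool name in a variable makes it easy to point the helper at a differently named or wrapped binary.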
Description
Parquet is a columnar storage format for big data. parquet-tools (or parquet-cli) inspects and manipulates Parquet files, showing schema, metadata, and contents. Parquet provides efficient compression and encoding for analytics workloads.
Parameters
- schema: Show schema.
- meta: Show metadata.
- head: Show first rows.
- cat: Output all rows.
- rowcount: Count rows.
- merge: Merge files.
- --json: JSON output.
- -n _num_: Number of rows.
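As a sketch of how the subcommands and the `-n` option combine, the helper below prints a file's schema and then its first rows. It assumes `-n` applies to `head`, which the list above does not state explicitly; the function name `parquet_preview`, the `PARQUET_TOOL` variable, and the default of 5 rows are all hypothetical choices made for this example.

```shell
#!/bin/sh
# Hypothetical helper: show the schema, then the first N rows of one file.
# PARQUET_TOOL is an assumed override variable; it defaults to parquet-tools.
PARQUET_TOOL="${PARQUET_TOOL:-parquet-tools}"

parquet_preview() {
    file="$1"
    rows="${2:-5}"    # 5 is an arbitrary default chosen for this sketch
    # Stop early if the schema cannot be read (missing or invalid file).
    "$PARQUET_TOOL" schema "$file" || return 1
    # Assumption: -n limits the number of rows printed by head.
    "$PARQUET_TOOL" head -n "$rows" "$file"
}
```

For example, `parquet_preview events.parquet 10` would request the schema followed by ten rows, where `events.parquet` is a placeholder file name.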
FAQ
What is the parquet command used for?
parquet-tools (or parquet-cli) is used to inspect and manipulate Parquet files: it shows a file's schema, metadata, and contents. Parquet itself is a columnar storage format for big data that provides efficient compression and encoding for analytics workloads.
How do I run a basic parquet example?
Run `parquet-tools schema [file.parquet]` in a terminal, replacing `[file.parquet]` with the path to your own file.
What does schema do in parquet?
The `schema` subcommand shows the schema of a Parquet file.