parquet-generator
Here are 14 public repositories matching this topic...
Synthetic Data Values Generator
- Updated
- Go
This is a simple Java POC to create Parquet files This is a Spring Boot project.
- Updated
- Java
ParquetToHuggingFace processes raw audio data, converts it into Parquet files, and uploads them to Hugging Face. The README explains how to set up the environment, configure paths, and run the scripts to generate and upload the data.
- Updated
- Python
CSV 2 Parquet and CSV2 to ORC converter with aligned interface
- Updated
- Java
ParqBridge focuses on zero PHP dependency bloat while still producing spec-compliant Parquet files by delegating the final write step to a tiny, embedded Python script using PyArrow (or any custom CLI you prefer). You keep full Laravel DX for configuration and Storage; we bridge your data to Parquet.
- Updated
- PHP
A command line tool for inspecting parquet files with PyArrow.
- Updated
- Python
Python utility to convert TXT and CSV files to Parquet
- Updated
- Python
A program that allows you to generate Escher parquets
- Updated
- Python
This tool acts as a dedicated interface between machine-efficient binary data (.parquet) and the necessity of manual human intervention. It automates the conversion process, flattens nested structures, and recompiles data back into optimized binary files directly within the GitHub ecosystem.
- Updated
- Python
Easly view, create and edit parquet files with the desktop application Parqueditor
- Updated
- Java
CLI tool for giving CSV files a schema and cast them to Parquet
- Updated
- Scala
Jupyter Notebook analyzing GitHub repository metadata using Python, Parquet, Pandas, and DuckDB
- Updated
- Jupyter Notebook
Improve this page
Add a description, image, and links to the parquet-generator topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the parquet-generator topic, visit your repo's landing page and select "manage topics."
