#parquet

  1. parquet

    Apache Parquet implementation in Rust

    v57.3.0 3.6M #apache-arrow #hadoop
  2. arrow

    Apache Arrow

    v57.3.0 2.3M #apache-arrow #ipc #pretty-print #arrow-format #compute #parquet #memory-data #chrono-tz #csv #csv-parser
  3. arrow-select

    Selection kernels for arrow arrays

    v57.3.0 3.8M #apache-arrow #kernel #arrow-format #parquet #selection #in-memory
  4. odbc2parquet

    Query an ODBC data source and store the result in a Parquet file

    v8.1.4 #parquet #sql #odbc
  5. parquet2

    Safe implementation of parquet IO

    v0.17.2 241K #parquet #compression #analytics
  6. datafusion-python

    Apache DataFusion DataFrame and SQL Query Engine

    v51.0.0 1.4K #data-fusion #python-bindings #sql-query-engine #apache-arrow #dataframe #data-fusion-query #protobuf #parquet #session-context
  7. tsdb_timon

    Efficient local storage and Amazon S3-compatible data synchronization for time-series data, leveraging Parquet for storage and DataFusion for querying, all wrapped in a simple and intuitive API

    v1.1.1 #data-fusion #parquet #apache-arrow #tsdb #s3-storage-sync
  8. parquet-variant

    Apache Parquet Variant implementation in Rust

    v57.3.0 86K #parquet #variant #arrow
  9. dataprof

    High-performance data profiler with ISO 8000/25012 quality metrics for CSV, JSON/JSONL, and Parquet files

    v0.4.85 #quality-metrics #parquet #data-quality #analysis #data-analysis
  10. parquet-variant-compute

    Apache Parquet Variant Batch Processing

    v57.3.0 86K #variant-array #parquet #variant
  11. polars-view

    A fast and interactive viewer for CSV, Json and Parquet data

    v0.52.2 #polars #csv #json #view #parquet
  12. dms-cdc-operator

    Rust-based utility for comparing the state of a list of tables in an Amazon RDS database with data stored in Parquet files on Amazon S3, particularly useful for change data capture (CDC) scenarios

    v0.1.26 1.5K #amazon-s3 #cdc #postgresql #parquet #polars
  13. nu_plugin_parquet

    nu plugin to add parquet support

    v0.20.0 #parquet #nu-plugin #nu-shell-plugin #add #metadata
  14. dms-cdc-operator-client

    Rust-based client for comparing the state of a list of tables in an Amazon RDS database with data stored in Parquet files on Amazon S3, particularly useful for change data capture (CDC) scenarios

    v0.1.24 1.3K #amazon-s3 #postgresql #parquet #cdc #polars
  15. nail-parquet

    Lightning-fast CLI for data analysis: explore, filter, transform Parquet/CSV/Excel files with SQL-powered operations

    v1.6.6 #parquet #data-fusion #data-analysis #csv #etl
  16. sas7bdat

    + CLI for decoding SAS7BDAT datasets and streaming them to modern formats

    v0.2.0 #sas #statistics #parquet #data
  17. parquet-geospatial

    Apache Parquet Geometry and Geography implementation in Rust

    v57.3.0 #geography #parquet #geometry
  18. parquet-variant-json

    Apache Parquet Variant to/from JSON

    v57.3.0 86K #parquet #variant #arrow
  19. ironbeam

    A batch processing clone of Apache Beam in Rust

    v1.1.0 #gzip #metrics-collection #data-pipeline #csv #parquet #parallel-execution #checkpointing #execution-pipeline #batch-processing #beam
  20. excelstream

    High-performance streaming Excel & CSV library with S3/GCS cloud support and Parquet conversion - Ultra-low memory usage

    v0.20.0 #xlsx #parquet #excel #csv #streaming
  21. aisle

    Metadata-driven Parquet pruning for Rust: Skip irrelevant data before reading

    v0.2.0 #data-fusion #pruning #parquet #metadata #arrow
  22. polars-parquet

    Apache Parquet I/O operations for Polars

    v0.53.0 382K #arrow #apache-arrow #polars #parquet #query-engine #polars-dataframe
  23. pg2parquet

    Command line tool for exporting PostgreSQL tables or queries into Parquet files

    v0.2.2 #parquet #postgresql
  24. datafusion-datasource-parquet

    v52.1.0 490K #parquet #data-fusion #apache-arrow #data-fusion-query #in-memory
  25. cc2p

    Convert a CSV to parquet file format

    v0.6.0 #csv #parquet #cli
  26. dply

    A command line data manipulation tool inspired by the dplyr grammar

    v0.3.5 850 #csv #parquet #command-line-data #json #column #command-line-tool #polars #grammar
  27. tabiew

    A lightweight TUI application to view and query tabular data files, such as CSV, TSV, and parquet

    v0.12.0 #csv #csv-tsv #parquet #sql #tui-applications #tabular-data #query-data #excel #plot
  28. cloudfront-logs

    AWS CloudFront log line parser

    v0.9.1 1.0K #log-parser #cloud-front #aws #parquet #logging
  29. ushcn

    US Historical Climatology Network data downloader

    v0.2.5 280 #parquet #downloader #station #data-processing #daily #historical #lat-lon #climate #cache #parallel-processing
  30. alimentar

    Data Loading, Distribution and Tooling in Pure Rust

    v0.2.3 1.5K #dataset #parquet #ml #arrow
  31. dataset-writer

    write CSV/Arrow/Parquet files concurrently

    v2.0.0 280 #csv #dataset #parquet #arrow-ipc #feather
  32. tass

    A pager for tabular data

    v0.11.0 800 #tabular-data #pager #parquet #csv #column #schema-inference #string-search #csv-tsv
  33. tonbo

    Embedded database for serverless and edge runtimes, storing data as Parquet on S3

    v0.4.0-a0 #serverless #embedded-database #edge #record-batch #compaction #parquet #mvcc #object-storage #zero-copy #cas
  34. otlp2parquet

    Stream OpenTelemetry logs, metrics, and traces to Parquet files

    v0.9.0 #open-telemetry #observability #parquet #otlp #telemetry
  35. supertable-core

    Core library for SuperTable, a next-generation open table format

    v0.1.1 #table-format #arrow #iceberg #parquet
  36. tpchgen-cli

    Blazing fast pure Rust TPC-H data generator command line tool and library

    v2.0.2 8.2K #data-generator #benchmark #csv #tpc-h #data-generation #scale-factor #parquet #current-directory #tbl
  37. datu

    data file utility

    v0.2.3 #parquet #avro #format #convert #input-file #xlsx #csv #output-file #schema-file
  38. helios-sof

    complete implementation of the SQL-on-FHIR specification for Rust, enabling the transformation of FHIR resources into tabular data using declarative ViewDefinitions. It supports all major FHIR versions (R4…

    v0.1.33 #view-definition #csv #parquet #fhir-path #json #ndjson #query-parameters #json-output #cloud-storage #zip
  39. archive-to-parquet

    Recursively convert archives to parquet files

    v0.10.1 #archive #parquet #gzip #recursion #convert #text-output #compression #hash #zip #filesize
  40. pulse-ops

    Built-in operators and combinators for Pulse dataflows — map, filter, join, window, and custom processing logic

    v0.1.2 #event-time #stream-processing #pulse #operator #parquet #rocksdb #kafka #local-first #windowing #watermarks
  41. tinyetl

    Fast, zero-config ETL in a single binary for transforming data between formats and databases

    v0.10.0 #csv #etl-data #parquet #database #etl
  42. csvdb

    Convert between SQLite/DuckDB databases and CSV directories

    v0.2.6 #sqlite #csv #git #duck-db #parquet
  43. nc-to-pq

    A CLI tool to convert 1D variable NetCDF4 files to Apache Parquet format

    v0.2.0 #net-cdf #parquet #convert #cli
  44. csv2parquet

    Convert CSV files to Parquet

    v0.24.4 #parquet #csv #apache-arrow #convert #schema-file #arrow-tools #schema-json
  45. dsq-formats

    File format support for dsq - handles reading and writing various data formats

    v0.1.0 #parquet #csv #json
  46. json2parquet

    Convert JSON files to Parquet

    v0.24.4 #convert-json #apache-arrow #parquet #arrow-tools
  47. readervzrd

    A generic reader for csv and json data

    v0.3.2 310 #csv #json #json-reader #file-reader #record #nested-json #tabular-data #json-format #csv-reader #parquet
  48. datacell

    powerful CLI tool and library for spreadsheet manipulation with pandas-style operations. Supports CSV, Excel (XLSX, XLS, ODS), Parquet, and Avro formats with formula evaluation, data transformation…

    v0.1.7 #spreadsheet #csv #parquet #excel
  49. pulse-io

    Input/output connectors for Pulse — integrates with external systems such as Kafka, Arrow, and Parquet

    v0.1.2 #parquet #event-time #data-streaming #kafka #io #csv #rocksdb #local-first #windowing #flink
  50. parquet2json

    A command-line tool for streaming Parquet as line-delimited JSON

    v4.3.0 #parquet #streaming-json #json-output #line-delimited #file-format #command-line-tool
  51. smooth-json

    opinionated, customizable utility to flatten serde_json Value variants into serde_json Objects ready for use in columnar or table-like usages

    v0.2.7 800 #json #parquet #smoothing #flatten #unnest
  52. liveplot

    Realtime interactive plotting library using egui/eframe, with optional gRPC and Parquet export support

    v1.1.0 #parquet #fft #stream #export #csv #egui-plot #interactive-plot #screenshot #event-logging #data-analysis
  53. allsource-core

    High-performance event store core built in Rust

    v0.8.0 #cqrs #event-store #event-sourcing #event-sourcing-cqrs #parquet
  54. liquid-cache-storage

    10x lower latency for cloud-native DataFusion

    v0.1.10 #data-fusion #arrow-array #cache #pushdown #storage-layer #object-store #parquet #cloud-native
  55. evolution

    Efficiently evolve your old fixed-length data files into modern file formats

    v1.3.0 190 #parquet #data-engineering #etl #concurrency #arrow
  56. pulse-state

    State management utilities for Pulse — provides windowing, aggregations, and persistent operator state

    v0.1.2 #prometheus #rocksdb #data-streaming #windowing #operator #parquet #event-time #local-first #kafka #flink
  57. fusio-parquet

    Parquet reader and writer implementations for Fusio

    v0.6.0 #fusio #async-runtime #tokio-uring #parquet #storage #monoio #object-storage #storage-file #async-file #amazon-s3
  58. icepick

    Experimental Rust client for Apache Iceberg with WASM support for AWS S3 Tables and Cloudflare R2

    v0.4.1 #s3-tables #amazon-s3 #parquet #wasm
  59. parquet-viewer

    command-line tool to view Apache Parquet files

    v0.2.0 #parquet #dataframe #viewer
  60. parquet-record

    High-performance Rust library for moving structs to/from disk using Parquet format. Abstracts complex Arrow/Parquet usage while providing batch writing and parallel reading capabilities for maximum performance.

    v0.2.0 #record-batch #parquet #parallel #column #reading #arrow-format
  61. prestige

    file reading and writing utilities and tools

    v0.2.6 #amazon-s3 #parquet #serialization #file-upload #bucket #data-pipeline #file-rotation #crash-recovery #file-source #metrics
  62. pulse-core

    Core runtime and dataflow engine for Pulse — defines execution graph, operators, and streaming primitives

    v0.1.2 #prometheus #stream-processing #event-time #parquet #operator #rocksdb #kafka #local-first #metrics #watermarks
  63. xdl-dataframe

    DataFrame module for XDL - pandas/Spark-style data manipulation with support for CSV, TSV, Parquet, Avro

    v0.1.1 #dataframe #xdl #csv #tsv #data-analysis #avro #parquet #pandas #visualization #language-analysis
  64. parq

    A blazingly-fast tool for exploring and analyzing Apache Parquet file: inspect schema, view statistics, browse data, and dissect structures

    v0.1.2 #parquet #tui #data-analysis
  65. parquet_to_excel

    convert parquet file(s) to an/a excel/csv file with constant memory in rust

    v0.7.3 #parquet #xlsx #csv #convert #excel
  66. parquetry

    Runtime library for Parquet code generator

    v0.17.0 1.0K #parquet #codegen #run-time
  67. midas_processor

    High-performance Rust tool for converting UK Met Office MIDAS weather datasets from BADC-CSV to optimized Parquet format

    v1.2.0 #parquet #midas #climate #weather #meteorology
  68. daemon_rs

    High-performance structured logging daemon with Parquet storage

    v0.1.1 #logging #parquet #observability #high-performance
  69. dremio-rs

    Dremio Rust client

    v0.2.6 #apache-arrow #sql #client #service #flight-sql #parquet #es #query-data
  70. polars-cli

    CLI interface for running SQL queries with Polars as backend

    v0.9.0 600 #sql #command-line-interface #interface-for-running #apache-arrow #back-end #ipc #parquet #interactive-shell
  71. mdb-cli

    Command line client for the MarpleDB API

    v0.1.0 #iceberg #parquet #marple #database #aerospace
  72. pqrs

    Apache Parquet command-line tools and utilities

    v0.3.2 420 #parquet #arrow #cli
  73. nc2parquet

    High-performance NetCDF to Parquet converter with cloud storage support

    v0.1.1 #cloud-storage #net-cdf #parquet #climate #data-conversion
  74. dbt-fusion-workspace-hack

    workspace-hack package, managed by hakari

    v0.1.0 4.0K #dbt #sql #csv #parquet #json
  75. compact-thrift-parquet

    Parquet metadata structures generated from thrift definitions

    v0.3.1 #thrift #parquet
  76. geoparquet

    reader and writer

    v0.7.0 280 #reader-writer #geo-arrow #parquet #specification #upstream #geospatial #geometry #apache-arrow
  77. seqtable

    High-performance parallel FASTA/FASTQ sequence counter

    v0.1.1 #bioinformatics #fasta-sequence #fastq #fasta #parquet #bioinformatics-sequence
  78. parquet-key-management

    Implements the Parquet Key Management Tools API in Rust to enable integration with a Key Management Server when using Parquet modular encryption

    v0.6.0 #encryption #parquet #kms #arrow
  79. compact-thrift-runtime

    Runtime library for compact-thrift code generator

    v0.2.1 #thrift #parquet
  80. datasynth-output

    Output sinks for CSV, Parquet, JSON, and streaming formats

    v0.6.0 #streaming-json #csv #parquet #sap #synthetic-data #journal #arrow-schema #serialization #newline-delimited #storage-compression
  81. datatui

    fast, keyboard-first terminal data viewer

    v0.3.0 #column-width #csv-tsv #logging #viewer #sql #parquet #error-logging #key-bindings #lazy-evaluation #polars
  82. getmeta

    Not just gold builds anymore!

    v2026.2.10 #build #gold #amazon-s3 #parquet #hash #aws #ubuntu #macintosh
  83. cottas-rs

    working with compressed RDF files in the COTTAS format. COTTAS stores triples as a triple table in Apache Parquet. It is built on top of DuckDB and provides an HDT-like interface.

    v0.1.1 #rdf #parquet #duck-db
  84. datasetq

    A data processing tool with a jq-like syntax for structured data formats, including CSV, JSON, Parquet, Avro, and more

    v0.1.3 #csv #parquet #jq
  85. liquid-cache-common

    10x lower latency for cloud-native DataFusion

    v0.1.10 #data-fusion #object-store #cache #liquid-cache #rpc #cloud-native #parquet #pushdown #10x #shared-data-structures
  86. pq-utils

    reading parquet files

    v0.5.0 390 #parquet #display #schema #json #csv #file-content #cat
  87. parquet_aramid

    Query engine using Parquet tables as a Key-Value store

    v0.2.0 #query-engine #key-value-store #parquet
  88. parquet-format-safe

    Safe Parquet and Thrift reader and writer (sync and async)

    v0.2.4 327K #thrift #parquet
  89. shaha

    Hash database builder and reverse lookup tool

    v0.2.0 #parquet #cryptography #hash #security
  90. parquet-format

    Apache Parquet Format - thrift definition and generated Rust file

    v4.0.0 10K #parquet #hadoop
  91. shema

    Schema generation macros

    v0.1.2 #firehose #parquet #aws #aws-glue #schema
  92. tesser-ledger

    Ledger primitives for Tesser accounting

    v0.9.3 #ledger #accounting #ledger-repository #tesser #primitive #parquet #pnl #canonical #realized #journal
  93. csvs_convert

    Some Datapackage Conversion

    v0.12.1 #csv #convert #date-time #datapackage #csvs #data-format-conversion #statistics #postgresql #xlsx #parquet
  94. liquid-cache-parquet

    10x lower latency for cloud-native DataFusion

    v0.1.10 #data-fusion #parquet #cache #cloud-native #liquid #10x #pushdown
  95. rangebar-io

    I/O operations for rangebar data (CSV, Parquet, Arrow)

    v6.1.1 #parquet #finance #finance-trading #trading
  96. parquet_opendal

    parquet Integration for Apache OpenDAL

    v0.7.0 410 #opendal #parquet #opendal-integration #storage
  97. ecad-processor

    High-performance multi-metric weather data processor for European Climate Assessment & Dataset (ECA&D) archives with Parquet output

    v2.0.1 #data-processing #parquet #climate #weather #ecad
  98. tonbo-predicate

    Predicate evaluation for Tonbo embedded database

    v0.1.0 #serverless #tonbo #embedded-database #edge #predicate #parquet #operand #schema-query #happen #data-fusion
  99. parquet-format-async-temp

    Temporary crate containing thrift library + parquet definitions compiled to support read+write async

    v0.3.1 4.8K #thrift #parquet #hadoop
  100. tenrso-ooc

    Out-of-core processing with Arrow/Parquet for TenRSo

    v0.1.0-alpha.2 #arrow #tensor #chunked #mmap #parquet #ram #ipc #out-of-core #memory-mapping #in-memory
  101. sbbf-rs

    Split block bloom filter implementation

    v0.2.8 9.8K #bloom-filter #split #block #parquet #system
  102. wardenclyffe

    A tiny Rust query engine that supports SQL-like filters, CSV scanning, projections, and a custom DSL powered by Pest

    v0.1.1 #csv-parser #csv #query-engine #query-dsl #parquet #parser-dsl
  103. parquetry-gen

    Parquet code generator

    v0.17.0 1.0K #parquet #codegen #parquetry
  104. tpctools

    generating and converting TPC-H and TPC-DS data sets

    v0.7.0 240 #dataset #generator #tpc-h #convert #apache-arrow #apache-spark #data-generator #parquet #command-line-tool #data-fusion
  105. otlp2parquet-proto

    OTLP protobuf definitions for otlp2parquet

    v0.6.0 #metrics #logging #aws-lambda #protobuf #open-telemetry #parquet #otlp #apache-iceberg #duck-db
  106. liquid-cache-server

    10x lower latency for cloud-native DataFusion

    v0.1.10 #data-fusion #cache #arrow #server #flight #cloud-native #object-store #10x #parquet #apache-arrow
  107. deepbiop-fq

    Deep Learning Preprocessing Library for Fastq Format

    v0.1.16 #deep-learning #parquet #fastq
  108. liquid-cache-client

    10x lower latency for cloud-native DataFusion

    v0.1.10 #data-fusion #liquid-cache #object-store #flight #cloud-native #parquet #10x #pushdown #apache-arrow
  109. bdt

    viewing, querying, converting, and comparing files in popular data formats (CSV, Parquet, JSON, Avro)

    v0.18.0 170 #csv #parquet #avro #convert #json
  110. deepbiop-fa

    Deep Learning Preprocessing Library for Fastq Format

    v0.1.16 #bioinformatics #deep-learning #fasta #parquet
  111. otlp2parquet-writer

    Parquet writer for otlp2parquet

    v0.7.1 #metrics #logging #parquet #writer #aws-lambda #otlp #apache-iceberg #open-telemetry #duck-db
  112. otlp2parquet-handlers

    Stream OpenTelemetry logs, metrics, and traces to Parquet files

    v0.7.1 #metrics #logging #open-telemetry #amazon-s3 #parquet #duck-db
  113. parquetry-sort

    Runtime sorting library for Parquet code generator

    v0.17.0 750 #parquet #codegen #run-time
  114. prestige-cli

    CLI interface for manually fetching and reading Prestige-parquet files

    v0.2.6 #command-line-interface #parquet #deduplicate #compact #fetching #compression
  115. polars-parquet-format

    Safe Parquet and Thrift reader and writer (sync and async)

    v0.1.0 278K #thrift #parquet
  116. slim-runner

    Run SLiM simulation grid runs in parallel

    v0.1.2 120 #simulation #s-li-m #slim #run #parameters #parquet #csv
  117. sqlite2parquet

    Generate parquet files from sqlite databases

    v0.10.2 2.1K #sqlite #parquet #generate #database #archive
  118. metriken-query

    PromQL query engine and TSDB for metriken parquet files

    v0.1.2 #promql #query-engine #metrics #parquet #metriken #tsdb #time-series #distributed
  119. tbl-core

    reading and modifying tabular files

    v0.1.1 #parquet #tabular #schema #column #modify #army #amazon-s3 #write-operations #swiss-army
  120. otlp2parquet-common

    Stream OpenTelemetry logs, metrics, and traces to Parquet files

    v0.7.1 #metrics #logging #open-telemetry #parquet #stream
  121. liquid-cache-local

    10x lower latency for cloud-native DataFusion

    v0.1.10 #data-fusion #cache-local #liquid-cache #parquet #temporary-files #in-process #session-config #cloud-native #temp-dir #policies
  122. datagen

    An easy to use tool to generate fake data in bulk and export it as Avro, Parquet or directly into your database as tables

    v0.1.4 #fake-data #csv #database-table #parquet #fake-data-generator #json-schema #avro #data-export #avro-schema #export-json
  123. pgparquet

    High-performance CLI tool for streaming Parquet files from Google Cloud Storage into PostgreSQL

    v0.1.0 #gcs #postgresql #parquet #streaming
  124. daft-parquet

    Parquet processing for the Daft project

    v0.1.0 #parquet #daft #processing-for-daft
  125. evolution-builder

    Builder implementations for evolution

    v1.3.0 #fixed-length #builder #data-science #file-format #evolution #iceberg #evolve #parquet #indra-db #data-analytics
  126. Try searching with DuckDuckGo.

  127. parqeye

    Parquet viewer for the command line

    v0.0.2 #parquet #schema #tui #visualizer #parq
  128. warc-parquet

    converting WARC to Parquet

    v0.6.1 #parquet #warc #arrow
  129. prql-query

    pq: query and transform data with PRQL

    v0.0.15 #data-fusion #pq #prql #duck-db #csv #query-language #postgresql #parquet #data-query #transform-data
  130. parquet2lance

    Convert parquet files to lance

    v0.5.0 1.4K #lance #parquet #object-store
  131. tbl-cli

    tbl is a tool for reading and editing tabular data files

    v0.1.1 #parquet #schema #tabular-data #editing #column #swiss-army #json-output #ls #amazon-s3 #write-operations
  132. query-fuse

    An interactive SQL query engine for local columnar files (Parquet, Arrow, Feather)

    v0.1.0 #data-fusion #parquet #arrow #sql #cli
  133. dr

    Command-line data file processing in Rust

    v0.7.0 #csv #parquet #data-processing #command-line-data #sql #file-processing
  134. motif-scanner

    Command line tool for scanning DNA sequences for transcription factor binding sites

    v0.1.1 180 #transcription-factor #dna-sequence #csv #site #bindings #parallel-processing #dna-sequence-analysis #parquet #pwm #batch-processing
  135. glaredb_ext_parquet

    Apache Parquet extension for GlareDB. Originally forked from github.com/apache/arrow-rs

    v25.6.3 #parquet #sql #database #analytics-database #analytics
  136. azof

    Lakehouse format with event time travel

    v0.2.1 #time-travel #parquet #data-lake #lakehouse #event-time
  137. amadeus-parquet

    An Apache Parquet implementation in Rust

    v0.4.3 #parquet #amadeus #arrow #hadoop #data
  138. evolution-writer

    Output target writers for evolution

    v1.3.0 #fixed-length #target #writer #data-science #evolution #iceberg #evolve #parquet #schema-file #data-analytics
  139. tabler

    📊 Tabler: A lightweight TUI tool to view, query, and navigate CSV, TSV, and Parquet data files

    v0.1.1 #csv #parquet #data-query #tsv #csv-tsv #query-data
  140. xpq

    command line tool for analyzing parquet files

    v0.2.1 #command-line-tool #parquet #cli
  141. gosh-adaptor

    Adaptor for chemical model

    v0.4.1 #computational-chemistry #parquet #reader #chemical-model #vasp #gulp #chemistry-model #file-data
  142. evolution-converter

    Converter implementations for evolution

    v1.3.0 #fixed-length #converter #evolution #iceberg #file-format #parquet #evolve #indra-db #delta-lake #data-analytics
  143. range-reader

    Converts low-level APIs to read ranges of bytes to Read + Seek

    v0.2.0 #byte-range #read-seek #async-read #low-level-api #async-seek #apache-arrow #amazon-s3 #storage-api #parquet
  144. innofile

    InnoFile

    v0.1.0 #file-format #filesystem #write-file #convert #tokio #parquet #orc #tokio-file
  145. depyler-knowledge

    Sovereign Type Database for Python library type extraction

    v3.23.0 #python #stub #parquet #pyi
  146. rs-ints2parquet

    Converts the integers to a parquet

    v0.1.0 #integer #parquet #raw
  147. csv2pq

    CSV to Apache parquet converter

    v0.1.1 #parquet #csv #converter #apache-arrow #rm #int32 #int64 #float32
  148. athort

    Assortment of Parquet and other items

    v0.2.0 #parquet #assortment #item
  149. bazof

    Lakehouse format with event time travel

    v0.1.0 #time-travel #data-lake #lakehouse #parquet #event-time
  150. datahobbit

    that generates CSV or Parquet files with synthetic data based on a provided JSON schema

    v1.0.0 #json-schema #data-generator #parquet #csv #synthetic-data #json-output #json-generator #data-generation #output-file #filesize
  151. csv_generator

    that generates CSV or Parquet files with synthetic data based on a provided JSON schema

    v1.0.0 #json-schema #parquet #csv #generator #synthetic-data #json-output #data-generator #data-generation #filesize #output-file
  152. xpq2

    command line tool for analyzing parquet files

    v0.2.2 #parquet #cli
  153. rs-csv2parquet

    Converts a csv file to a parquet

    v0.1.0 #parquet #csv #arrow
  154. jlcpcb-to-parquet

    convert JLCPCB Parts Library to Parquet

    v0.4.2 320 #parquet #jlcpcb #part #convert
  155. otlp2parquet-core

    Core OTLP to Arrow/Parquet conversion logic

    v0.6.0 #logging #metrics #otlp #open-telemetry #parquet #duck-db #docker #amazon-s3 #azure #cloudflare
  156. rs-parquets2count

    Computes the total number of rows of the parquet files

    v0.1.0 #parquet #arrow #count
  157. pack-it

    Packer for Parquet tables

    v0.2.2 #packer #table #parquet #data-fusion
  158. azof-cli

    CLI utility for azof lakehouse format

    v0.2.1 #data-fusion #lakehouse #event-time #time-travel #table #parquet #storage-formats
  159. rs-splited2parquet

    Converts csv-like lines to rows and saves as a parquet

    v0.1.0 #parquet #data-conversion #csv
  160. hypersync-client

    client library for hypersync

    v1.0.0 10K #blockchain #hyper-sync #arrow-format #api-token #retries #parquet #data-streaming #query-builder #envio
  161. rs-zips2meta2parquet

    Converts the metadata values of the zip files to a parquet

    v0.1.0 #parquet #data-processing #arrow
  162. glaciers

    decode raw EVM logs into decoded events

    v2.0.1 1.2K #event-logging #dataframe #contract-address #ethereum #algorithm #folder-path #parquet #function-signatures #config-file #user-defined
  163. parquet-py

    command-line interface & Python API for parquet

    v0.2.1-beta #parquet #csv #convert-json #list #row #command-line-interface
  164. dendritic-datasets

    Prebuilt datasets that can be imported for ML model training

    v1.5.0 #dataset #dendritic #student #iris #training #parquet #ml-model #purchase #cancer #airfoil
  165. parquet-flamegraph

    program to generate flamegraph and investigate parquet storage

    v0.1.1 #parquet #flame-graph #column #generate #investigate
  166. ethl-cli

    Tools for capturing, processing, archiving, and replaying Ethereum events

    v0.1.3 #ethereum #parquet #arrow
  167. parquet-lru

    Implement LRU cache reader for parquet::arrow::async_reader::AsyncFileReader

    v0.3.0 190 #lru-cache #file-reader #parquet #arrow #async-file