@capitalone

Capital One

We’re an open source-first organization — actively using, contributing to and managing open source software projects.

Pinned

  1. DataProfiler Public

    What's in your data? Extract schema, statistics and entities from datasets

    Python · 1.5k stars · 178 forks

  2. datacompy Public

    Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!

    Python · 604 stars · 147 forks

  3. locopy Public

    locopy: Loading/Unloading to Redshift and Snowflake using Python.

    Python · 113 stars · 50 forks

  4. rubicon-ml Public

    Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!

    Jupyter Notebook · 138 stars · 36 forks

  5. dataCompareR Public archive

    dataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.

    R · 75 stars · 26 forks

  6. edgetest Public

    edgetest is a tox-inspired Python library that loops through your project's dependencies and checks whether your project is compatible with the latest version of each dependency.

    Python · 25 stars · 8 forks

Repositories

Showing 10 of 48 repositories
  • federated-model-aggregation Public archive

    The Federated Model Aggregation (FMA) Service is a collection of installable Python components that make up the generic workflow and infrastructure needed for federated learning.

    Python · 32 stars · Apache-2.0 · 11 forks · 1 open issue · 0 open pull requests · Updated Oct 6, 2025
  • dataCompareR Public archive

    dataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.

    R · 75 stars · 26 forks · 0 open issues · 0 open pull requests · Updated Oct 6, 2025
  • c1s-slingshot-sdk-py

    Python · 1 star · Apache-2.0 · 2 forks · 0 open issues · 0 open pull requests · Updated Oct 3, 2025
  • edgetest Public

    edgetest is a tox-inspired Python library that loops through your project's dependencies and checks whether your project is compatible with the latest version of each dependency.

    Python · 25 stars · Apache-2.0 · 8 forks · 4 open issues (1 issue needs help) · 0 open pull requests · Updated Oct 3, 2025
  • datacompy Public

    Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!

    Python · 604 stars · Apache-2.0 · 147 forks · 12 open issues (1 issue needs help) · 3 open pull requests · Updated Oct 3, 2025
  • Stratum-Observability Public

    A no-dependency library to send standardized events to observability and data platforms. Based on plugins, Stratum enables the cataloging of app-specific logic to define, validate, and publish events to your entire stack.

    TypeScript · 24 stars · Apache-2.0 · 9 forks · 6 open issues · 1 open pull request · Updated Sep 30, 2025
  • rubicon-ml Public

    Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!

    Jupyter Notebook · 138 stars · Apache-2.0 · 36 forks · 9 open issues · 2 open pull requests · Updated Sep 30, 2025
  • DataProfiler Public

    What's in your data? Extract schema, statistics and entities from datasets

    Python · 1,521 stars · Apache-2.0 · 178 forks · 67 open issues (8 issues need help) · 8 open pull requests · Updated Sep 26, 2025
  • whitesource-config

    0 stars · 1 fork · 0 open issues · 0 open pull requests · Updated Sep 25, 2025
  • synthetic-data Public

    Generating complex, nonlinear datasets appropriate for use with deep learning/black box models which 'need' nonlinearity

    Python · 44 stars · Apache-2.0 · 29 forks · 3 open issues · 3 open pull requests · Updated Sep 8, 2025