Organizations

@pydata @stan-ja @dask @pandas-ml @pandas-dev

Reproducibility for Humans: A lightweight tool to perform reproducible machine learning experiments.

Python 24 5 Updated Apr 24, 2019

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 16,041 3,873 Updated Oct 9, 2025

N-D labeled arrays and datasets in Python

Python 3,991 1,180 Updated Oct 8, 2025

Design documents and code for the pandas 2.0 effort.

Python 304 39 Updated Nov 9, 2018

It'll detect your anomalies! Part of the Kale stack.

Python 2,137 333 Updated Feb 22, 2016

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

Python 46,780 19,097 Updated Oct 9, 2025

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 42,729 15,755 Updated Oct 9, 2025

Apache Superset is a Data Visualization and Data Exploration Platform

TypeScript 68,417 15,949 Updated Oct 9, 2025

Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow

JavaScript 2,751 165 Updated Nov 19, 2021

Quickly and accurately render even the largest data.

Python 3,460 377 Updated Oct 9, 2025

Gaussian processes framework in Python

Python 2,118 569 Updated Jun 19, 2025

pandas Japanese extension

Python 81 9 Updated Jul 23, 2020

Stan models for state space time series

R 146 42 Updated Jul 3, 2017

pandas, scikit-learn, xgboost and seaborn integration

Python 319 78 Updated Aug 14, 2020

Python 10 4 Updated Mar 5, 2018

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

C++ 27,462 8,814 Updated Oct 9, 2025

Experimental multicore fork of Python 3

Python 586 25 Updated Dec 10, 2024

Airspeed Velocity: A simple Python benchmarking tool with web-based reporting

Python 951 195 Updated Oct 9, 2025

Parallel computing with task scheduling

Python 13,522 1,802 Updated Oct 9, 2025

The portable Python dataframe library

Python 6,143 674 Updated Oct 9, 2025

A web application framework for Python

Python 830 156 Updated Mar 12, 2022

A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning al…

R 1,889 332 Updated Sep 16, 2022

A flexible framework of neural networks for deep learning

Python 5,905 1,360 Updated Aug 28, 2023

A Theano framework for building and training neural networks

Python 1,154 347 Updated Feb 19, 2019

R interface to Bokeh http://hafen.github.io/rbokeh/

R 311 64 Updated Nov 1, 2023

Data Migration for the Blaze Project

Python 1,003 131 Updated Jul 15, 2022

Recipes for using Python's pandas library

Jupyter Notebook 6,942 2,359 Updated Oct 24, 2024

NeuralTalk is a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences.

Python 5,441 1,326 Updated Dec 22, 2020

IPython kernel for Torch with visualization and plotting

Jupyter Notebook 1,097 156 Updated Nov 10, 2017

Define fortify and autoplot functions to allow ggplot2 to handle some popular R packages.

R 535 68 Updated Jul 27, 2025