categorical-variables

Here are 43 public repositories matching this topic...

AutoViML / featurewiz

Use advanced feature engineering strategies and select best features from your data set with a single line of code. Created by Ram Seshadri. Collaborators welcome.

feature-selection feature-extraction xgboost feature-engineering autoencoders categorical-variables mrmr rfe featuretools rfecv feature-encoding best-encoders feature-engg

Updated Feb 19, 2025
Python

FixedEffects / FixedEffectModels.jl

Fast Estimation of Linear Models with IV and High Dimensional Categorical Variables

regression economics iv panel-data fixed-effects clustered-standard-errors instrumental-variables categorical-variables

Updated Mar 30, 2026
Julia

WinVector / vtreat

vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under choice of GPL-2 or GPL-3 license.

r machine-learning-algorithms nested-models prepare-data categorical-variables

Updated Jan 9, 2025
HTML

imbi-heidelberg / DescrTab2

This package provides functions to create descriptive statistics tables for continuous and categorical variables.

cran r statistics statistical-tests descriptive-statistics categorical-variables continuous-variable p-values

Updated Jan 30, 2024
R

CleverInsight / sparx

Data Munging, Data Wrangling and Data Preparation Simplified

data-wrangling data-preprocessing data-preparation data-munging categorical-variables

Updated Jul 19, 2017
Python

nphdang / Bandit-BO

Bayesian Optimization for Categorical and Continuous Inputs

machine-learning optimization thompson-sampling hyperparameter-optimization hyperopt gaussian-processes bayesian-optimization multi-armed-bandits hyperparameter-tuning automl automated-machine-learning smac categorical-variables continuous-variable acquisition-functions gpyopt batch-bayesian-optimization

Updated Jul 20, 2020
Python

bbopt / HyperNOMAD

A library for the hyperparameter optimization of deep neural networks

python deep-neural-networks optimization pytorch hyperparameters hyperparameter-optimization nomad hyperparameter-tuning neural-architecture-search categorical-variables blackbox-optimization

Updated Jan 12, 2022
C++

chen0040 / java-statistical-inference

Opinionated statistical inference engine with a fluent API that makes it easier to conduct statistical inference with little or no knowledge of the underlying principles

independence statistical-analysis confidence-intervals hypothesis-testing kie categorical-variables independence-test java-statistical-inference

Updated May 25, 2017
Java

atecon / CategoryEncoders

A set of gretl transformers for encoding categorical variables as numeric values using different techniques

machine-learning statistics encoder econometrics categorical-variables gretl hansl

Updated Apr 3, 2021
Makefile

GurnaikLall / Kaggle-Intermediate-Machine-Learning

How to deal with Missing Values, Categorical Variables, Pipelines, Cross-Validation, XGBoost, Data Leakage

python machine-learning pipelines kaggle xgboost categorical-variables data-leakage

Updated Jan 13, 2023
Jupyter Notebook

vaitybharati / P21.-Hypothesis-Testing-Chi2-Test-Athletes-and-Smokers-

Hypothesis testing with a chi-square test on athletes and smokers. Null hypothesis H0: the categorical variables are independent (athlete status and smoking are unrelated). Alternative hypothesis Ha: the categorical variables are dependent (athlete status and smoking are related). Since (p_value = 0.00038) < (α = 0.05), reject the null hypothesis, i.e. De…

python numpy p-value stats scipy independence-tests hypothesis-testing chi-square-test categorical-variables significance-testing null-hypothesis chi-square-statistics alternate-hypothesis chi2-contingency

Updated May 25, 2021
Jupyter Notebook
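
The decision rule in the description above (reject independence when the p-value falls below α) can be illustrated with a minimal sketch. It assumes a 2x2 contingency table and uses only the Python standard library; the function name and the counts are hypothetical and are not taken from the repository, so the resulting p-value will not match the 0.00038 quoted above. No continuity correction is applied, which may differ from what a library routine does by default.

```python
import math


def chi2_independence_2x2(table):
    """Pearson chi-square test of independence for a 2x2 table.

    Returns (statistic, p_value). No continuity correction is applied.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)

    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence of rows and columns.
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected

    # A 2x2 table has 1 degree of freedom; for 1 degree of freedom the
    # chi-square survival function equals erfc(sqrt(x / 2)).
    p_value = math.erfc(math.sqrt(stat / 2))
    return stat, p_value


# Hypothetical counts (illustration only, not the repository's data).
# Rows: athlete / non-athlete; columns: smoker / non-smoker.
observed = [[10, 40], [30, 20]]
alpha = 0.05

stat, p_value = chi2_independence_2x2(observed)
decision = "reject H0" if p_value < alpha else "fail to reject H0"
print(f"chi2 = {stat:.3f}, p = {p_value:.2e}, decision: {decision}")
```

For larger tables the degrees of freedom change and the closed form used here no longer applies, so a statistics library would be the more natural choice.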

Ab2207 / Customer-Churn

A Machine Learning project to predict Customer Churn including all stages of a project life cycle from data procurement to deployment.

html machine-learning correlation eda feature-selection flask-application outliers logistic-regression postgresql-database data-preprocessing feature-engineering google-cloud-platform smote scikitlearn-machine-learning categorical-variables

Updated Jun 30, 2021
Jupyter Notebook

MavericksDS / pycorr

A simple library to calculate correlation between variables. Currently provides correlation between nominal variables.

correlation categorical-variables cramer pypi-package

Updated Mar 20, 2024
Python

JSzitas / categoryEncodings

Multiple methods to (quickly) encode factor variables, using data.table

r r-package feature-engineering categorical-variables feature-encoding

Updated Sep 25, 2021
R

zjg540066169 / AuxSurvey

Source Code for Paper: Williams, S.Z., Zou, J., Liu, Y., Si, Y., Galea, S. and Chen, Q. (2024), Improving Survey Inference Using Administrative Records Without Releasing Individual-Level Continuous Data. Statistics in Medicine, 43: 5803-5813. https://doi.org/10.1002/sim.10270.

survey-analysis categorical-variables auxilary-variables

Updated Dec 20, 2024
R

macarenasev / Exploring-Correlation-Students-Performance-and-PPS-package

A notebook inspired by a Kaggle task: exploring correlation, plus a bonus trial of the ppscore package

correlation pps categorical-variables label-encoding

Updated May 26, 2020
Jupyter Notebook

vigneshSs-07 / Complete-AtoZ-MLProjects

This repo contains machine learning projects covering supervised and unsupervised ML algorithms, along with solutions to various hackathons (Kaggle, AV, iNeuron)

supervised-learning retail multiclass-classification categorical-variables

Updated Nov 2, 2020
Jupyter Notebook

nglaz0v / approachingalmost

📖 Approaching (Almost) Any Machine Learning Problem

docker machine-learning text-classification cross-validation feature-selection hyperparameter-optimization image-classification image-segmentation feature-engineering evaluation-metrics stacking categorical-variables ensembling

Updated Oct 28, 2024
Jupyter Notebook

abhmalik / categorical-feature-importances-without-one-hot-encoding-dummies

Computing feature importance for categorical variables by converting them into dummy variables (one-hot encoding) can give skewed or hard-to-interpret results. Here I present a method to get around this problem using H2O.

h2oai categorical-variables feature-importance one-hot-encode categorical-features

Updated Jun 10, 2019
Jupyter Notebook

bukanpeneliti / group

Stata command for creating categorical variables from multiple logical conditions using power-of-two indexing

stata data-analysis categorical-variables stata-package statistical-software stata-ado

Updated Jun 23, 2025
Stata
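
The power-of-two indexing mentioned in the description above can be sketched in Python. This is a guess at the general idea only: each logical condition is assumed to contribute a distinct power of two, so every combination of true and false conditions maps to a unique integer category. The function name, the bit order, and the example conditions are hypothetical, and the Stata command itself may assign codes differently.

```python
def power_of_two_code(conditions):
    """Combine boolean conditions into one integer category code.

    Condition i (counting from 0) adds 2**i to the code when it is true,
    so k conditions yield codes in the range 0 .. 2**k - 1.
    """
    return sum(1 << i for i, is_true in enumerate(conditions) if is_true)


# Hypothetical records and condition names, for illustration only.
records = [
    {"employed": True, "urban": False, "insured": True},
    {"employed": False, "urban": True, "insured": False},
    {"employed": True, "urban": True, "insured": True},
]

codes = [
    power_of_two_code([r["employed"], r["urban"], r["insured"]])
    for r in records
]
print(codes)
```

Because each condition owns one bit, the original conditions can be recovered from a code with a bitwise test such as `code & (1 << i)`.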
