Skip to content
View thegargiulian's full-sized avatar
🍵
🍵

Highlights

  • Pro

Block or report thegargiulian

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
19 results for source starred repositories written in Python
Clear filter

Robust Speech Recognition via Large-Scale Weak Supervision

Python 90,462 11,328 Updated Sep 8, 2025

🦉 Data Versioning and ML Experiments

Python 15,064 1,254 Updated Nov 4, 2025

Open source annotation tool for machine learning practitioners.

Python 10,375 1,820 Updated Jun 16, 2025

A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning

Python 7,054 1,320 Updated Aug 26, 2025

Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts

Python 7,028 412 Updated Oct 26, 2025

A system for quickly generating training data with weak supervision

Python 5,921 858 Updated May 2, 2024

A 21 century R console

Python 2,213 85 Updated May 15, 2025

Neovim plugin for integration with Zotero

Python 233 20 Updated Nov 4, 2025

Learned string similarity for entity names using optimal transport.

Python 35 3 Updated Nov 17, 2020

Metaphone is a phonetic algorithm, an algorithm published in 1990 for indexing words by their English pronunciation. It fundamentally improves on the Soundex algorithm by using information about va…

Python 35 25 Updated Sep 22, 2013

An End-to-End Evaluation Framework for Entity Resolution Systems

Python 32 10 Updated Dec 3, 2023

Efficient String Comparison Functions and Fuzzy String Matching

Python 20 2 Updated Sep 21, 2025
Python 16 11 Updated Jan 31, 2025

Materials for workshop on "Using bibliometric data in demographic research". A report here: https://iussp.org/en/using-bibliometric-data-demographic-research-0

Python 12 Updated Jul 13, 2024

Datos de feminicidios en Latinoamerica

Python 10 4 Updated Jan 26, 2019

Deduplicate data using fuzzy and deterministic matching rules.

Python 8 Updated Mar 15, 2023

Principled Data Processing

Python 2 Updated Aug 2, 2025

code that manages processing MSE scripts through Amazon SQS

Python 1 1 Updated Mar 28, 2022