Compilation of resources for aspiring data scientists

Python 2,170 649 Updated Jun 14, 2024

Code for AMIA CRI 2016 paper "Learning Low-Dimensional Representations of Medical Concepts"

Python 250 77 Updated Aug 24, 2020

System for Medical Concept Extraction and Linking

Python 444 101 Updated Aug 12, 2024

Extract CUIs from MIMIC notes and represent them using cui2vec

Jupyter Notebook 8 Updated Jan 14, 2019
Jupyter Notebook 87 21 Updated Apr 3, 2020

A probabilistic programming language in TensorFlow. Deep generative models, variational inference.

Jupyter Notebook 4,841 744 Updated Mar 18, 2024

Python code for part 2 of the book Causal Inference: What If, by Miguel Hernán and James Robins

Jupyter Notebook 1,350 411 Updated Jan 8, 2022
JavaScript 1,675 146 Updated Jan 27, 2020

Synthetic Patient Population Simulator

Java 3,186 888 Updated May 28, 2026

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Python 22,960 3,598 Updated Jul 28, 2024

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 10,814 1,124 Updated Jun 12, 2026

Scrape job websites into a single spreadsheet with no duplicates.

Python 2,152 261 Updated Dec 10, 2025

NYC WiMLDS scikit-learn open source sprint (Aug 24, 2019)

28 8 Updated Sep 3, 2019

MGH research

Jupyter Notebook 3 3 Updated Dec 6, 2018

Python suite to construct benchmark machine learning datasets from the MIMIC-III 💊 clinical database.

Python 885 344 Updated Apr 16, 2023

MiME Repository

Python 105 28 Updated Nov 14, 2019

Generative adversarial network for generating electronic health records.

Python 286 96 Updated Aug 19, 2019

Code for the emrQA question answering dataset

Python 153 34 Updated Feb 9, 2022

Semantic segmentation on aerial and satellite imagery. Extracts features such as: buildings, parking lots, roads, water, clouds

Python 2,056 387 Updated Aug 27, 2020

Simple PyTorch Tutorials Zero to ALL!

Python 3,973 1,196 Updated Mar 23, 2024

A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks

Python 2,934 441 Updated Nov 7, 2022

Introduction to NLP with PyTorch Workshop Project

Jupyter Notebook 98 42 Updated Jul 25, 2024

💡 Looking for inspiration for your next open source project? Or perhaps you've got a brilliant idea you can't wait to share with others? Open Source Ideas is a community built specifically for this! 👋

6,788 229 Updated Sep 24, 2025

Open or Easy Access Clinical Data Sources for Biomedical Research

185 35 Updated Jan 5, 2017

MIMIC Code Repository: Code shared by the research community for the MIMIC family of databases

Jupyter Notebook 3,246 1,709 Updated Apr 14, 2026

Cluster Similar Customers of a Retailer using Machine Learning.

Jupyter Notebook 6 5 Updated Aug 6, 2017

A curated list of awesome network analysis resources.

R 4,061 631 Updated Apr 17, 2026

An introduction to network analysis and applied graph theory using Python and NetworkX

Jupyter Notebook 1,107 398 Updated Jun 10, 2026