- San Francisco, CA
- https://AboutDataScience.wordpress.com
Stars
Compilation of resources for aspiring data scientists
Code for AMIA CRI 2016 paper "Learning Low-Dimensional Representations of Medical Concepts"
System for Medical Concept Extraction and Linking
Extract CUIs from MIMIC notes and represent them using cui2vec
A probabilistic programming language in TensorFlow. Deep generative models, variational inference.
Python code for part 2 of the book Causal Inference: What If, by Miguel Hernán and James Robins
Synthetic Patient Population Simulator
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Fast State-of-the-Art Tokenizers optimized for Research and Production
Scrape job websites into a single spreadsheet with no duplicates.
NYC WiMLDS scikit-learn open source sprint (Aug 24, 2019)
Python suite to construct benchmark machine learning datasets from the MIMIC-III clinical database.
Generative adversarial network for generating electronic health records.
Semantic segmentation on aerial and satellite imagery. Extracts features such as: buildings, parking lots, roads, water, clouds
Simple PyTorch Tutorials Zero to ALL!
A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks
Introduction to NLP with PyTorch Workshop Project
Looking for inspiration for your next open source project? Or perhaps you've got a brilliant idea you can't wait to share with others? Open Source Ideas is a community built specifically for this!
Open or Easy Access Clinical Data Sources for Biomedical Research
MIMIC Code Repository: Code shared by the research community for the MIMIC family of databases
Cluster Similar Customers of a Retailer using Machine Learning.
A curated list of awesome network analysis resources.
An introduction to network analysis and applied graph theory using Python and NetworkX