- San Francisco, CA
Highlights
- Pro
Starred repositories
Python library to access and analyze SEC Edgar filings, XBRL financial statements, 10-K, 10-Q, and 8-K reports
A powerful and modular toolkit for record linkage and duplicate detection in Python
OpenRefine is a free, open source power tool for working with messy data and improving it
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
Doing dirty (but extremely useful) things with equals.
🐍 Quick reference guide to common patterns & functions in PySpark.
🍃 Automate your personal finances – for free, with no ads, and no data collection.
A spaCy wrapper for DBpedia Spotlight
A real-time transcription project using React and socketio
Dive into this repository, a comprehensive resource covering Data Structures, Algorithms, 450 DSA by Love Babbar, Striver DSA sheet, Apna College DSA Sheet, and FAANG Questions! 🚀 That's not all! W…
Turn (almost) any Python command line program into a full GUI application with one line
🐸 Identify anything. pyWhat easily lets you identify emails, IP addresses, and more. Feed it a .pcap file or some text and it'll tell you what it is! 🧙♀️
Specify a dynamic set of questions to ask a user and get their answers.
⚡️ Lightning-fast backtesting engine to find your trading edge
t-Digest data structure in Python. Useful for percentiles and quantiles, including distributed enviroments like PySpark
🗂 The perfect Front-End Checklist for modern websites and meticulous developers
Researches for Natural Language Processing for Financial Domain
This is a database of 300.000+ symbols containing Equities, ETFs, Funds, Indices, Currencies, Cryptocurrencies and Money Markets.
Collection of notebooks about quantitative finance, with interactive python code.
A unified framework for machine learning with time series
Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbations.
Mesa is an open-source Python library for agent-based modeling, ideal for simulating complex systems and exploring emergent behaviors.
A collection of helpers for Jupyter/IPython
Python module for interacting with nested dicts as a single level dict with delimited keys.
Async PRAW, an abbreviation for "Asynchronous Python Reddit API Wrapper", is a python package that allows for simple access to Reddit's API.
A Peer-to-peer Platform for Secure, Privacy-preserving, Decentralized Data Science