Stars
Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
A flexible, intuitive and fast forecasting library
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
Export Azure DevOps Wiki to PDF
SCD Merge Wizard is an application that helps you generate a T-SQL statement for merging data from two tables into one table in minutes. At the end, the generated T-SQL statement can be used to repl…
A web interface to create custom vector-based visualizations on top of RAWGraphs core
ffn - a financial function library for Python
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Collaborate & label any type of data, images, text, or documents, in an easy web interface or desktop app.
Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs (CIKM 2020)
Top2Vec learns jointly embedded topic, document and word vectors.
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …
A cross-tenant, metadata-driven processing framework for Azure Data Factory and Azure Synapse Analytics, achieved by coupling orchestration pipelines with a SQL database and a set of Azure Functions.
Always know what to expect from your data.
A Microsoft Power BI Data Connector or Power Query Connector for the Power BI REST API
Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts
Natural Language Processing Best Practices & Examples
Using extractive summarization to summarize Medium posts
DataSpider is a metadata crawler closely linked to the Kosh metadata model. It is an application that can crawl across many types of enterprise data sources and populate the Kosh metadata reposit…
This repository is intended to contain educational and supplemental information for the BotWorks/Kosh/DataSpider ecosystem repositories.
BotWorks is a framework developed by Modak Analytics for GlaxoSmithKline to simplify data movement activities (ingestion and curation) in their Hadoop / Big Data environment. GlaxoSmithKline …
Deezer source separation library including pretrained models.
Streamlit app demonstrating an image browser for the Udacity self-driving-car dataset with realtime object detection using YOLO.
The aim of the toolkit is to provide a highly flexible, no-code way of implementing GAN models.
A distributed event bus that implements a RESTful API abstraction on top of Kafka-like queues
The code to reproduce results from the paper "MultiFiT: Efficient Multi-lingual Language Model Fine-tuning" https://arxiv.org/abs/1909.04761