Stars
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
Open source annotation tool for machine learning practitioners.
Python client for Baidu Yun (Personal Cloud Storage) 百度云/百度网盘Python客户端
Neural network 3D visualization framework, build interactive and intuitive model in browsers, support pre-trained deep learning models from TensorFlow, Keras, TensorFlow.js
A PyTorch native platform for training generative AI models
⚙️ Convert HTML to Markdown. Even works with entire websites and can be extended through rules.
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
Data set of top third party web domains with rich metadata about them
Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"
Showcasing what in-app browsers do under the hood
XMap is a fast network scanner designed for performing Internet-wide IPv6 & IPv4 network research scanning.
Repository for the CookieBlock browser extension, which automatically enforces user privacy policy on browser cookies.
Fast computation of Krippendorff's alpha agreement measure in Python.
A tokenizer and sentence splitter for German and English web and social media texts.
Locality Sensitive Hashing for Apache Spark
The implementation that infers the temporal latent spaces for a sequence of dynamic graph snapshots
This repository holds the code for PolicyLint and PoliCheck, which identifies internal contradictions within privacy policies and analyzes data flow to privacy policy consistency.
Programmable UI-Automation Framework for Dynamic App Analysis
Code for the ICCV 2015 paper "Neural Activation Constellations: Unsupervised Part Model Discovery with Convolutional Networks."
Privacy Bot gathers, persists and analyzes privacy policies. #Mozilla Global Sprint Project