Stars
A production-ready template to kickstart your Generative AI projects with structure and scalability in mind.
A Phi Family of SLMs book for getting started with Phi models. Phi is a family of open-source AI models developed by Microsoft. Phi models are the most capable and cost-effective small langua…
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search
Python tool for converting files and office documents to Markdown.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Llama-style transformer in PyTorch with multi-node / multi-GPU training. Includes pretraining, fine-tuning, DPO, LoRA, and knowledge distillation. Scripts for dataset mixing and training from scratch.
Python SDK and Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing, and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
C++ library for converting text to phonemes for Piper
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Andr…
A replica of the 2048 game, but in a contemporary format
📚 Library of essential programming books. (Check out my new project `SendScriptWhatsapp`)
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
Converts profiling output to a dot graph.
Start building and deploying Python packages and Docker images for MLOps tasks.
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
A Python truecasing utility that restores case information in text
Repository with the 4th Tech Challenge of FIAP PosTech
Quick guide/tutorial for WSL2 + Docker
Machine Learning for Imbalanced Data, published by Packt
Friends don't let friends make certain types of data visualization: what they are and why they are bad.
Best Practices on Recommendation Systems