AI-powered Quantitative Investment Research Platform.
A project that demonstrates how to deploy AI models in containerized environments using Cog. Ideal for reproducible, scalable, and hardware-efficient inference.
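Cog pairs a cog.yaml build spec with a Python predictor class; below is a minimal, illustrative predictor sketch, where the model choice and parameter names are assumptions, not taken from the project itself.

```python
# predict.py: a minimal, illustrative Cog predictor (model choice is an assumption)
from cog import BasePredictor, Input
from transformers import pipeline


class Predictor(BasePredictor):
    def setup(self):
        # Runs once per container start, so the model loads before any request.
        self.generator = pipeline("text-generation", model="distilgpt2")

    def predict(self, prompt: str = Input(description="Prompt to complete")) -> str:
        # Each call handles one inference request inside the container.
        return self.generator(prompt, max_new_tokens=50)[0]["generated_text"]
```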
This repository contains an HR Policy Query Resolution system using Retrieval-Augmented Generation (RAG). It leverages a 4-bit quantized Mistral-7B-Instruct-v0.2 LLM and JP Morgan Chase’s publicly available Code of Conduct documents to generate accurate, contextually relevant responses for HR policy queries.
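Loading a 4-bit quantized Mistral-7B-Instruct-v0.2 is typically done through transformers with a bitsandbytes config; the following is a hedged sketch of that pattern, where the exact quantization settings and the sample prompt are assumptions rather than the repository's verified configuration.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# NF4 4-bit weights with bfloat16 compute is a common choice; the repo's
# exact settings are an assumption.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

model_id = "mistralai/Mistral-7B-Instruct-v0.2"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, quantization_config=bnb_config, device_map="auto"
)

# In the RAG flow, retrieved policy passages would be prepended to the query.
prompt = "[INST] Based on the code of conduct, what is the gifts policy? [/INST]"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=128)[0]))
```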
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
PyTorch native quantization and sparsity for training and inference
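With torchao, post-training quantization is a single in-place call; a minimal sketch assuming the int8 weight-only path (the toy model is illustrative):

```python
import torch
from torchao.quantization import quantize_, int8_weight_only

# Any float model works; a small Linear stack stands in for a real network.
model = torch.nn.Sequential(
    torch.nn.Linear(128, 256), torch.nn.ReLU(), torch.nn.Linear(256, 10)
)

# Swap Linear weights to int8 in place; activations stay in floating point.
quantize_(model, int8_weight_only())

x = torch.randn(4, 128)
print(model(x).shape)  # torch.Size([4, 10])
```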
Trustworthy onboard satellite AI in PyTorch→ONNX→INT8 with calibration, telemetry, and a PhiSat-2 EO tile-filter demo.
🤖 Build AI agents that combine OpenAI's orchestration and Claude's execution for effective production solutions.
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
📊 Transform documents into a smart knowledge base using Neo4j and Azure AI for efficient, intelligent searching and answer generation.
🌐 Run GGUF models directly in your web browser using JavaScript and WebAssembly for a seamless and flexible AI experience.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
🚀 Simplify running, sharing, and shipping Hugging Face models with autopack; it quantizes and exports to multiple formats effortlessly.
🔍 Optimize RAG systems by exploring Lexical, Semantic, and Hybrid Search methods for better context retrieval and improved LLM responses.
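Hybrid retrieval commonly fuses a lexical score (e.g. BM25) with a semantic similarity via a normalized weighted sum; a self-contained sketch of that fusion, with the alpha weight and toy scores as assumptions:

```python
import numpy as np

def hybrid_rank(lexical: np.ndarray, semantic: np.ndarray, alpha: float = 0.5):
    """Blend per-document lexical (e.g. BM25) and semantic (cosine) scores."""
    def minmax(s):
        span = s.max() - s.min()
        return (s - s.min()) / span if span > 0 else np.zeros_like(s)
    fused = alpha * minmax(lexical) + (1 - alpha) * minmax(semantic)
    return np.argsort(-fused)  # document indices, best first

# Toy scores for 4 documents; real systems would compute these per query.
lex = np.array([12.1, 3.4, 8.7, 0.2])
sem = np.array([0.31, 0.88, 0.45, 0.10])
print(hybrid_rank(lex, sem, alpha=0.4))
```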
Open-source quant finance foundation unites trading tools and protocols, funds community projects, and boosts cross-project interoperability for collaboration 🐙
Neural Network Compression Framework for enhanced OpenVINO™ inference
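NNCF's post-training flow centers on nncf.quantize fed by a calibration nncf.Dataset; a minimal sketch, assuming random placeholder calibration batches and an off-the-shelf ResNet-18:

```python
import nncf
import torch
import torchvision

# Any float model; ResNet-18 is just an example.
model = torchvision.models.resnet18(weights=None).eval()

# Calibration data: NNCF wraps an iterable plus a transform into nncf.Dataset.
loader = [torch.randn(1, 3, 224, 224) for _ in range(8)]  # placeholder batches
calibration_dataset = nncf.Dataset(loader, lambda batch: batch)

# Insert fake-quantize ops calibrated on the samples; export then targets OpenVINO.
quantized_model = nncf.quantize(model, calibration_dataset)
```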
Quantize TinyLlama-1.1B-Chat from PyTorch to CoreML (float16, int8, int4) for efficient on-device inference on iOS 18+.
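Weight quantization of an already-converted Core ML package can go through coremltools' linear_quantize_weights; a sketch under the assumption that a converted .mlpackage exists on disk (the file names are placeholders):

```python
import coremltools as ct
from coremltools.optimize.coreml import (
    OpLinearQuantizerConfig, OptimizationConfig, linear_quantize_weights,
)

# "TinyLlama.mlpackage" is a placeholder for an already-converted model.
mlmodel = ct.models.MLModel("TinyLlama.mlpackage")

# Symmetric int8 weight quantization; 4-bit variants exist in newer
# coremltools releases via separate configs.
op_config = OpLinearQuantizerConfig(mode="linear_symmetric")
config = OptimizationConfig(global_config=op_config)

quantized = linear_quantize_weights(mlmodel, config=config)
quantized.save("TinyLlama-int8.mlpackage")
```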
Wrapture lets you go from a Python-trained model to deployable JavaScript with a single command. It generates TypeScript bindings and a Web/Node-compatible wrapper, using WebGPU/WASM-ready ONNX runtimes.
This project implements a complete pipeline for 3D mesh preprocessing, normalization, quantization, and error analysis. The work simulates the data preparation phase for AI systems like SeamGPT that work with 3D meshes.
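The quantization step of such a pipeline amounts to normalizing vertices into a unit cube and snapping them onto an N-bit integer grid; a generic sketch, with the bit width and RMS error metric chosen for illustration:

```python
import numpy as np

def quantize_mesh(vertices: np.ndarray, bits: int = 10):
    """Normalize vertices to [0, 1]^3, snap to a 2^bits grid, report RMS error."""
    vmin, vmax = vertices.min(axis=0), vertices.max(axis=0)
    scale = (vmax - vmin).max()            # uniform scale keeps aspect ratio
    normalized = (vertices - vmin) / scale
    levels = (1 << bits) - 1
    quantized = np.round(normalized * levels).astype(np.int32)
    reconstructed = quantized / levels * scale + vmin
    rms = np.sqrt(np.mean((vertices - reconstructed) ** 2))
    return quantized, rms

verts = np.random.rand(1000, 3) * 10.0     # stand-in for a loaded mesh
q, err = quantize_mesh(verts, bits=10)
print(f"10-bit RMS error: {err:.6f}")
```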
Advanced Quantization Algorithm for LLMs and VLMs, with support for CPU, Intel GPU, CUDA and HPU.
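AutoRound's published entry point wraps a model and tokenizer and tunes weight rounding before export; the sketch below follows that documented pattern, with the model choice, hyperparameters, and output path as assumptions:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from auto_round import AutoRound

# Model choice is illustrative; AutoRound targets LLMs and VLMs generally.
model_id = "facebook/opt-125m"
model = AutoModelForCausalLM.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Tune the weight rounding, then export 4-bit grouped weights.
autoround = AutoRound(model, tokenizer, bits=4, group_size=128)
autoround.quantize()
autoround.save_quantized("./opt-125m-int4")
```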