-
heartbyte.io
- The Earth
- https://diabhey.com
- @diabhey
- in/abhimanyuselvan
Lists (12)
Sort Name ascending (A-Z)
Stars
Stable Diffusion web UI
A high-throughput and memory-efficient inference and serving engine for LLMs
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
The world's simplest facial recognition api for Python and the command line
CLI platform to experiment with codegen. Precursor to: https://lovable.dev
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations, and production-ready dashboards.
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Open Source framework for voice and multimodal conversational AI
Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models
Prometheus-based Kubernetes Resource Recommendations
Speech To Speech: an effort for an open-sourced and modular GPT4-o
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
Real time transcription with OpenAI Whisper.
The official Python SDK for the ElevenLabs API.
This repository contains a collection of awesome tools and scripts for Developers and Engineers seeking to automate routine tasks on AWS Cloud.
HyperGen - Optimized inference and fine-tuning framework for diffusion (image & video) models. Up to 3x faster & 80% less VRAM.
A ChatGPT bot for Kubernetes issues.
This repository contains tutorials and examples for Triton Inference Server
Provides end-to-end model development pipelines for LLMs and Multimodal models that can be launched on-prem or cloud-native.
Optimized local inference for LLMs with HuggingFace-like APIs for quantization, vision/language models, multimodal agents, speech, vector DB, and RAG.
Example DigitalOcean Kubernetes workload with service exposed through a DO load-balancer.
A toolkit for processing speech data and creating speech datasets
Tensorflow Object Detection API Web Service wrapper that works on any <video> tag and WebRTC streams