-
Ajari Technologies
- Indonesia
-
19:52
(UTC +07:00) - zanjabil2502.github.io/personal-profile
- in/zanjabila
Stars
What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers?
Run Google's Gemma 4 models entirely on-device, embedded in a Node.js process. Text, image, and audio in — text out. No API keys, no cloud, no network required after the initial model download.
📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG
Collection of leaked system prompts
Extracted system prompts from Anthropic - Claude Fable 5, Opus 4.8, Claude Code, Claude Design. OpenAI - ChatGPT 5.5 Thinking, GPT 5.5 Instant, Codex. Google - Gemini 3.5 Flash, 3.1 Pro, Antigravit…
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
SigNoz is an open-source observability platform native to OpenTelemetry with logs, traces and metrics in a single application. An open-source alternative to DataDog, NewRelic, etc. 🔥 🖥. 👉 Open sour…
Handbook of Marine Craft Hydrodynamics and Motion Control is an extensive study of the latest research in marine craft hydrodynamics, guidance, navigation, and control (GNC) systems.
The Python Vehicle Simulator is software that supplements the textbook "Handbook of Marine Craft Hydrodynamics and Motion Control," 2nd Edition, by T. I. Fossen, published in 2021 by John Wiley & S…
An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.
Build Real-Time Knowledge Graphs for AI Agents
A curated list of awesome open-source libraries for context engineering (Long-term memory, MCP: Model Context Protocol, Prompt/RAG Compression, Multi-Agent)
Llama Agents + Workflows are an event-driven, async-first, step-based way to control the execution flow of AI applications like agents.
In-depth tutorials on LLMs, RAGs and real-world AI agent applications.
Voice Activity Detector (VAD) : low-latency, high-performance and lightweight
High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.
An open-source AI agent that brings the power of Gemini directly into your terminal.
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
LLM for Long Text Summary (Comprehensive Bulleted Notes)
Official implementation of the paper "Vessel trajectory prediction with recurrent neural networks: An evaluation of datasets, features, and architectures"
Transforms complex documents like PDFs and Office docs into LLM-ready markdown/JSON for your Agentic workflows.
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Live real-time avatars from your webcam in the browser. No dedicated hardware or software installation needed. A pure Google Colab wrapper for live First-order-motion-model, aka Avatarify in the br…