A model-agnostic layered cognitive framework for LLMs. Improves coherence, emotional clarity, structural reasoning, and creative depth across GPT, Claude, Gemini, Grok, Mistral, and others—while re…

13 1 Updated Apr 7, 2026

Fullstack RAG as a service template

TypeScript 26 5 Updated May 15, 2025

The Deep Research Assistant is meticulously crafted on Mastra's modular, scalable architecture, designed for intelligent orchestration and seamless human-AI interaction. It's built to tackle comple…

TypeScript 31 4 Updated Sep 18, 2025

An example Nuxt 4 app using Mastra AI agent framework

TypeScript 33 2 Updated Jun 13, 2025

A2A Mastra Demo - Multi-Agent System with Amazon Bedrock

TypeScript 39 2 Updated Mar 28, 2026

Ship Agent2Agent in one line of code.

TypeScript 42 8 Updated Feb 20, 2026

An AI agent that searches the web and creates research reports

TypeScript 48 4 Updated Oct 22, 2025

Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web. Make your own persistent autonomous agent on top!

Python 4,270 377 Updated Apr 14, 2026

[NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient

Python 65 1 Updated Sep 27, 2025

Real-time, full-duplex AI voice bot integrating NVIDIA's PersonaPlex with Twilio Media Streams for natural speech-to-speech conversations.

Python 7 1 Updated Feb 1, 2026

Voice bridge connecting PersonaPlex (System 1 fast response) with Letta/Claude (System 2 reasoning). Talker-Reasoner coordination service.

TypeScript 8 1 Updated Feb 21, 2026

PersonaPlex code.

Python 9,271 1,297 Updated Mar 2, 2026

Kronos: A Foundation Model for the Language of Financial Markets

Python 17,610 3,294 Updated Apr 13, 2026

Memory library for building stateful agents

Python 2,361 271 Updated Apr 14, 2026

"DeepTutor: Agent-Native Personalized Learning Assistant"

Python 18,035 2,371 Updated Apr 14, 2026

A gallery that showcases on-device ML/GenAI use cases and allows people to try and use models locally.

Kotlin 21,014 1,991 Updated Apr 8, 2026

Code Implementation of SPA: A Simple but Tough-to-Beat Baseline for Knowledge Injection

Python 8 Updated Mar 24, 2026
TypeScript 1 Updated Mar 3, 2026

Gemma-4-E4B-it running locally in a browser with WebGPU.

TypeScript 1 Updated Apr 7, 2026

Local AI anywhere, for everyone — LLM inference, chat UI, voice, agents, workflows, RAG, and image generation. No cloud, no subscriptions.

Rust 488 141 Updated Apr 13, 2026

Run AI ✨ assistant locally! with simple API for Node.js 🚀

TypeScript 489 39 Updated Nov 16, 2025

My personal website.

TypeScript 2,249 251 Updated Apr 13, 2026

minimalist & responsive ai-powered portfolio template that creates an interactive ama (ask me anything) experience for your visitors

JavaScript 1 1 Updated Jan 27, 2025

LLM inference with 7x longer context. Pure C, zero dependencies. Lossless KV cache compression + single-header library.

C 379 43 Updated Apr 14, 2026

LLM inference in C/C++ with changes from Prism-ML to support 1Bit models

C++ 47 7 Updated Apr 3, 2026

Turbo1Bit: Combining 1-bit LLM weights (Bonsai) with TurboQuant KV cache compression for maximum inference efficiency. 4.2x KV cache compression + 16x weight compression = ~10x total memory reduction.

C 24 2 Updated Apr 2, 2026

Multimodal orchestration for LLM APIs. Source patterns, context caching, and structured output for text, PDFs, images, video, and YouTube - so you don't manage the complexity yourself.

Python 2 1 Updated Mar 21, 2026

AI agent for web scraping using LLM, RAG and Vector DB

Jupyter Notebook 1 1 Updated Aug 3, 2025

AI pipeline built with honc and workers-ai. Vector embeddings, web scraping, and processing with Cloudflare Workflows (beta)

TypeScript 33 1 Updated Apr 28, 2025

🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN

Python 63,980 6,558 Updated Apr 11, 2026