Stars
Implement a reasoning LLM in PyTorch from scratch, step by step
📚 Tech blogs & talks by companies that run Apache Flink in production
High-performance data engine for AI and multimodal workloads. Process images, audio, video, and structured data at any scale
Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.
Open Lakehouse Format for Multimodal AI. Convert from Parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, DuckDB, Polars, Pyarrow, a…
Understanding Deep Learning - Simon J.D. Prince
Beta release of Archon OS - the knowledge and task management backbone for AI coding assistants.
This repository delivers end-to-end, code-first tutorials covering every layer of production-grade GenAI agents, guiding you from spark to scale with proven patterns and reusable blueprints for re…
Snowflake Data Source for Apache Spark.
Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
New file format for storage of large columnar datasets.
Hierarchical Reasoning Model Official Release
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
A Spark plugin for reading and writing Excel files
AWS MCP Servers — helping you get the most out of AWS, wherever you use MCP.
Apache DataFusion Comet Spark Accelerator
Horizontal scaling for PostgreSQL with automatic sharding.
Supercharge Your LLM with the Fastest KV Cache Layer
AI Crash Course to help busy builders catch up to the public frontier of AI research in 2 weeks
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
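This repository contains the complete research and analysis materials from reverse engineering Claude Code v1.0.33. It includes in-depth technical analysis of the obfuscated source code, system architecture documentation, and an implementation blueprint for reconstructing the Claude Code agent system. Key findings include a real-time steering mechanism, a multi-agent architecture, intelligent context management, and a tool execution pipeline. The project provides a technical reference for understanding the design and implementation of modern AI agent systems.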
A beautiful, non-destructive, and GPU-accelerated RAW image editor built with performance in mind.
eBPF Developer Tutorial: Learning eBPF Step by Step with Examples
Claude Code is an agentic coding tool that lives in your terminal, understands your codebase, and helps you code faster by executing routine tasks, explaining complex code, and handling git workflo…
Chrome MCP Server is a Chrome extension-based Model Context Protocol (MCP) server that exposes your Chrome browser functionality to AI assistants like Claude, enabling complex browser automation, c…