-
dongwoo fine-chem - Data Scientist
- seoul, republic of korea
- http://lsjsj92.tistory.com/
- in/lsjsj92
Stars
An orchestration platform for the development, production, and observation of data assets.
"CLI-Anything: Making ALL Software Agent-Native" -- CLI-Hub: https://clianything.cc/
An open source trusted cloud native registry project that stores, signs, and scans content.
Use Garry Tan's exact Claude Code setup: 23 opinionated tools that serve as CEO, Designer, Eng Manager, Release Manager, Doc Engineer, and QA
Official Python SDK for the Agent2Agent (A2A) Protocol
AI agents running research on single-GPU nanochat training automatically
A set of ready to use Agent Skills for research, science, engineering, analysis, finance and writing.
An agentic skills framework & software development methodology that works.
Claude Code plugins for power users
A fast yet powerful Python Markdown parser with renderers and plugins.
OCR model that handles complex tables, forms, handwriting with full layout.
Toolkit for linearizing PDFs for LLM datasets/training
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
DuckDB is an analytical in-process SQL database management system
An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)
Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with intelligent structured data extraction and advanced OCR.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)
The absolute trainer to light up AI agents.
📄 Awesome OCR multiple programing languages toolkits based on ONNX Runtime, OpenVINO, MNN, PaddlePaddle, TensorRT and PyTorch.
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
Model Context Protocol Servers
A modular graph-based Retrieval-Augmented Generation (RAG) system
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
This repository delivers end-to-end, code-first tutorials covering every layer of production-grade GenAI agents, guiding you from spark to scale with proven patterns and reusable blueprints for re…
Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.