Starred repositories
A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of differenβ¦
Arch is an intelligent prompt gateway. Engineered with (fast) LLMs for the secure handling, robust observability, and seamless integration of prompts with APIs - all outside business logic. Built bβ¦
LangChain 곡μ Document, Cookbook, κ·Έ λ°μ μ€μ© μμ λ₯Ό λ°νμΌλ‘ μμ±ν νκ΅μ΄ νν 리μΌμ λλ€. λ³Έ νν 리μΌμ ν΅ν΄ LangChainμ λ μ½κ³ ν¨κ³Όμ μΌλ‘ μ¬μ©νλ λ°©λ²μ λ°°μΈ μ μμ΅λλ€.
AI-data warehouse to enrich, transform and analyze data from cloud storages
LlamaIndex is a data framework for your LLM applications
This is a workshop designed for Amazon Bedrock a foundational model service.
Modular and structured prompt caching for low-latency LLM inference
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
Plumb a PDF for detailed information about each char, rectangle, line, et cetera βΒ and easily extract text and tables.
Community maintained fork of pdfminer - we fathom PDF
Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
π¦π Build context-aware reasoning applications
Quickly clone an entire org/users repositories into one directory - Supports GitHub, GitLab, Bitbucket, and more ππ₯
νκΈ ν μ€νΈ μλ² λ© λͺ¨λΈ 리λ보λ
Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
CLI tool to generate terraform files from existing infrastructure (reverse Terraform). Infrastructure to Code
A character-wise tokenizer for morphologically rich languages
μλ‘λ¬λ‘ λ΄ λλμ΄ λ¬Όμ¬ λλ λͺ¨λ°μΌ μ²μ²©μ₯μ λλ€.
Sampling CPU and HEAP profiler for Java featuring AsyncGetCallTrace + perf_events
Common solutions and tools developed by Google Cloud's Professional Services team. This repository and its contents are not an officially supported Google product.
Google Cloud Platform Vertex AI end-to-end workflows for machine learning operations
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.