Skip to content
View su-park's full-sized avatar
💭
🍉
💭
🍉

Highlights

  • Pro

Block or report su-park

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Python tool for translating subtitles using Google Gemini AI

Python 327 41 Updated Jun 12, 2026

Cloud-native, data onboarding architecture for Google Cloud Datasets

Python 177 76 Updated May 6, 2026

An autonomous agent that conducts deep research on any data using any LLM providers

Python 27,669 3,725 Updated May 28, 2026

A list of AI analytics tools (assistants, chat with data, text-to-sql, benchmarks, etc.)

128 20 Updated Jan 23, 2026

[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!

Python 864 68 Updated Mar 17, 2025

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 4,414 372 Updated Jun 12, 2026

Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 18,352 2,740 Updated May 19, 2026

This project uses the open-source model Mistral Small, deployed in Amazon SageMaker or invoked via API on Amazon Bedrock, to enable users to chat with their database using natural language, without…

Jupyter Notebook 24 4 Updated Mar 31, 2025

A framework for few-shot evaluation of language models.

Python 12,934 3,334 Updated Jun 2, 2026

자체 구축한 한국어 평가 데이터셋을 이용한 한국어 모델 평가

Python 31 2 Updated May 31, 2024

LLM Workshop by Sourab Mangrulkar

Jupyter Notebook 401 141 Updated Jun 16, 2024

Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.

Python 871 106 Updated May 4, 2026

Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.

Python 741 129 Updated May 18, 2026

💭 Aspect-Based-Sentiment-Analysis: Transformer & Explainable ML (TensorFlow)

Python 583 94 Updated May 24, 2026

Chinese-Vicuna: A Chinese Instruction-following LLaMA-based Model —— 一个中文低资源的llama+lora方案,结构参考alpaca

C 4,122 407 Updated Apr 18, 2025

This repository is for active development of the Azure SDK for Python. For consumers of the SDK we recommend visiting our public developer docs at https://learn.microsoft.com/python/azure/ or our v…

Python 5,553 3,306 Updated Jun 13, 2026

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

7,526 406 Updated Jul 16, 2023

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,949 372 Updated Jun 3, 2026

Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

Python 6,081 519 Updated Jul 1, 2025

LLM inference in C/C++

C++ 116,293 19,520 Updated Jun 13, 2026

The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

Python 21,195 3,603 Updated Jul 4, 2024

A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.

Python 7,682 5,457 Updated Jun 12, 2026

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation and works with PyTorch.

Python 1,270 160 Updated Mar 12, 2026

Best Practices on Recommendation Systems

Python 21,762 3,325 Updated Jun 5, 2026

Hive support for Feast offline store

Python 34 26 Updated Oct 28, 2022

A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch

Python 1,415 197 Updated Apr 30, 2026

특허분야 특화된 한국어 AI언어모델 KorPatBERT

70 9 Updated Jan 31, 2024

💫 Industrial-strength Natural Language Processing (NLP) in Python

Python 33,658 4,688 Updated May 19, 2026

Techniques for deep learning with satellite & aerial imagery

10,174 1,642 Updated Jun 7, 2026
Next