🏸
For fun
  • Shopee
  • Singapore

Jupyter Notebook 37 5 Updated Jun 8, 2025

❗ uplift modeling in scikit-learn style in python 🐍

Python 783 102 Updated Oct 21, 2023

Uplift modeling and causal inference with machine learning algorithms

Python 5,604 838 Updated Sep 26, 2025
Python 1,458 86 Updated Sep 30, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

64,660 7,190 Updated Jun 4, 2025

Train transformer language models with reinforcement learning.

Python 15,784 2,226 Updated Oct 9, 2025

The AdTEC dataset is designed to evaluate the quality of ad texts from multiple aspects, considering practical advertising operations.

5 1 Updated Jun 26, 2025

CAMERA3: An Evaluation Dataset for Controllable Ad Text Generation in Japanese

4 Updated May 15, 2024

Multimodal dataset for ad text generation in Japanese [Mita+, ACL2024]

26 2 Updated Aug 13, 2024

Bandit algorithms simulations for online learning

Jupyter Notebook 88 34 Updated May 13, 2020

Python implementations of contextual bandits algorithms

Python 805 149 Updated Jun 17, 2025

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive lea…

C++ 8,610 1,934 Updated Oct 17, 2024

A configurable, tunable, and reproducible library for CTR prediction https://fuxictr.github.io

Python 1,291 210 Updated Jun 16, 2025

Code and data for the VLDB 2024 benchmark paper: Are Large Language Models a Good Replacement of Taxonomies?

Python 7 3 Updated Jan 7, 2025

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 4,147 338 Updated Jun 30, 2025

[ACL 2025 Oral] 🔥🔥 MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval

Jupyter Notebook 225 9 Updated May 22, 2025

Your agent in your terminal, equipped with local tools: writes code, uses the terminal, browses the web, vision.

Python 4,030 334 Updated Oct 9, 2025

Codes and Datasets for the ACL2023 Findings Paper: FolkScope: Intention Knowledge Graph Construction for Discovering E-commerce Commonsense

Python 36 5 Updated Mar 3, 2025

A high-performance distributed file system designed to address the challenges of AI training and inference workloads.

C++ 9,363 945 Updated Sep 23, 2025

🤗 smolagents: a barebones library for agents that think in code.

Python 23,290 2,043 Updated Oct 9, 2025

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 7,141 610 Updated Oct 9, 2025

This repository provides programs to build Retrieval Augmented Generation (RAG) code for Generative AI with LlamaIndex, Deep Lake, and Pinecone leveraging the power of OpenAI and Hugging Face model…

Jupyter Notebook 518 169 Updated Sep 23, 2025

The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.

Python 6,233 665 Updated Aug 17, 2025

E5-V: Universal Embeddings with Multimodal Large Language Models

Python 271 10 Updated Dec 23, 2024

Explore a comprehensive collection of resources, tutorials, papers, tools, and best practices for fine-tuning Large Language Models (LLMs). Perfect for ML practitioners and researchers!

9 1 Updated Dec 2, 2024

Finance specialized RAG System for the ACM-ICAIF '24 Competition.

Python 54 9 Updated Nov 27, 2024

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 74,942 10,961 Updated Oct 9, 2025

(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions

Python 258 25 Updated Apr 14, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance

Python 9,298 722 Updated Sep 22, 2025