Stars
Instruct-tune LLaMA on consumer hardware
This repository contains demos I made with the Transformers library by HuggingFace.
Fault-tolerant, highly scalable GPU orchestration, and a machine learning framework designed for training models with billions to trillions of parameters
Benchmarking large language models' complex reasoning ability with chain-of-thought prompting
[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet...
bert-base-chinese example
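Graduate mathematical modeling, Huawei Cup mathematical modeling, 2021 Problem D (Star of Mathematical Modeling), breast cancer, machine learning, data analysis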
My coursework for Machine Learning (2021 Spring) at National Taiwan University (NTU)
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations
Framework to learn Named Entity Recognition models without labelled data using weak supervision.
This repo contains my custom-styled, Python-based plots for NLP papers, along with my reproductions of plots from my favourite papers.
☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models