raojay7's starred repositories

Your own personal AI assistant. Any OS. Any Platform. The lobster way. 🦞

TypeScript 341,138 67,325 Updated Mar 30, 2026

🚀 Enhanced GRPO with more verifiable rewards and real-time evaluators

Python 37 Updated Jan 27, 2026

SFT of Reasoning LLMs with Megatron-LM

Python 20 1 Updated Jun 19, 2025

This is a repo with links to everything you'd ever want to learn about data engineering

Jupyter Notebook 40,757 7,762 Updated Mar 18, 2026

Codebase for Iterative DPO Using Rule-based Rewards

Python 270 34 Updated Apr 11, 2025

Python 762 47 Updated Dec 23, 2025

The official repo for the paper "LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods".

548 27 Updated Jul 29, 2025

The code of our paper "RaSeRec: Retrieval-Augmented Sequential Recommendation"

Python 28 4 Updated Jan 7, 2025

A reading list on LLM based Synthetic Data Generation 🔥

1,532 91 Updated Jun 5, 2025

A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide researchers, practitioners, and enthusiasts with insights i…

HTML 1,809 99 Updated Mar 28, 2026

DISC-FinLLM, a Chinese financial large language model (LLM) designed to provide users with professional, intelligent, and comprehensive financial consulting services in financial scenarios.

Python 856 92 Updated Nov 1, 2023

Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]

Python 149 10 Updated Oct 27, 2024

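FinQwen: a project dedicated to building an open, stable, high-quality financial large model. It builds an intelligent question-answering system for financial scenarios on top of large models and uses open source to advance "AI + finance".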

Jupyter Notebook 433 49 Updated Jun 11, 2024

This repository contains a collection of the best system prompts for ChatGPT, a conversational AI model developed by OpenAI. Star this repository to help us reach 5,000 stars!

1,192 137 Updated Dec 11, 2024

Evaluate your LLM's response with Prometheus and GPT4 💯

Python 1,062 69 Updated Apr 25, 2025

[CSUR 2025] Continual Learning of Large Language Models: A Comprehensive Survey

533 22 Updated Dec 23, 2025

Scalable toolkit for efficient model alignment

Python 850 106 Updated Oct 6, 2025

Ongoing research training transformer models at scale

Python 15,848 3,768 Updated Mar 30, 2026

Automatic ticket grabbing for Damai, with support for selecting attendees, city, date and session, and price.

Python 6,156 751 Updated Feb 4, 2026

Ticket-grabbing script for concerts and shows on Damai.

Python 662 96 Updated Aug 7, 2024

The official implementation of Self-Play Fine-Tuning (SPIN)

Python 1,235 105 Updated May 8, 2024

This repository collects papers for "A Survey on Knowledge Distillation of Large Language Models". We break down KD into Knowledge Elicitation and Distillation Algorithms, and explore the Skill & V…

1,268 72 Updated Mar 9, 2025

A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.

Python 824 56 Updated Jul 15, 2025

Free ChatGPT & DeepSeek API key. Free access to the DeepSeek API and GPT4 API, with support for gpt | deepseek | claude | gemini | grok and other top-ranked, commonly used large models.

Python 36,974 2,583 Updated Mar 6, 2026

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model with performance approaching GPT-4o.

Python 9,931 767 Updated Sep 22, 2025

contrastive decoding

Python 206 14 Updated Nov 14, 2022

[CVPR 2024 Highlight] Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive Decoding

Python 390 26 Updated Oct 7, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Python 6,809 752 Updated Mar 30, 2026

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Python 4,933 371 Updated Dec 7, 2024