Skip to content
View meettyj's full-sized avatar
💻
Focusing
💻
Focusing

Block or report meettyj

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

We propose Reinforcement Learning from Community Feedback (RLCF), a training paradigm that uses large-scale community signals as supervision, and formulate scientific taste learning as a preference…

391 10 Updated Mar 29, 2026

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…

Python 6,601 621 Updated Apr 14, 2026

An unofficial, typed, asynchronous Python SDK for Tastytrade!

Python 218 74 Updated Apr 8, 2026

Let Claude manage your tastytrade portfolio.

Python 61 23 Updated Apr 10, 2026

This repository contains the toolkit for replicating results from our technical report.

Python 245 26 Updated Sep 3, 2025

"what, how, where, and how well? a survey on test-time scaling in large language models" repository

HTML 95 3 Updated Apr 14, 2026

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Python 679 72 Updated Apr 13, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,424 541 Updated Apr 14, 2026

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,796 167 Updated Feb 27, 2026

TradingAgents: Multi-Agents LLM Financial Trading Framework

Python 50,430 9,110 Updated Apr 13, 2026

"AI-Trader: 100% Fully-Automated Agent-Native Trading"

Python 13,373 2,259 Updated Apr 10, 2026

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 73,306 7,902 Updated Mar 11, 2026

Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, i…

Python 40,719 6,407 Updated Apr 14, 2026

The best ChatGPT that $100 can buy.

Python 51,839 6,891 Updated Apr 14, 2026

Implement a reasoning LLM in PyTorch from scratch, step by step

Jupyter Notebook 4,114 584 Updated Apr 8, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 90,768 13,928 Updated Apr 11, 2026

[arXiv 25] OCRGenBench: A Comprehensive Benchmark for Evaluating OCR Generative Capabilities

Python 262 4 Updated Apr 13, 2026

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 108,782 12,627 Updated Apr 14, 2026
Python 13 Updated Sep 30, 2025

A search engine dedicated to CS conferences. It provides useful filters for conferences and year range.

JavaScript 222 7 Updated Mar 30, 2026

This is an open-source toolkit for Heterogeneous Graph Neural Network(OpenHGNN) based on DGL.

Python 970 164 Updated Sep 22, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 159,373 32,869 Updated Apr 14, 2026

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.5, GPT-OSS, Llama, and more!

Python 9,164 791 Updated Apr 14, 2026

[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

762 37 Updated Feb 28, 2026

Implementation of Reinforcement Pre-Training (RPT) for Language Models - ArXiv:2506.08007

Python 22 2 Updated Jul 19, 2025

Repo for paper "On The Design Choices of Next Level LLMs"

9 Updated Jun 22, 2025

⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)

Python 3,453 297 Updated Apr 10, 2026

复现大模型相关算法及一些学习记录

Python 3,254 439 Updated Mar 21, 2026

LLaMA 2 implemented from scratch in PyTorch

Python 369 71 Updated Sep 25, 2023

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 56,659 9,693 Updated Nov 12, 2025
Next