Skip to content
View meettyj's full-sized avatar
💻
Focusing
💻
Focusing

Block or report meettyj

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This is the official code for the paper: ALDEN: Reinforcement Learning for Active Navigation and Evidence Gathering in Long Documents

Python 3 Updated Apr 9, 2026

The agent that grows with you

Python 96,536 13,560 Updated Apr 17, 2026

We propose Reinforcement Learning from Community Feedback (RLCF), a training paradigm that uses large-scale community signals as supervision, and formulate scientific taste learning as a preference…

391 10 Updated Mar 29, 2026

ARIS ⚔️ (Auto-Research-In-Sleep) — Lightweight Markdown-only skills for autonomous ML research: cross-model review loops, idea discovery, and experiment automation. No framework, no lock-in — works…

Python 6,928 642 Updated Apr 17, 2026

An unofficial, typed, asynchronous Python SDK for Tastytrade!

Python 218 74 Updated Apr 8, 2026

Let Claude manage your tastytrade portfolio.

Python 61 24 Updated Apr 10, 2026

This repository contains the toolkit for replicating results from our technical report.

Python 246 27 Updated Sep 3, 2025

"what, how, where, and how well? a survey on test-time scaling in large language models" repository

HTML 95 3 Updated Apr 16, 2026

DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents

Python 684 72 Updated Apr 16, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,440 542 Updated Apr 17, 2026

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,807 173 Updated Feb 27, 2026

TradingAgents: Multi-Agents LLM Financial Trading Framework

Python 51,196 9,237 Updated Apr 13, 2026

"AI-Trader: 100% Fully-Automated Agent-Native Trading"

Python 13,479 2,282 Updated Apr 10, 2026

🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.

MDX 73,439 7,919 Updated Mar 11, 2026

Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, i…

Python 40,867 6,432 Updated Apr 17, 2026

The best ChatGPT that $100 can buy.

Python 52,054 6,913 Updated Apr 14, 2026

Implement a reasoning LLM in PyTorch from scratch, step by step

Jupyter Notebook 4,131 586 Updated Apr 17, 2026

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 90,973 13,978 Updated Apr 16, 2026

[arXiv 25] OCRGenBench: A Comprehensive Benchmark for Evaluating OCR Generative Capabilities

Python 262 4 Updated Apr 13, 2026

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 109,108 12,674 Updated Apr 17, 2026
Python 13 Updated Sep 30, 2025

A search engine dedicated to CS conferences. It provides useful filters for conferences and year range.

JavaScript 226 7 Updated Mar 30, 2026

This is an open-source toolkit for Heterogeneous Graph Neural Network(OpenHGNN) based on DGL.

Python 971 165 Updated Sep 22, 2025

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 159,523 32,903 Updated Apr 17, 2026

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.5, GPT-OSS, Llama, and more!

Python 9,174 793 Updated Apr 16, 2026

[TMLR 2025] Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models

762 37 Updated Feb 28, 2026

Implementation of Reinforcement Pre-Training (RPT) for Language Models - ArXiv:2506.08007

Python 22 2 Updated Jul 19, 2025

Repo for paper "On The Design Choices of Next Level LLMs"

9 Updated Jun 22, 2025

⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)

Python 3,459 297 Updated Apr 10, 2026

复现大模型相关算法及一些学习记录

Python 3,272 439 Updated Mar 21, 2026
Next