
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL

Python 4,309 372 Updated Nov 13, 2025

Unofficial implementation of Titans, SOTA memory for transformers, in Pytorch

Python 1,936 203 Updated Feb 9, 2026

Open Source Landscapes and Insights Produced by AntOSS

TypeScript 399 20 Updated Mar 20, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,261 3,519 Updated Mar 27, 2026

Democratizing Reinforcement Learning for LLMs

Python 5,298 528 Updated Mar 27, 2026

Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.

Python 4,924 433 Updated Mar 27, 2026

verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in-Group Policy Optimization for LLM Agent Training"

Python 1,736 157 Updated Feb 27, 2026
Python 5 2 Updated Nov 12, 2025

Lightspeed Geometric Dataset Distance via Sliced Optimal Transport (ICML 2025)

Python 5 Updated May 7, 2025

An interface library for RL post training with environments.

Python 1,398 227 Updated Mar 26, 2026

These are the read counts obtained from RNAseq data, which is deposited in the European Nucleotide Archive (ENA) (Leinonen et al., 2011), under accession number PRJEB23709. The work is published in C…

1 1 Updated Dec 7, 2021

The implementation of the EMNLP 2022 paper "MetaFill: Text Infilling for Meta-Path Generation on Heterogeneous Information Networks."

Python 3 Updated Oct 17, 2022

GenoTEX: An expert-curated benchmark for evaluating LLM agents on real-world gene expression analysis tasks. (MLCB 2025 Oral)

Jupyter Notebook 64 7 Updated Oct 13, 2025

Official repository for the paper "Topological Neural Discrete Representation Learning à la Kohonen" (ICML 2023 Workshop on Sampling and Optimization in Discrete Space)

Python 12 1 Updated Jun 11, 2025

AIDE: AI-Driven Exploration in the Space of Code. The machine learning engineering agent that automates AI R&D.

Python 1,186 180 Updated Feb 12, 2026

🤗 smolagents: a barebones library for agents that think in code.

Python 26,300 2,405 Updated Mar 13, 2026

Renderer for the harmony response format to be used with gpt-oss

Rust 4,244 263 Updated Mar 26, 2026

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,950 2,063 Updated Mar 26, 2026

Democratizing AI scientists with ToolUniverse

Python 1,184 186 Updated Mar 27, 2026

This repository mainly collects interview questions for large language model (LLM) algorithm engineers.

2,499 169 Updated Dec 26, 2024

A benchmark designed to assess AI scientists' ability to generate biological discoveries through data analysis and reasoning with external knowledge.

Jupyter Notebook 12 Updated Mar 18, 2026

Cell Painting Gallery

123 16 Updated Mar 27, 2026

The code for "Learning Molecular Representation in a Cell"

Python 40 8 Updated Mar 6, 2025
Python 41 3 Updated Sep 23, 2025

A curated list of LLM powered AI Agents in Biomedical Research. Medical Image Analysis, Multi-omics Genomics Analysis, Biomedical Scientific Discoveries...

74 5 Updated Sep 28, 2025

Madrigal: Multimodal AI predicts clinical outcomes of drug combinations from preclinical data

Jupyter Notebook 41 9 Updated Jul 31, 2025

A powerful and flexible machine learning platform for drug discovery

Python 1,574 219 Updated Aug 12, 2024

Arc Virtual Cell Atlas

Jupyter Notebook 508 50 Updated Feb 12, 2026

Minimal hackable GRPO implementation

Python 333 43 Updated Jan 31, 2025