-
Shanghai Jiao Tong University
- Shanghai, China
-
21:42
(UTC +08:00) - https://orcid.org/0009-0001-9893-4148
- https://space.bilibili.com/35128368
- https://space.bilibili.com/3546595134015939
- in/siyuanwang0227
Highlights
- Pro
-
-
OpenHands Public
Forked from OpenHands/OpenHands🙌 OpenHands: Code Less, Make More
Python Other UpdatedDec 18, 2025 -
software-agent-sdk Public
Forked from OpenHands/software-agent-sdkA clean, modular SDK for building AI agents with OpenHands V1.
Python MIT License UpdatedDec 18, 2025 -
KeepGPU Public
KeepGPU is a simple CLI app that keeps your GPUs running.
-
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Python Other UpdatedNov 28, 2025 -
-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedNov 22, 2025 -
HELMET Public
Forked from princeton-nlp/HELMETThe HELMET Benchmark
Jupyter Notebook MIT License UpdatedNov 16, 2025 -
MCTS-GSM8k-Demo Public
This is a repo for showcasing using MCTS with LLMs to solve gsm8k problems
-
llm-sandbox Public
Forked from vndee/llm-sandboxLightweight and portable LLM sandbox runtime (code interpreter) Python library.
Python MIT License UpdatedNov 11, 2025 -
verl Public
Forked from volcengine/verlverl: Volcano Engine Reinforcement Learning for LLMs
Python Apache License 2.0 UpdatedSep 16, 2025 -
-
-
-
LongBench Public
Forked from THUDM/LongBench[ACL 2024] LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
Python MIT License UpdatedSep 1, 2025 -
-
RULER Public
Forked from NVIDIA/RULERThis repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
Python Apache License 2.0 UpdatedAug 30, 2025 -
VSCode-Tunnel-Manager Public
A python wrapper manager for VSCode tunnel.
-
Qwen2.5-Math Public
Forked from QwenLM/Qwen2.5-MathA series of math-specific large language models of our Qwen2 series.
Python UpdatedAug 6, 2025 -
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harnessA framework for few-shot evaluation of language models.
Python MIT License UpdatedJul 31, 2025 -
MInference Public
Forked from microsoft/MInferenceTo speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an A100 while maintaining accuracy.
Python MIT License UpdatedJul 31, 2025 -
Awesome-LLM-Reasoning Public
Forked from atfortes/Awesome-LLM-ReasoningReasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓
MIT License UpdatedJun 20, 2025 -
NoLiMa Public
Forked from adobe-research/NoLiMaOfficial repository for "NoLiMa: Long-Context Evaluation Beyond Literal Matching"
Python Other UpdatedJun 12, 2025 -
SPMM Public
Forked from jinhojsk515/SPMM[Nat. Comm. 2024] Multimodal learning for chemical domain, with SMILES and properties.
Python Apache License 2.0 UpdatedMay 14, 2025 -
MolCA Public
Forked from acharkq/MolCACode for EMNLP2023 paper "MolCA: Molecular Graph-Language Modeling with Cross-Modal Projector and Uni-Modal Adapter".
Python UpdatedMay 2, 2025 -
-
-
InfiniteBench Public
Forked from OpenBMB/InfiniteBenchCodes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718
Python MIT License UpdatedMar 26, 2025 -
Slow_Thinking_with_LLMs Public
Forked from RUCAIBox/Slow_Thinking_with_LLMsA series of technical report on Slow Thinking with LLM
Python UpdatedFeb 14, 2025 -
needle-threading Public
Forked from jonathan-roberts1/needle-threadingMIT License UpdatedFeb 11, 2025