swj0419

Weijia Shi swj0419

237 followers · 23 following

https://weijia-shi.netlify.app/

Stars

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,443 1,995 Updated Nov 1, 2025

zhaochen0110 / Awesome_Think_With_Images

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,214 39 Updated Oct 4, 2025

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 4,881 467 Updated Dec 21, 2025

hamishivi / EasyLM

Forked from young-geng/EasyLM

Large language models (LLMs) made easy. EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving LLMs in JAX/Flax.

Python 76 16 Updated Aug 17, 2024

alvin-zyl / CoLA

Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation

Python 25 2 Updated Feb 18, 2025

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 2,083 119 Updated Jun 2, 2025

allenai / OLMoE

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 931 87 Updated Sep 23, 2025

1jsingh / negtome

Official Implementation for paper: Negative Token Merging: Image-based Adversarial Feature Guidance

Jupyter Notebook 75 2 Updated Jun 23, 2025

lucidrains / transfusion-pytorch

PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 1,286 66 Updated Dec 4, 2025

VikParuchuri / textbook_quality

Generate textbook-quality synthetic LLM pretraining data

Python 508 49 Updated Oct 19, 2023

zhichaoxu-shufe / context-aware-decoding-qfs

Python 14 Updated Jan 10, 2024

InfiAgent / InfiAgent

InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)

Python 162 21 Updated May 29, 2025

TRI-ML / prismatic-vlms

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python 884 835 Updated Jul 4, 2024

minimario / math-retrieval

Python 2 Updated Jan 24, 2024

EvolvingLMMs-Lab / Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,283 209 Updated Mar 5, 2024

swj0419 / detect-pretrain-code

This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Ajith, Mengzhou Xia, Yangsibo Huang, Daogao Liu, Terra Blevins…

Python 237 27 Updated Nov 3, 2023

JieyuZ2 / EcoAssistant

EcoAssistant: using LLM assistants more affordably and accurately

Python 133 8 Updated Jun 30, 2024

kernelmachine / silo-lm

SILO Language Models code repository

Python 83 11 Updated Feb 23, 2024

huggingface / OBELICS

Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M documents, 115B text tokens and 353M images.

Python 211 11 Updated Aug 28, 2024

giuven95 / chatgpt-failures

Failure archive for ChatGPT and similar models

Python 599 24 Updated Apr 7, 2023

wyu97 / RACo

Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.

23 1 Updated Nov 23, 2022

suzgunmirac / BIG-Bench-Hard

Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

536 32 Updated Jun 25, 2024

afiaka87 / retrieval-augmented-diffusion

Forked from CompVis/latent-diffusion

Retrieval augmented diffusion from CompVis.

Jupyter Notebook 53 7 Updated Aug 20, 2022

allenai / RL4LMs

DevSinghSachan / art

texttron / tevatron

castorini / pyserini

neulab / knn-transformers

r2llab / wrangl

princeton-nlp / TRIME