View swj0419's full-sized avatar

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,443 1,995 Updated Nov 1, 2025

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,214 39 Updated Oct 4, 2025

Democratizing Reinforcement Learning for LLMs

Python 4,881 467 Updated Dec 21, 2025

Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving LLMs in JAX/Flax.

Python 76 16 Updated Aug 17, 2024

Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation

Python 25 2 Updated Feb 18, 2025

Official Repo for Open-Reasoner-Zero

Python 2,083 119 Updated Jun 2, 2025

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 931 87 Updated Sep 23, 2025

Official Implementation for paper: Negative Token Merging: Image-based Adversarial Feature Guidance

Jupyter Notebook 75 2 Updated Jun 23, 2025

PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from Meta AI

Python 1,286 66 Updated Dec 4, 2025

Generate textbook-quality synthetic LLM pretraining data

Python 508 49 Updated Oct 19, 2023

InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)

Python 162 21 Updated May 29, 2025

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python 884 835 Updated Jul 4, 2024

Python 2 Updated Jan 24, 2024

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,283 209 Updated Mar 5, 2024

This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Ajith, Mengzhou Xia, Yangsibo Huang, Daogao Liu, Terra Blevins…

Python 237 27 Updated Nov 3, 2023

EcoAssistant: using LLM assistant more affordably and accurately

Python 133 8 Updated Jun 30, 2024

SILO Language Models code repository

Python 83 11 Updated Feb 23, 2024

Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M documents, 115B text tokens and 353M images.

Python 211 11 Updated Aug 28, 2024

Failure archive for ChatGPT and similar models

Python 599 24 Updated Apr 7, 2023

Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.

23 1 Updated Nov 23, 2022

Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

536 32 Updated Jun 25, 2024

Retrieval augmented diffusion from CompVis.

Jupyter Notebook 53 7 Updated Aug 20, 2022

A modular RL library to fine-tune language models to human preferences

Python 2,378 203 Updated Mar 1, 2024

Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"

Python 62 4 Updated Dec 27, 2022

Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.

Python 712 121 Updated Dec 14, 2025

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Python 1,992 472 Updated Dec 20, 2025

PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an implementation of kNN-LM and kNN-MT

Python 285 24 Updated Oct 20, 2022

Parallel data preprocessing for NLP and ML.

Python 34 2 Updated Nov 1, 2024

[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674

Python 196 13 Updated Jun 14, 2023