Stars
Stanford NLP Python library for understanding and improving PyTorch models via interventions
💪 Models' quality and performance metrics (R2, ICC, LOO, AIC, BF, ...)
Materials for my Mixed Model Workshop
OLMost every training recipe you need to perform data interventions with the OLMo family of models.
Attribute (or cite) statements generated by LLMs back to in-context information.
verl: Volcano Engine Reinforcement Learning for LLMs
ICLR 2025 Workshop & CHI 2025 SIG: "Bidirectional Human-AI Alignment"
EMNLP 2025 Two Papers - Value-Action Gap in LLMs (Main Track); ValueCompass (WiNLP Workshop)
(EMNLP 2025) Should I Share this Translation? Evaluating Quality Feedback for User Reliance on Machine Translation
GraphicBench: A Planning Benchmark for Graphic Design Generation with Language Agents
What would you do with 1000 H100s...
⚔️ Official code of "Search Arena: Analyzing Search-Augmented LLMs".
Skywork series models are pre-trained on 3.2TB of high-quality multilingual (mainly Chinese and English) and code data. We have open-sourced the model, training data, evaluation data, evaluation me…
Replication data and code for the paper: When LLMs are Reliable for Judging Empathic Communication
Micromodels -- A framework for accurate, explainable, data efficient, and reusable NLP models.
[ICML 2025] HypotheSAEs: Hypothesizing interpretable relationships in text datasets using sparse autoencoders. https://arxiv.org/abs/2502.04382
Code, data, and models for the ICWSM 2023 paper "Bridging Nations: Quantifying the Role of Multilinguals in Communication on Social Media"
Topic modeling on 100,000 r/AmItheAsshole threads.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
This repository contains published parts of test cross-lingual calls, collected and annotated as part of the InCroMin project (an FSTP under the EU project UTTER).
Code for the arXiv preprint "Answer, Assemble, Ace: Understanding How Transformers Answer Multiple Choice Questions"
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
Code for the ICML 2025 paper "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models"