Skip to content
View NoviScl's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@WING-NUS

Block or report NoviScl

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 102 39 Updated Feb 24, 2026

a little presentation timer thing

Python 1 Updated Oct 30, 2024

Code for the paper "Searching Privacy Risks in Multi-Agent Systems via Simulation"

Jupyter Notebook 20 2 Updated Oct 13, 2025

A multi-lingual program repair benchmark set based on the Quixey Challenge

Java 1 1 Updated Sep 3, 2025

Collection of scripts and notebooks for OpenAI's latest GPT OSS models

Jupyter Notebook 501 52 Updated Aug 25, 2025

Minimal and annotated implementations of key ideas from modern deep learning research.

Python 1,290 107 Updated Jan 29, 2026

ChatGPT Timestamp Chrome Extension

JavaScript 63 7 Updated Mar 15, 2026
Python 137 8 Updated Dec 9, 2025

The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in language modeling.

Jupyter Notebook 138 12 Updated Feb 21, 2026

ScienceMeter: Tracking Scientific Knowledge Updates in Language Models

Python 17 Updated Jun 28, 2025

Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"

Jupyter Notebook 604 54 Updated Oct 7, 2025

Official repo for Learning to Reason for Long-Form Story Generation

Python 77 10 Updated Apr 19, 2025

[NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"

Python 683 60 Updated Mar 16, 2025

Fully open reproduction of DeepSeek-R1

Python 25,965 2,417 Updated Nov 24, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,979 1,584 Updated Feb 27, 2026
Jupyter Notebook 3 Updated Jan 13, 2025

KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs)

Jupyter Notebook 885 148 Updated Mar 24, 2026

This repository contains ScholarQABench data and evaluation pipeline.

Python 145 12 Updated Aug 13, 2025

This repository contains expert evaluation interface and data evaluation script for the OpenScholar project.

HTML 39 5 Updated Nov 19, 2024

Recipes to train reward model for RLHF.

Python 1,521 108 Updated Apr 24, 2025

Resources for cultural NLP research

117 16 Updated Sep 28, 2025

ICLR dataset

Jupyter Notebook 53 12 Updated Jan 7, 2026

This repository contains the dataset for "I Can’t Reply with That": Characterizing Problematic Email Reply Suggestions

3 Updated Jan 10, 2021

Code to compute AnthroScore, a computational linguistic measure of anthropomorphism in text

Python 18 1 Updated Mar 31, 2025

Stanford NLP Python library for understanding and improving PyTorch models via interventions

Python 868 101 Updated Mar 6, 2026

GPT4 based personalized ArXiv paper assistant bot

Python 546 144 Updated Mar 26, 2024

Repository for materials of the HAI Diversity Paper

Python 8 Updated Aug 31, 2023
Next