Skip to content
View willxxy's full-sized avatar
🐒
🐒

Organizations

@ELM-Research

Block or report willxxy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs

Python 98 8 Updated Nov 17, 2024

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…

Python 40,082 6,656 Updated Mar 23, 2026

Autonomous experiment loop extension for pi

TypeScript 2,822 137 Updated Mar 21, 2026

[VLDB' 25] ChatTS: Understanding, Chat, Reasoning about Time Series with TS-MLLM

Python 443 45 Updated Jan 12, 2026

EE-LLM is a framework for large-scale training and inference of early-exit (EE) large language models (LLMs).

Python 78 7 Updated Jun 14, 2024

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,650 2,484 Updated Mar 5, 2026

Official code for "Interpretable Language Modeling via Induction-head Ngram Models"

Python 13 1 Updated Oct 23, 2025

Forecast evaluation library

Python 154 16 Updated Mar 17, 2026

A python module to repair invalid JSON from LLMs

Python 4,606 176 Updated Mar 16, 2026

AI agents running research on single-GPU nanochat training automatically

Python 52,039 7,257 Updated Mar 21, 2026
Python 12 1 Updated Mar 10, 2026

Nanochat in MLX

6 Updated Mar 6, 2026

SleepLM: Natural-Language Intelligence for Human Sleep

Jupyter Notebook 32 4 Updated Mar 10, 2026

OSF: On Pre-training and Scaling of Sleep Foundation Models

Jupyter Notebook 21 2 Updated Mar 12, 2026

Code and website for Self-Flow: Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis

Python 388 13 Updated Mar 15, 2026

[AAAI-23 Oral] Official implementation of the paper "Are Transformers Effective for Time Series Forecasting?"

Python 2,436 502 Updated Jan 27, 2024

Official implementation of the paper "Understanding Language Prior of LVLMs by Contrasting Chain-of-Embedding"

Python 16 1 Updated Sep 30, 2025

Gas Town - multi-agent workspace manager

Go 12,827 1,114 Updated Mar 23, 2026

Label Studio is a multi-type data labeling and annotation tool with standardized output format

TypeScript 26,809 3,447 Updated Mar 23, 2026
TeX 6 Updated Feb 25, 2026

A Training and Evaluation Framework for ECG-Language Models (ELMs)

Python 9 2 Updated Mar 18, 2026

A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows

Python 47,221 4,855 Updated Feb 19, 2026

Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"

Python 1,128 70 Updated Dec 22, 2025

Research-oriented pretraining and evaluation pipelines for ECG-specific neural networks

Python 4 1 Updated Mar 17, 2026

Image Markov Chain Monte Carlo

Python 247 37 Updated Nov 12, 2021

Build compute kernels and load them from the Hub.

Python 530 57 Updated Mar 23, 2026

Set up a specific version of NVIDIA CUDA in GitHub Actions on Linux x86_64, arm64 (Debian and Fedora based distribution) and Windows

TypeScript 6 Updated Mar 18, 2026

Provide with pre-build flash-attention package wheels on Linux and Windows platforms using GitHub Actions

Python 1,127 60 Updated Mar 23, 2026

Qwen3.5 is the large language model series developed by Qwen team, Alibaba Cloud.

2,254 120 Updated Mar 2, 2026
Next