Skip to content
View willxxy's full-sized avatar
🐒
🐒

Organizations

@ELM-Research

Block or report willxxy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs

Python 98 8 Updated Nov 17, 2024

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…

Python 39,856 6,601 Updated Mar 21, 2026

Autonomous experiment loop extension for pi

TypeScript 2,583 120 Updated Mar 21, 2026

[VLDB' 25] ChatTS: Understanding, Chat, Reasoning about Time Series with TS-MLLM

Python 441 45 Updated Jan 12, 2026

EE-LLM is a framework for large-scale training and inference of early-exit (EE) large language models (LLMs).

Python 78 7 Updated Jun 14, 2024

Wan: Open and Advanced Large-Scale Video Generative Models

Python 15,638 2,476 Updated Mar 5, 2026

Official code for "Interpretable Language Modeling via Induction-head Ngram Models"

Python 13 1 Updated Oct 23, 2025

Forecast evaluation library

Python 154 16 Updated Mar 17, 2026

A python module to repair invalid JSON from LLMs

Python 4,602 176 Updated Mar 16, 2026

AI agents running research on single-GPU nanochat training automatically

Python 47,898 6,646 Updated Mar 21, 2026
Python 12 1 Updated Mar 10, 2026

Nanochat in MLX

6 Updated Mar 6, 2026

SleepLM: Natural-Language Intelligence for Human Sleep

Jupyter Notebook 31 4 Updated Mar 10, 2026

OSF: On Pre-training and Scaling of Sleep Foundation Models

Jupyter Notebook 21 2 Updated Mar 12, 2026

Code and website for Self-Flow: Self-Supervised Flow Matching for Scalable Multi-Modal Synthesis

Python 381 13 Updated Mar 15, 2026

[AAAI-23 Oral] Official implementation of the paper "Are Transformers Effective for Time Series Forecasting?"

Python 2,435 502 Updated Jan 27, 2024

Official implementation of the paper "Understanding Language Prior of LVLMs by Contrasting Chain-of-Embedding"

Python 16 1 Updated Sep 30, 2025

Gas Town - multi-agent workspace manager

Go 12,704 1,093 Updated Mar 21, 2026

Label Studio is a multi-type data labeling and annotation tool with standardized output format

TypeScript 26,787 3,445 Updated Mar 20, 2026
TeX 6 Updated Feb 25, 2026

A Training and Evaluation Framework for ECG-Language Models (ELMs)

Python 9 2 Updated Mar 18, 2026

A curated list of awesome Claude Skills, resources, and tools for customizing Claude AI workflows

Python 46,647 4,757 Updated Feb 19, 2026

Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"

Python 1,124 70 Updated Dec 22, 2025

Research-oriented pretraining and evaluation pipelines for ECG-specific neural networks

Python 4 1 Updated Mar 17, 2026

Image Markov Chain Monte Carlo

Python 247 37 Updated Nov 12, 2021

Build compute kernels and load them from the Hub.

Python 523 57 Updated Mar 21, 2026

Set up a specific version of NVIDIA CUDA in GitHub Actions on Linux x86_64, arm64 (Debian and Fedora based distribution) and Windows

TypeScript 6 Updated Mar 18, 2026

Provide with pre-build flash-attention package wheels on Linux and Windows platforms using GitHub Actions

Python 1,117 60 Updated Mar 21, 2026

Qwen3.5 is the large language model series developed by Qwen team, Alibaba Cloud.

2,235 116 Updated Mar 2, 2026
Next