-
arena-hard-auto Public
Forked from PlusRoss/arena-hard-autoArena-Hard-Auto: An automatic LLM benchmark.
Jupyter Notebook Apache License 2.0 UpdatedAug 2, 2024 -
vllm Public
Forked from xiaoxiawu-microsoft/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedJul 1, 2024 -
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harnessA framework for few-shot evaluation of language models.
Python MIT License UpdatedJun 15, 2024 -
task-aware-distillation Public
Less is More: Task-aware Layer-wise Distillation for Language Model Compression (ICML2023)
-
CAMERO Public
CAMERO: Consistency Regularized Ensemble of Perturbed Language Models with Weight Sharing (ACL 2022)
-
SAGE Public
No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)
-
Token-wise Curriculum Learning for Neural Machine Translation
-
-
Super Tickets in Pre-Trained Language Models: From Model Compression to Improving Generalization (ACL 2021)
-
BOND Public
BOND: BERT-Assisted Open-Domain Name Entity Recognition with Distant Supervision
-
tutorials Public
Forked from pytorch/tutorialsPyTorch tutorials.
Jupyter Notebook BSD 3-Clause "New" or "Revised" License UpdatedMay 13, 2020 -
fairseq Public
Forked from facebookresearch/fairseqFacebook AI Research Sequence-to-Sequence Toolkit written in Python.
Python MIT License UpdatedOct 26, 2019 -
Jenga_DQN_player Public
DQN Implementation of Multi-level Jenga Game With Internal Physical Env Simulation
Python UpdatedApr 26, 2019 -
-
simplified version of adclick fraud detection with visualization build-in based on Kaggle's talkingdata challenge
-
OpenSeq2Seq Public
Forked from NVIDIA/OpenSeq2SeqToolkit for efficient experimentation with various sequence-to-sequence models
Python MIT License UpdatedAug 25, 2018 -
tensor2tensor Public
Forked from tensorflow/tensor2tensorLibrary of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Jupyter Notebook Apache License 2.0 UpdatedMay 30, 2018 -
-
tensorflow Public
Forked from tensorflow/tensorflowComputation using data flow graphs for scalable machine learning
C++ Apache License 2.0 UpdatedApr 18, 2018 -
Python MIT License Updated
Apr 12, 2018 -
models Public
Forked from tensorflow/modelsModels and examples built with TensorFlow
Python Apache License 2.0 UpdatedOct 30, 2017