Stars
A library for efficient similarity search and clustering of dense vectors.
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators
Cramming the training of a (BERT-type) language model into limited compute.
[CCS'24] A dataset of 15,140 ChatGPT prompts collected from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).
RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), combining the best of RNNs and transformers: great performance, fast inference,…
Social media scraping / data collection library for Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs
pytorch-tpu / fairseq
Forked from facebookresearch/fairseq. Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
[NeurIPS 2021] COCO-LM: Correcting and Contrasting Text Sequences for Language Model Pretraining
Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023)
Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.
This is the repository of the EMNLP 2021 paper "BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translation".
A guidance language for controlling large language models.
Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For latest Indic-BERT v2, check: https://github.com/AI4Bharat/IndicBERT
JARVIS, a system that connects LLMs with the ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Code for the EMNLP 2020 paper "Accurate Word Alignment Induction from Neural Machine Translation"
Source code for Twitter's Recommendation Algorithm
FinGPT: Open-Source Financial Large Language Models. 🔥 The trained models are released on HuggingFace.
Korean BERT pre-trained cased (KoBERT)