Stars
mesolitica / UniCodec-fix
Forked from Jiang-Yidi/UniCodec
[ACL 2025 Main] UniCodec: a unified audio codec with a single codebook to support multi-domain audio data, including speech, music, and sound
theblackcat102 / ievals
Forked from iKala/ievals
Official GitHub repo for TMMLU+, a large-scale Traditional Chinese massive multitask language understanding benchmark
deepspeedai / Megatron-DeepSpeed
Forked from NVIDIA/Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
ericsunkuan / TalkNet-ASD
Forked from TaoRuijie/TalkNet-ASD
ACM MM 2021: 'Is Someone Speaking? Exploring Long-term Temporal Features for Audio-visual Active Speaker Detection'
A collection of prompts, system prompts, and LLM instructions
Evaluation of embedding models using Traditional Chinese datasets
George0828Zhang / stable-ts
Forked from jianfch/stable-ts
Transcription, forced alignment, and audio indexing with OpenAI's Whisper
hbwu-ntu / speech-trident
Forked from ga642381/speech-trident
Awesome speech/audio LLMs, representation learning, and codec models
Megatron-LM setup in the smol-cluster
CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. [Based on PyTor…
serp-ai / bark-with-voice-clone
Forked from suno-ai/bark
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
Python tools for processing the stackexchange data dumps into a text dataset for Language Models
voidful / paperCrawler
Forked from paulpeng-popo/paperCrawler
A crawler for https://ndltd.ncl.edu.tw
cimeister / typical-sampling
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
MTG / Podcastmix
Forked from nschmidtg/Podcastmix
PodcastMix: a dataset for separating music and speech in podcasts.
Blealtan / RWKV-LM-LoRA
Forked from BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, …
Streamlit apps that leverage the power of spaCy to assist language learning
APCLab / jieba-tw
Forked from ldkrsi/jieba-zh_TW
Jieba Chinese word segmentation, Taiwan Traditional Chinese version
facebookresearch / data2vec_vision
Forked from microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
ouhenio / stylegan3-projector
Forked from NVlabs/stylegan3
StyleGAN3 + Inversion
Jack000 / DALLE-pytorch
Forked from lucidrains/DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in PyTorch
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
debauchee / barrier
Forked from deskflow/deskflow
Open-source KVM software
Erfaniaa / text-to-commit-history
Forked from gelstudios/gitfiti
Write large text on your GitHub profile using your commit history (contribution graph).
Pre-trained ELECTRA from Hong Kong data