FAPLM: A Drop-in Efficient PyTorch Implementation of Protein Language Models

Python 151 12 Updated Jul 30, 2025

Fast and memory-efficient exact attention

Python 19,852 2,044 Updated Oct 8, 2025

Awesome-LLM: a curated list of Large Language Models

25,228 2,130 Updated Jul 31, 2025
Jupyter Notebook 4 Updated Feb 11, 2023

A PyTorch Lightning solution to training OpenAI's CLIP from scratch.

Python 712 85 Updated Apr 15, 2022

NanoGPT (124M) in 3 minutes

Python 3,174 441 Updated Jul 17, 2025

Discovering Interpretable Features in Protein Language Models via Sparse Autoencoders

Python 230 30 Updated Oct 2, 2025

A framework for few-shot evaluation of language models.

Python 10,308 2,771 Updated Oct 9, 2025

Data repository for "Fine-tuning protein language models boosts predictions across diverse tasks"

Jupyter Notebook 47 7 Updated Aug 28, 2024

CLEAN: a contrastive learning model for high-quality functional prediction of proteins

Python 285 58 Updated Apr 6, 2025

Gene cluster comparison figure generator

Python 615 73 Updated Nov 25, 2024

Bioinformatics'2020: BioBERT: a pre-trained biomedical language representation model for biomedical text mining

Python 2,149 478 Updated Aug 13, 2023

Neural Networks: Zero to Hero

Jupyter Notebook 17,873 2,449 Updated Aug 18, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 44,929 7,655 Updated Dec 9, 2024

LLM training in simple, raw C/CUDA

Cuda 27,780 3,211 Updated Jun 26, 2025

Video+code lecture on building nanoGPT from scratch

Python 4,420 697 Updated Aug 13, 2024

My Solutions to the Assignments of CS234 (Stanford / Fall 2019)

Python 15 6 Updated Sep 3, 2020

My Solutions to the Assignments of CS231n in Winter 2016

Jupyter Notebook 72 13 Updated May 9, 2019

Shortest solutions for CS231n 2021-2025

Jupyter Notebook 399 76 Updated Sep 26, 2025

Predict the function of phage hypothetical proteins using an LSTM model trained with Phage Synteny

PureBasic 54 3 Updated Jun 4, 2025

fast phage annotation program

Python 179 25 Updated Oct 7, 2025

Phage Annotation using Protein Structures

Python 128 10 Updated Sep 16, 2025

Making Protein folding accessible to all!

Jupyter Notebook 2,456 641 Updated Sep 18, 2025

Bilingual Language Model for Protein Sequence and Structure

Jupyter Notebook 272 29 Updated Jan 2, 2025

Foldseek enables fast and sensitive comparisons of large structure sets.

C 1,076 133 Updated Sep 4, 2025

⚙️ SpikeHunter: A Deep Learning Tool for Identifying Phage Tailspike Proteins

Python 5 1 Updated Oct 7, 2024

ProtTrans provides state-of-the-art pretrained language models for proteins. ProtTrans was trained on thousands of GPUs from Summit and hundreds of Google TPUs using Transformer models.

Jupyter Notebook 1,259 163 Updated May 22, 2025

Detection of phage RBPs based on protein domains and machine learning

Jupyter Notebook 25 3 Updated Feb 5, 2025

GFF/GTF utility providing format conversions, region filtering, FASTA sequence extraction and more

C++ 440 42 Updated Dec 26, 2024

MIT Deep Learning Book in PDF format (complete and parts) by Ian Goodfellow, Yoshua Bengio and Aaron Courville

Java 13,639 2,852 Updated Oct 9, 2023